Illinois Data Bank
Log in with NetID
University Library, University of Illinois at Urbana-Champaign
Illinois Data Bank
Log in with NetID
25 per page
50 per page
Displaying datasets 26 - 50 of 478 in total
Generate Report from Search Results
Life Sciences (254)
Social Sciences (114)
Physical Sciences (68)
Technology and Engineering (38)
Arts and Humanities (1)
U.S. National Science Foundation (NSF) (139)
U.S. National Institutes of Health (NIH) (49)
U.S. Department of Energy (DOE) (42)
U.S. Department of Agriculture (USDA) (23)
Illinois Department of Natural Resources (IDNR) (10)
U.S. National Aeronautics and Space Administration (NASA) (5)
U.S. Geological Survey (USGS) (5)
Illinois Department of Transportation (IDOT) (1)
U.S. Army (1)
CC BY (186)
Winogradoff, David; Chou, Han-Yi; Maffeo, Christopher; Aksimentiev, Aleksei (2022): Trajectory files for "Percolation transition prescribes protein size-specific barrier to passive transport through the nuclear pore complex.". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5581194_V1
Nuclear pore complex; system files; trajectory files
Rahlin, Anastasia; Saunders, Sarah; Beilke, Stephanie (2022): Spatial drivers of wetland bird occupancy within an urbanized matrix in the Upper Midwestern United States. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1575830_V1
Dataset for "Spatial drivers of wetland bird occupancy within an urbanized matrix in the Upper Midwestern United States" manuscript contains occupancy data for ten wetland bird species used in single-species occupancy models at four spatial scales and four wetland habitat types. Data were collected from 2017-2019 in NE Illinois and NW Indiana. Dataset includes wetland bird occupancy data, habitat parameter values for each survey location, and R code used to run analyses.
wetland birds; occupancy; emergent wetland; urbanization; Great Lakes region
Fu, Yuanxi; Hsiao, Tzu-Kun; Joshi, Manasi Ballal (2022): The Salt Controversy Systematic Review Reports and Primary Study Reports Network Dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6128763_V1
The salt controversy is the public health debate about whether a population-level salt reduction is beneficial. This dataset covers 82 publications--14 systematic review reports (SRRs) and 68 primary study reports (PSRs)--addressing the effect of sodium intake on cerebrocardiovascular disease or mortality. These present a snapshot of the status of the salt controversy as of September 2014 according to previous work by epidemiologists: The reports and their opinion classification (for, against, and inconclusive) were from Trinquart et al. (2016) (<a href="https://doi.org/10.1093/ije/dyv184">Trinquart, L., Johns, D. M., & Galea, S. (2016). Why do we think we know what we know? A metaknowledge analysis of the salt controversy. International Journal of Epidemiology, 45(1), 251–260. https://doi.org/10.1093/ije/dyv184</a>), which collected 68 PSRs, 14 SRRs, 11 clinical guideline reports, and 176 comments, letters, or narrative reviews. Note that our dataset covers only the 68 PSRs and 14 SRRs from Trinquart et al. 2016, not the other types of publications, and it adds additional information noted below. This dataset can be used to construct the inclusion network and the co-author network of the 14 SRRs and 68 PSRs. A PSR is "included" in an SRR if it is considered in the SRR's evidence synthesis. Each included PSR is cited in the SRR, but not all references cited in an SRR are included in the evidence synthesis or PSRs. Based on which PSRs are included in which SRRs, we can construct the inclusion network. The inclusion network is a bipartite network with two types of nodes: one type represents SRRs, and the other represents PSRs. In an inclusion network, if an SRR includes a PSR, there is a directed edge from the SRR to the PSR. The attribute file (report_list.csv) includes attributes of the 82 reports, and the edge list file (inclusion_net_edges.csv) contains the edge list of the inclusion network. Notably, 11 PSRs have never been included in any SRR in the dataset. They are unused PSRs. If visualized with the inclusion network, they will appear as isolated nodes. We used a custom-made workflow (Fu, Y. (2022). Scopus author info tool (1.0.1) [Python]. <a href="https://github.com/infoqualitylab/Scopus_author_info_collection">https://github.com/infoqualitylab/Scopus_author_info_collection</a> ) that uses the Scopus API and manual work to extract and disambiguate authorship information for the 82 reports. The author information file (salt_cont_author.csv) is the product of this workflow and can be used to compute the co-author network of the 82 reports. We also provide several other files in this dataset. We collected inclusion criteria (the criteria that make a PSR eligible to be included in an SRR) and recorded them in the file systematic_review_inclusion_criteria.csv. We provide a file (potential_inclusion_link.csv) recording whether a given PSR had been published as of the search date of a given SRR, which makes the PSR potentially eligible for inclusion in the SRR. We also provide a bibliography of the 82 publications (supplementary_reference_list.pdf). Lastly, we discovered minor discrepancies between the inclusion relationships identified by Trinquart et al. (2016) and by us. Therefore, we prepared an additional edge list (inclusion_net_edges_trinquart.csv) to preserve the inclusion relationships identified by Trinquart et al. (2016).
systematic reviews; evidence synthesis; network analysis; public health; salt controversy;
Haselhorst, Derek; Moreno, J. Enrique; Tcheng, David K.; Punyasena, Surangi W. (2022): Images and annotated counts for aerial pollen samples from the Barro Colorado Island megaplot, Panama (1994 – 2010). University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2176715_V1
This dataset includes images and annotated counts for 150 airborne pollen samples from the Center for Tropical Forest Science 50 ha forest dynamics plot on Barro Colorado Island, Panama. Samples were collected once a year from April 1994 to June 2010.
aerial pollen traps; automated pollen identification; Barro Colorado Island; convolutional neural networks; Neotropics; palynology; phenology
Androwski, Rebecca; Asad, Nadeem; Wood, Janet; Hofer, Allison; Locke, Steven; Smith, Cassandra; Rose, Becky; Schroeder, Nathan (2020): Data From: Mutually exclusive dendritic arbors in C. elegans neurons share a common architecture and convergent molecular cues. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0818023_V1
This dataset includes neuronal development of the Caenorhabditis elegans dauer and adult.
Nematode; Dendrite; Stress
Rapti, Zoi; Clifton, Sara (2022): Code for "Phage-antibiotic synergy inhibited by temperate and chronic virus competition". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9460305_V1
Matlab codes for the article "Phage-antibiotic synergy inhibited by temperate and chronic virus competition". Code can be used to reproduce the article figures, perform the parameter sensitivity analysis and simulate the model.
bacterium-phage-antibiotic model; ODEs; Matlab; sensitivity analysis
Pan, Chao; Tabatabaei, S Kasra; Tabatabaei Yazdi, S. M. Hossein; Hernandez, Alvaro; Schroeder, Charles; Milenkovic, Olgica (2021): Rewritable Two-Dimensional DNA-Based Data Storage System (2DDNA) Sequencing Dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2308557_V1
This dataset contains sequencing data obtained from Illumina MiSeq device to prove the concept of the proposed 2DDNA framework. Please refer to README.txt for detailed description of each file.
machine learning;image processing;computer vision;rewritable storage system;2D DNA-based data storage
Rapti, Zoi; Rivera Quinones, Vanessa; Stewart Merrill, Tara (2022): Codes for the analysis of an eco-immunological disease-transmission mathematical model. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1686840_V1
MATLAB files for the analysis of an ODE model for disease transmission. The codes may be used to find equilibrium points, study transient dynamics, evaluate the basic reproductive number (R0), and simulate the model when parameters depend on the independent variables. In addition, the codes may be used to perform local sensitivity analysis of R0 on the model parameters.
Lee, Sangjun; Huang, Edwin W.; Johnson, Thomas A.; Guo, Xuefei; Husain, Ali A.; Mitrano, Matteo; Lu, Kannan; Zakrzewski, Alexander V.; de la Pena, Gilberto A.; Peng, Yingying; Huang, Hai; Lee, Sang-Jun; Jang, Hoyoung; Lee, Jun-Sik; Joe, Young Il; Doriese, William B.; Szypryt, Paul; Swetz, Daniel S.; Chi, Songxue; Aczel, Adam A.; MacDougall, Gregory J.; Kivelson, Steven A. ; Fradkin, Eduardo; Abbamonte, Peter (2022): Data for "Generic character of charge and spin density waves in superconducting cuprates". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1757317_V1
Data for "Generic character of charge and spin density waves in superconducting cuprates". - Neutron scattering data for SDW - RSXS scans of CDW of LESCO x=0.10, 0.125, 0.15, 0.17, 0.20 at various temperatures. - Temperature dependence of CDW peak intensity, correlation length, Qcdw (Lorentzian fit, S(q,T) fit, Landau-Ginzburg fit) - XAS data of LESCO x=0.10, 0.125, 0.15, 0.17, 0.20
Crawford, Reed D.; Dodd, Luke E.; Tillman, Frank E.; O'Keefe, Joy M. (2022): Data for Evaluating bat boxes: Design and placement alter bioenergetic costs and overheating risk. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3592866_V1
This dataset contains our bi-hourly temperature recordings from 40 rocket box style artificial roosts of 5 designs deployed in Indiana and Kentucky, USA from April through September 2019. This dataset also includes our endothermic and faculatively heterothermic daily energy expenditure datasets used in our bioenergetic analysis, which were calculated from the bi-hourly rocket box temperature data. Lastly, we include our overheating counts dataset which summarizes daily overheating events (i.e., temperatures > 40 Celsius) in each rocket box style bat box over the course of the study period, these daily summaries were also calculated from the bi-hourly rocket box temperature recordings.
artificial roost; bat box; microcllimate; temperature
Kim, Hyunbin; Makhnenko, Roman (2022): Data on "Evaluation of CO2 sealing potential of heterogeneous Eau Claire shale". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5509498_V1
This dataset is provided to support the statements in Kim, H., and R.Y. Makhnenko. 2022. "Evaluation of CO2 sealing potential of heterogeneous Eau Claire shale". Journal of the Geological Society. In geologic carbon dioxide (CO2) storage in deep saline aquifers, buoyant CO2 tends to float upwards in the reservoirs overlaid by low permeable formations called caprocks. Caprocks should serve as barriers to potential CO2 leakage that can happen through a diffusion loss and permeation through faults, fractures, or pore spaces. The leakage through intact caprock would mainly depend on its permeability and CO2 breakthrough pressure, and is affected by the heterogeneities in the material. Here, we study the sealing potential of a caprock from Illinois Basin - Eau Claire shale, with sandy and shaly fractions distinguished via electron microscopy and grain/pore size and surface area characterization. The direct measurements of permeability of sandy shale provides the values ~ 10-15 m2, while clayey specimens are three orders of magnitude less permeable. The CO2 breakthrough pressure under in-situ stress conditions is 0.1 MPa for the sandy shale and 0.4 MPa for the clayey counterpart – these values are higher than those predicted by the porosimetry methods performed on the unconfined specimens. Sandy Eau Claire shale would allow penetration of large CO2 volumes at low overpressures, while the clayey formation can serve as a caprock in the absence of faults and fractures in it.
Geologic carbon storage; Caprock; Shale; CO2 breakthrough pressure; Porosimetry.
Wedell, Eleanor; Warnow, Tandy (2022): Biological and Simulated datasets for testing the SCAMPP framework for phylogenetic placement methods. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9257957_V1
Thank you for using these datasets! These files contain trees and reference alignments, as well as the selected query sequences for testing phylogenetic placement methods against and within the SCAMPP framework. There are four datasets from three different sources, each containing their source alignment and "true" tree, any estimated trees that may have been generated, and any re-estimated branch lengths that were created to be used with their requisite phylogenetic placement method. Three biological datasets (16S.B.ALL, PEWO/LTP_s128_SSU, and PEWO/green85) and one simulated dataset (nt78) is contained. See README.txt in each file for more information.
Phylogenetic Placement; Phylogenetics; Maximum Likelihood; pplacer; EPA-ng
Yan, Bin; Dietrich, Christopher; Yu, Xiaofei; Dai, Renhuai; Maofa, Yang (2022): Data files for phylogenetic analysis of Typhlocybinae (Hemiptera: Cicadellidae). University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8761342_V1
The files are plain text and contain the original data used in phylogenetic analyses of of Typhlocybinae (Bin, Dietrich, Yu, Meng, Dai and Yang 2022: Ecology & Evolution, in press). The three files with extension .phy are text files with aligned DNA sequences in the standard PHYLIP format and correspond to Matrix 1 (amino acid alignment), Matrix 2 (nucleotide alignment of first two codon positions of protein-coding genes) and Matrix 3 (nucleotide alignment of protein-coding genes plus 2 ribosomal genes) described in the Methods section. An additional text file in NEXUS format (.nex extension) contains the morphological character data used in the ancestral state reconstruction (ASCR) analysis described in the Methods. NEXUS is a standard format used by various phylogenetic analysis software. For more information on data file content, see the included "readme" files.
Hemiptera; phylogeny; mitochondrial genome; morphology; leafhopper
Trivellone, Valeria; Cao, Yanghui; Dietrich, Christopher (2022): Alternative approaches for uncovering phytoplasma biodiversity and vector associations. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9804959_V1
This dataset contains nucleotide sequences of 16S rRNA gene from phytoplasmas and other bacteria detected in phloem-feeding insects (Hemiptera, Auchenorrhyncha). The datasets were used to compare traditional Sanger sequencing with a next-generation sequencing method, Anchored Hybrid Enrichment (AHE) for detecting and characterizing phytoplasmas in insect DNA samples. The file “Trivellone_etal_SangerSequencing.fas”, comprising 1397 positions (the longest sequence), includes 35 not aligned bacterial 16S rRNA sequences (16 phytoplasmas and 19 other bacterial strains) yielded using Sanger sequencing. The file “Trivellone_etal_AHEmethod1.fas” includes 34 not aligned bacterial 16S rRNA sequences (28 phytoplasmas and 6 other bacterial strains) and it contains 1530 positions (the longest sequence). Each sequence was assembled using assembled based on ABySS v2.1.0 pipeline. The file “Trivellone_etal_AHEmethod2.fas” includes 31 not aligned bacterial 16S rRNA sequences (27 phytoplasmas and 4 other bacterial strains) and it contains 1530 positions (the longest sequence). Each sequence was assembled based on the HybPiper v2.0.1 pipeline . Additional details in the "read_me_trivellone.txt" file attached below.
anchored hybrid enrichment; biodiversity, biorepository; nested PCR; Sanger sequencing
has sharing link
Wong, Tony; Oudshoorn, Luuk; Sofovich, Eliyahu; Green, Alex; Shah, Charmi; Indebetouw, Remy; Meixner, Margaret; Hacar, Alvaro; Nayak, Omnarayani; Tokuda, Kazuki; Bolatto, Alberto D.; Chevance, Melanie; De Marchi, Guido; Fukui, Yasuo; Hirschauer, Alec S.; Jameson, K. E.; Kalari, Venu; Lebouteiller, Vianney; Looney, Leslie W.; Madden, Suzanne C.; Onishi, Toshikazu; Roman-Duval, Julia; Rubio, Monica; Tielens, A. G. G. M. (2022): Data for: The 30 Doradus Molecular Cloud at 0.4 pc Resolution with ALMA: Physical Properties and the Boundedness of CO-emitting Structures. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1671495_V1
12CO and 13CO emission maps of the 30 Doradus molecular cloud in the Large Magellanic Cloud, obtained with the Atacama Large Millimeter/submillimeter Array (ALMA) during Cycle 7. See the associated article in the Astrophysical Journal, and README file, for details. Please cite the article if you use these data.
Xia, Yushu; Wander, Michelle; Kwon, Hoyoung (2021): County-level Data of Nitrogen Fertilizer and Manure Inputs for Corn Production in the United States. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3112432_V1
This dataset includes five files developed using the procedures described in the article 'Developing County-level Data of Nitrogen Fertilizer and Manure Inputs for Corn Production in the United States' and Supplemental Information published in the Journal of Cleaner Production in 2021. Citation: Xia, Yushu, Hoyoung Kwon, and Michelle Wander. "Developing county-level data of nitrogen fertilizer and manure inputs for corn production in the United States." Journal of Cleaner Production 309 (2021): e126957. Brief method: The fertilizer and manure inputs for corn were generated with a top-down approach by assigning county-level total N inputs reported by USGS to different crops using state- and county-level survey data. The corn N needs were estimated using empirical extension-based equations coupled with soil and environmental covariates. The estimates of fertilizer N inputs were further refined for corn grain and silage production at the county level and gap-filling (using state-level averages) was carried out to generate final files for U.S. county-level N inputs. The dataset is provided in an alternative format in Google Earth Engine: https://code.earthengine.google.com/13a0078e7ee727bc001e045ad0e8c6fc
Corn; Nitrogen Fertilizer; Manure; Conterminous U.S.
Chu, Gillian; Warnow, Tandy (2022): RNASim-VS2. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8812049_V1
Provides RNASim-VS2 datasets used in Gillian's Master's thesis.
Southey, Bruce; Rodriguez-Zas, Sandra L. (2022): Data for changes in neuropeptide prohormone genes among Cetartio-dactyla livestock and wild species associated with evolution and domestication. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2071917_V1
This dataset contain information for the paper "Changes in neuropeptide prohormone genes among Cetartio-dactyla livestock and wild species associated with evolution and domestication" Veterinary Sciences, MDPI. Protein sequences were predicted using GeneWise for 98 neuropeptide prohormone genes from publicly available genomes of 118 Cetartiodactyla species. All predictions (CetartiodactylaSequences2022.zip) were manually verified. Sequences were aligned within each prohormone using MAFFT (MDPImultalign2022.zip includes multiple sequence alignment of all species available for each prohormone). Phylogenetic gene trees were constructed using PhyML and the species tree was constructed using ASTRAL (MDPItree2022.zip). The data is released under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0).
prohormone; neuropeptide; Cetartiodactyla; Cetartiodactyla; phylogenetics; gene tree; species tree
Madhavan, Vidya; Aishwarya, Anuva (2022): Data for Long-lifetime spin excitations near domain walls in 1T-TaS2. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0883774_V1
The data files are for the paper entitled: Long-lifetime spin excitations near domain walls in 1T-TaS2 to be published in PNAS. The data was obtained on a 300 mK custom designed Unisoku scanning tunneling microscope using the Nanonis module. All the data files have been named based on the Figure numbers that they represent.
Mott Insulator; Spins; Charge Density Wave; Domain walls; Long lifetime
Clem, Scott; Hobson, Keith; Harmon-Threatt, Alexandra (2022): Raw data and code for the paper: Do Nearctic hover flies (Diptera: Syrphidae) engage in long-distance migration? An assessment of evidence and mechanisms. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6560908_V1
This dataset is for the publication "Do Nearctic hover flies (Diptera: Syrphidae) engage in long-distance migration? An assessment of evidence and mechanisms." It consists of 11 Excel spreadsheets and 4 R scripts which correspond to the analyses which were conducted. Paper abstract: Long-distance insect migration is poorly understood despite its tremendous ecological and economic importance. As a group, Nearctic hover flies (Diptera: Syrphidae: Syrphinae), which are crucial pollinators as adults and biological control agents as larvae, are almost entirely unrecognized as migratory despite examples of highly migratory behavior among several Palearctic species. Here, we examined evidence and mechanisms of migration for four hover fly species (Allograpta obliqua, Eupeodes americanus, Syrphus rectus, and Syrphus ribesii) common throughout eastern North America using stable hydrogen isotope (δ2H) measurements of chitinous tissue, morphological assessments, abundance estimations, and cold-tolerance assays. While further studies are needed, non-local isotopic values obtained from hover fly specimens collected in central Illinois support the existence of long-distance fall migratory behavior in Eu. americanus, and to a lesser extent S. ribesii and S. rectus. Elevated abundance of Eu. americanus during the expected autumn migratory period further supports the existence of such behavior. Moreover, high phenotypic plasticity of morphology associated with dispersal coupled with significant differences between local and non-local specimens suggest that Eu. americanus exhibits a unique suite of morphological traits that decrease costs associated with long-distance flight. Finally, compared to the ostensibly non-migratory A. obliqua, Eu. americanus was less cold tolerant, a factor that may be associated with migratory behavior. Collectively, our findings imply that fall migration occurs in Nearctic hover flies, but we consider methodological limitations of our study in addition to potential ecological and economic consequences of these novel findings.
Insect migration; hover fly; Syrphidae; stable isotopes; deuterium; morphometrics; cold tolerance
Seyfried, Georgia S.; Midgley, Meghan G.; Phillips, Richard; Yang, Wendy (2022): Data for Refining the role of nitrogen mineralization in mycorrhizal nutrient syndromes. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5586647_V1
This dataset includes data on soil properties, soil N pools, and soil N fluxes presented in the manuscript, "Refining the role of nitrogen mineralization in mycorrhizal nutrient syndromes". Please refer to that publication for details about methodologies used to generate these data and for the experimental design.
Nitrogen cycling; Ectomycorrhizal fungi; Arbuscular mycorrhizal fungi; Nitrogen fertilization; Gross mineralization
Getahun, Elias; Zavelle, Atticus; Keefer, Laura (2022): Illinois Coastal Zone Water Quality Database (ICoastalDB) . University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7799136_V1
ICoastalDB, which was developed using Microsoft structured query language (SQL) Server, consists of water quality and related data in the Illinois coastal zone that were collected by various organizations. The information in the dataset includes, but is not limited to, sample data type, method of data sampling, location, time and date of sampling and data units.
Illinois Coastal Zone; Water Quality Data
Andrade, Flavia (2022): Data for A biopsychosocial examination of chronic back pain, limitations on usual activities, and treatment in Brazil, 2019. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2309514_V1
This dataset was created based on the publicly available microdata from PNS-2019, a national health survey conducted by the Instituto Brasileiro de Geografia e Estatistica (IBGE, Brazilian Institute of Geography and Statistics). IBGE is a federal agency responsible for the official collection of statistical information in Brazil – essentially, the Brazilian census bureau. Data on selected variables focusing on biopsychosocial domains related to pain prevalence, limitations and treatment are available. The Fundação Instituto Oswaldo Cruz has detailed information about the PNS, including questionnaires, survey design, and datasets (www.pns.fiocruz.br). The microdata can be found on the IBGE website (https://www.ibge.gov.br/estatisticas/downloads-estatisticas.html?caminho=PNS/2019/Microdados/Dados).
back pain; health status disparities; biopsychosocial; Brazil
Zinnen, Jack (2022): Data for: Functional traits and responses to nutrient and mycorrhizal addition are inconsistently related to wetland plant species’ coefficients of conservatism. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6489931_V1
This is the core data for Zinnen et al., "Functional traits and responses to nutrient and mycorrhizal addition are inconsistently related to wetland plant species’ coefficients of conservatism." This is submitted to Wetlands Ecology and Management. Two datasets are submitted here. The first is greenhouse-collected data of 9 plant traits and concurrent treatment responses of Illinois wetland plant species. The second are field-collected leaf trait data of Illinois wetland plant species. These data are analyzed in the paper. Please refer to the main manuscript to see how these data were produced and specific analyses.
ecological indicators; Floristic Quality Assessment; Floristic Quality Index; wetland degradation
Saleh, Ehsan; Ghaffari, Saba; Forsyth, David; Yu-Xiong, Wang (2022): Dataset for On the Importance of Firth Bias Reduction in Few-Shot Classification. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1016367_V1
This data repository includes the features and the trained backbone parameters used in the ICLR 2022 Paper "On the Importance of Firth Bias Reduction in Few-Shot Classification". The code accompanying this data is open-source and available at https://github.com/ehsansaleh/firth_bias_reduction The code and the data have three modules: 1. The "code_firth" module (10 files) relates to the basic ResNet backbones and logistic classifiers (e.g., Figures 2 and 3 in the main paper). 2. The "code_s2m2rf" module (2 files) relates to the S2M2R feature backbones and cosine classifiers (e.g., Figure 4 in the main paper). 3. The "code_dcf" module (3 files) relates to the few-shot Distribution Calibration (DC) method (e.g., Table 1 in the main paper). The relevant files for each module have the module name as a prefix in their name. 1. For instance, the "code_dcf_features.tar" file should be placed at the "features" directory of the "code_dcf" module. 2. As another example, "code_firth_features_cifarfs_novel.tar" should be placed in the "features" directory of the "code_firth" module, and it includes the features extracted from the novel split of mini-ImageNet dataset. Each tar-ball should be extracted in its relevant directory, and the md5 check-sums of the extracted files are also provided in the open-source code repository for verification. Please note that the actual datasets of images are not included here (since we do not own those datasets). However, helper scripts for automatically downloading the original datasets are also provided in the every module and sub-directory of the GitHub code repository.
Computer Vision; Few-Shot Classification; Few-Shot Learning; Firth Bias Reduction