Displaying datasets 151 - 175 of 276 in total

Subject Area

Life Sciences (276)
Social Sciences (0)
Physical Sciences (0)
Technology and Engineering (0)
Uncategorized (0)
Arts and Humanities (0)


Other (95)
U.S. National Science Foundation (NSF) (75)
U.S. Department of Energy (DOE) (29)
U.S. Department of Agriculture (USDA) (23)
U.S. National Institutes of Health (NIH) (22)
Illinois Department of Natural Resources (IDNR) (11)
U.S. Geological Survey (USGS) (3)
U.S. National Aeronautics and Space Administration (NASA) (2)
U.S. Army (2)
Illinois Department of Transportation (IDOT) (0)

Publication Year

2021 (66)
2020 (60)
2022 (52)
2019 (42)
2018 (23)
2017 (19)
2016 (12)
2023 (2)


CC0 (180)
CC BY (87)
custom (9)
published: 2020-10-27
The data file contains a list of included studies with their detailed metadata, taken from Cochrane reviews which were used in a project associated with the manuscript "Evaluation of an automated probabilistic RCT Tagger applied to published Cochrane reviews".
keywords: Cochrane reviews; automation; randomized controlled trial; RCT; systematic review
published: 2020-10-27
The data file contains detailed information of the Cochrane reviews that were used in a project associated with the manuscript (working title) "Evaluation of an automated probabilistic RCT Tagger applied to published Cochrane reviews".
keywords: Cochrane reviews; systematic reviews; randomized control trial; RCT; automation
published: 2020-10-16
Video footage of an Eastern Box Turtle (Terrapene carolina carolina) partially predating a Field Sparrow nest (Spizella pusilla) at 0845 h on the 31 of May 2020. Please note that the date on the video footage is incorrect due to user error, but the time is correct.
keywords: nest predation; turtle; songbird; nest camera; Terrapene carolina carolina; Spizella pusilla;
published: 2020-10-15
This dataset consists of various input data that are used in the GAMS model. All the data are in the format of .inc which can be read within GAMS or Notepad. Main data sources include: acreage data (acre), crop budget data ($/acre), crop yield data (e.g. bushel/acre), Soil carbon sequestration data (KgCO2/ha/yr). Model details can be found in the "Assessing the Additional Carbon Savings with Biofuel" and GAMS model package. ## File Description (1) GAMS Model.zip: This includes all the input files and scripts for running the model (2) Table*.csv: These files include the data from the tables in the manuscript (3) Figure2_3_4.csv: This contains the data used to create the figures in the manuscript (4) BaselineResults.csv: This includes a summary of the model results. (5) SensitivityResults_*.csv: Model results from the various sensitivity analyses performed (6) LUC_emission.csv: land use change emissions by crop reporting district for changes of pasturelands to annual crops.
keywords: Biogenic carbon intensity; Corn ethanol; Economic model; Dynamic optimization; Anticipated baseline approach; Life cycle carbon intenisty
published: 2020-10-14
Data on permanent plots at Fortuna and the Panama Canal Watershed, Republic of Panama, containing counts and percent of trees with one or more multiple stems >10cm diameter, with and without palms. Accompanying environmental data includes elevation, precipitation, soil type and soil chemical variables (pH, total N, NO3, NO4, resin P, mehlich Ca, K and Mg.
keywords: multiple stems; resprouting; Panama Canal Watershed; Fortuna Forest Reserve
published: 2020-10-01
Raw gas exchange data for photosynthetic induction in 6 rice accession flag leaves. Photosynthetic induction and point measurements were made at ambient [CO2]. Two accessions (AUS 278 and IR64) were selected to screen in greater detail in which photosynthetic induction was measured at six [CO2].
published: 2020-09-25
This repository contains the datasets and corresponding results for the paper "MAGUS: Multiple Sequence Alignment using Graph Clustering". The Datasets.zip archive contains the ROSE, balibase, Gutell, and RNASim datasets used in our experiments. The Results.zip archive contains the outputs of running our methods against these datasets. Datasets used: ROSE: 10 simulated nucleotide model conditions from the SATe paper, each with 20 replicates, and with 1000 sequences per replicate. The ROSE datasets were originally taken from <a href="https://sites.google.com/eng.ucsd.edu/datasets/alignment/sate-i">https://sites.google.com/eng.ucsd.edu/datasets/alignment/sate-i</a> RNASim: This is a collection of simulated nucleotide datasets that were generated under a model of evolution that reflects selection due to RNA structural constraints. We sampled 20 subsets of 1000 sequences each, as well as 10 subsets of 10000 each, by randomly sampling from the original million-sequence RNASim dataset. Gutell: 16S.M, 16S.3, 16S.T, 16S.B.ALL: Four biological nucleotide datasets from the Comparative Ribosomal Website (CRW) with cleaned reference alignments from SATe. Since PASTA is restricted to datasets without sequence length heterogeneity, these were modified to remove sequences that deviate by more than 20% from the median length. The scrubbed datasets range from 740 to 24,246 sequences. The pre-screened 16S datasets were taken from <a href="https://sites.google.com/eng.ucsd.edu/datasets/alignment/16s23s">https://sites.google.com/eng.ucsd.edu/datasets/alignment/16s23s</a> BAliBASE: We use eight BAliBASE amino acid datasets used in the PASTA paper. As above, we remove outlier sequences, which leaves us with sizes ranging from 195 to 732 sequences. The pre-screened Balibase datasets were taken from <a href="https://sites.google.com/eng.ucsd.edu/datasets/alignment/pastaupp">https://sites.google.com/eng.ucsd.edu/datasets/alignment/pastaupp</a>
published: 2020-09-18
Restriction site-associated DNA sequencing (RAD-seq) data from 643 Miscanthus accessions from a diversity panel, including 613 Miscanthus sacchariflorus, three M. sinensis, and 27 M. xgiganteus. DNA was digested with PstI and MspI, and single-end Illumina sequencing was performed adjacent to the PstI site. Variant and genotype calling was performed with TASSEL-GBSv2, using the Miscanthus sinensis v7.1 reference genome from Phytozome 12 (https://phytozome.jgi.doe.gov). Additional ploidy-aware genotype calling was performed by polyRAD v1.1.
keywords: variant call format (VCF); genotyping-by-sequencing (GBS); single nucleotide polymorphism (SNP); grass; genetic diversity; biomass
published: 2020-09-17
Data are from a long-term fire manipulation experiment in the Missouri Ozarks, USA. Data include the raw, annual ring-width increment (rwl), basal area increment (BAI), population-level annual growth resistance (Drs) and resilience (Drl) to drought, intrinsic water use efficiency values (WUEi) and oxygen isotopic composition of individual radial growth rings (δ18O) from southern red oak (Quercus falcata) and post oak (Q. stellata) trees. ---------------------- TITLE: Data for "Sixty-five years of fire manipulation reveals climate and fire interact to determine growth rates of Quercus spp." ---------------------- FILE OVERVIEW: This dataset contains four (4) CSV files as described below: Refsland_et_al_ECS20-0465_BAI.csv: annual basal area increment between 1948-2015 for trees across the fire manipulation experiment Refsland_et_al_ECS20-0465_DroughtIndices.csv: population-level drought resistance and resilience of trees during each target drought period Refsland_et_al_ECS20-0465_WUEi.csv: carbon isotope indicators of drought stress for trees across the fire manipulation experiment Refsland_et_al_ECS20-0465_d18Or.csv: oxygen isotope indicators of drought stress for trees across the fire manipulation experiment ---------------------- VARIABLE EXPLANATION: All the variables in those four files are explained as below: treeID: unique character string that identifies subject tree block: integer (1, 2) that identifies the study block plot: integer (1-12) that identifies the plot nested within each study block trt: character string (Annual, Control, Periodic) that identifies the fire treatment of a given plot species: character string (Quercus falcata, Quercus stellata) that identifies species of subject tree year: integer (1948-2015) that identifies the dated year of each tree ring rwl_mm: numerical value representing the annual tree ring-width, in mm bai_cm2: numerical value representing the annual basal area increment, in cm2 timeperiod: integer value (1953, 1964, 2007, 2012) representing the periods encompassing target dry and wet years Drs_2yr: numerical value representing the drought resistance, defined as the population-level annual growth of trees during drought years relative to pre-drought years for a given time period Drl_2yr: numerical value representing the drought resilience, defined as the population-level annual growth of trees following drought years relative to pre-drought years for a given time period stand_ba_m2ha: numerical value representing the total basal area of a given plot, in m2 per ha stand_density_stems_ha: numerical value representing the total stem density of a given plot, in stems per ha pool: numerical value (1-40) identifying the set of tree ring samples pooled for analysis. Samples were pooled by block, plot, year and species period: integer value (1953, 1964, 1980, 2007, 2012) representing the periods encompassing target dry and wet years type: character string (Dry, Wet) indicating the water availability of a given year d13C: numerical value representing the carbon isotopic composition of radial growth rings within a given sample pool, in per mil WUEi: numerical value representing the annual intrinsic water use efficiency of radial growth rings within a given sample pool d18O: numerical value representing the oxygen isotopic composition of radial growth rings within a given sample pool, in per mil
keywords: climate change adaptation; drought; fire; nitrogen availability; oak-hickory; radial growth; resilience; resistance; stand density; temperate broadleaf forest; water stress
published: 2020-09-07
This dataset contains BEPAM model code and input data to the replicate the results for "Assessing the Returns to Land and Greenhouse Gas Savings from Producing Energy Crops on Conservation Reserve Program Land." The dataset consists of: (1) The replication codes and data for the BEPAM model. The code file is named as output_0213-2020_Complete_daycent-agversion-[rental payment level]%_[biomass price].gms. (BEPAM-CRP model-Sep2020.zip) (2) Simulation results from the BEPAM model (BEPAM_Simulation_Results.csv) * Item (1) is in GAMS format. Item (2) is in text format.
keywords: Miscanthus; Switchgrass; soil carbon sequestration; greenhouse gas savings; rental payments; biomass price
published: 2020-07-15
This repository includes scripts and datasets for the paper, "Polynomial-Time Statistical Estimation of Species Trees under Gene Duplication and Loss."
keywords: Species tree estimation; gene duplication and loss; identifiability; statistical consistency; quartets; ASTRAL
published: 2020-06-30
This file contains 13 unique case studies that were created for the One health: Infectious diseases course offered at the University of Illinois at Urbana-Champaign campus. The case studies are being made available as educational resources for other One health courses. Each case study is focused on a theme/topic which is associated with One health. These case studies were created using publicly available information and references have been provided for each case study.
keywords: One health education; infectious diseases; case studies
published: 2020-02-12
This is the dataset used in the Landscape Ecology publication of the same name. This dataset consists of the following files: NWCA_Int_Veg.txt NWCA_Reg_Veg.txt NWCA_Site_Attributes.txt NWCA_Int_Veg.txt is a site and plot by species matrix. Column labeled SITES consists of site IDs. Column labeled Plots consist of Plot ID numbers. All other columns represent species abundances (estimates of percent cover, summed across five plots). NWCA_Reg_Veg.txt is a site by species matrix of species abundances. Column labeled SITES consist of site IDs. All other columns represent species abundances (estimates of percent cover within individual plots). NWCA_Site_Attributes.txt is a matrix of site attributes. Column labeled SITES consist of site IDs. Column labeled AA_CENTER_LAT consist of latitudinal coordinates for the Assessment Area center point in decimal degrees. Column labeled AA_CENTER_LONG consist of longitudinal coordinates for the Assessment Area center point in decimal degrees. Column REFPLUS_NWCA represents disturbance gradient classes including MIN (minimally disturbed), L (least disturbed), I (intermediate), M (most disturbed). Column REFPLUS_NWCA2 represents revised disturbance gradient classes based on protocols described in the article. These revised classes were used for analysis. Column labeled STRESS_HEAVYMETAL represents heavy metal stressor classes, used to ascertain which wetlands were missing soil data. Classes in the STRESS_HEAVYMETAL column include Low, Moderate, High, and Missing. Sites with Missing STRESS_HEAVYMETAL classes were removed from analysis. More information about this dataset: All of the data used in this analysis was gathered from the National Wetlands Condition Assessment. Wetland surveys were conducted from 4/4/2011 to 11/2/2011. The entire National Wetlands Condition Assessment Dataset, which includes 3640 unique taxonomic identities of plants, can be found at: https://www.epa.gov/national-aquatic-resource-surveys/data-national-aquatic-resource-surveys
keywords: Anthropogenic disturbance; β-Diversity; Biotic homogenization; Phalaris arundinacea; reed canary grass; Wetlands
published: 2020-06-06
These data are from an observational study and small experiment investigating reproductive biology and hybridization between two plants, Celastrus scandens L. and Celastrus orbiculatus Thunb. (Celastraceae). These data were collected during the 2008 growing season from the Indiana Dunes National Park (formerly Indiana Dunes National Lakeshore), just east of the municipality of Ogden Dunes, Indiana, USA. The five data files provide information on floral output of the two species, fertilization rate, fruit set rate, hybridization rate at two scales (individual flowers in both species, individual maternal plants in C. scandens), and the results of a hand-pollination experiment that exchanged pollen between the two species. There are six data files associated with this submission, five data files in comma-separated values format and one text file (‘readme.txt’) that includes detailed explanations of the data files.
keywords: Celastrus; invasive species; hybridization; heterospecific pollen; hand pollination
published: 2020-06-02
The text file contains the original data used in the phylogenetic analyses of Xue et al. (2020: Systematic Entomology, in press). The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The first six lines of the file identify the file as NEXUS, indicate that the file contains data for 89 taxa (species) and 2676 characters, indicate that the first 2590 characters are DNA sequence and the last 86 are morphological, that gaps inserted into the DNA sequence alignment and inapplicable morphological characters are indicated by a dash, and that missing data are indicated by a question mark. The file contains aligned nucleotide sequence data for 5 gene regions and 86 morphological characters. The positions of data partitions are indicated in the mrbayes block of commands for the phylogenetic program MrBayes at the end of the file (Subset1 = 16S gene; Subset2 = 28S gene; Subset3 = COI gene; Subset 4 = Histone H3 and H2A genes). The mrbayes block also contains instructions for MrBayes on various non-default settings for that program. These are explained in the original publication. Descriptions of the morphological characters and more details on the species and specimens included in the dataset are provided in the supplementary document included as a separate pdf, also available from the journal website. The original raw DNA sequence data are available from NCBI GenBank under the accession numbers indicated in the supplementary file.
keywords: phylogeny; DNA sequence; morphology; Insecta; Hemiptera; Cicadellidae; leafhopper; evolution; 28S rDNA; 16S rDNA; histone H3; histone H2A; cytochrome oxidase I; Bayesian analysis
published: 2020-06-03
This dataset provides files for use in analysis of human land preference across Australasia, and in a localized analysis of land preference in Laos and Vietnam. All files can be imported into ArcGIS for visualization, and re-analyzed using the open source Maxent species distribution modeling program. CSV files contain known human presence sites for model validation. ASC files contain geographically coded environmental data for mean annual temperature and mean annual precipitation during the Last Glacial Maximum, as well as downward slope data. All ASC files are in the WGS 1984 Mercator map projection for visualization in ArcGIS and can be opened as text files in text editors supporting large file sizes.
keywords: human dispersal; ecological niche modeling; Australasia; Late Pleistocene; land preference
published: 2020-05-31
This repository includes a simulated dataset and related scripts used for the paper "Moss: Accurate Single-Nucleotide Variant Calling from Multiple Bulk DNA Tumor Samples".
keywords: Somatic Mutations; Bulk DNA Sequencing; Cancer Genomics
published: 2020-05-30
Original leaf gas exchange and absorptance data used in the Collison et al. (2020) Light, Not Age, Underlies the Q9 Maladaptation of Maize and Miscanthus Photosynthesis to Self-Shading - Frontiers in Plant Science doi: 10.3389/fpls.2020.00783
keywords: C4 photosynthesis; canopy; bioenergy; food security; quantum yield; shade acclimation; photosynthetic light-use efficiency; leaf aging
published: 2019-11-11
This repository includes scripts and datasets for the paper, "FastMulRFS: Fast and accurate species tree estimation under generic gene duplication and loss models." Note: The results from estimating species trees with ASTRID-multi (included in this repository) are *not* included in the FastMulRFS paper. We estimated species trees with ASTRID-multi in the fall of 2019, but ASTRID-multi had an important bug fix in January 2020. Therefore, the ASTRID-multi species trees in this repository should be ignored.
keywords: Species tree estimation; gene duplication and loss; statistical consistency; MulRF, FastRFS
published: 2020-04-02
Automatic and manual counts of black flies captured in Illinois.
keywords: black flies; simuliids; ImageJ; count method
published: 2020-04-22
Nest survival and Fledgling production data for Bell's Vireo and Willow Flycatcher nests.
keywords: Bell's Vireo;Willow Flycatcher;habitat selection;fitness;
published: 2017-12-22
TBP assessment raw data files of pre- and post- motion capture velocity and center of pressure force plate data. Labels are self-explanatory. The .mat files refer to data exported from the force plate for the time-to-stabilization assessments while the .txt files are the data collected for smoothness of gait assessments. These files do not relate to one another and are from separate assessments. Version2's files are the result from using Python code Data_Bank_Cleaner.py on version1's. Please find more information in READ_ME_databank.txt.
keywords: Multiple Sclerosis; Rehabilitation; Balance; Ataxia; Ballet; Dance; Targeted Ballet Program
published: 2020-04-20
Supplemental data sets for the Manuscript entitled "Contribution of fungal and invertebrate communities to mass loss and wood depolymerization in tropical terrestrial and aquatic habitats"
keywords: Coiba Island; wood decomposition; cellulose; hemicellulose; lignin breakdown; aquatic fungi