Illinois Data Bank Dataset Search Results
Results
published:
2025-10-01
Schetter, August; Lin, Cheng-Hsien; Zumpf, Colleen; Jang, Chunhwa; Hoffmann Jr., Leo; Rooney, William; Lee, DoKyoung
(2025)
Recently introduced photoperiod-sensitive (PS) biomass sorghum (Sorghum bicolor L. Moench) needs to be investigated for yield potential under different cultivation environments with reasonable nitrogen (N) inputs. The objectives of this study were to (1) evaluate the biomass yield and feedstock quality of four sorghum hybrids with different levels of PS ranging from very PS (VPS) hybrids and to moderate PS (MPS) hybrids, and (2) determine the optimal N inputs (0~168 kg N ha−1) under four environments: combinations of both temperate (Urbana, IL) and subtropical (College Station, TX) regions during 2018 and 2019. Compared to TX, the PS sorghums in central IL showed higher yield potential and steady feedstock production with an extended day length and with less precipitation variability, especially for the VPS hybrids. The mean dry matter (DM) yields of VPS hybrids were 20.5 Mg DM ha−1 and 17.7 Mg DM ha−1 in IL and TX, respectively. The highest N use efficiency occurred at a low N rate of 56 kg N ha−1 by improving approximately 33 kg DM ha−1 per 1.0 kg N ha−1 input. Approximately 70% of the PS sorghum biomass can be utilized for biofuel production, consisting of 58-65% of the cell-wall components and 4-11% of the soluble sugar. This study demonstrated that the rainfed temperate area (e.g., IL) has a great potential for the sustainable cultivation of PS energy sorghum due to their observed high yield potential, stable production, and low N requirements.
keywords:
Sustainability;Biomass Analytics;Field Data
published:
2020-07-15
Legried, Brandon; Molloy, Erin K.; Warnow, Tandy; Roch, Sebastien
(2020)
This repository includes scripts and datasets for the paper, "Polynomial-Time Statistical Estimation of Species Trees under Gene Duplication and Loss."
keywords:
Species tree estimation; gene duplication and loss; identifiability; statistical consistency; quartets; ASTRAL
published:
2023-07-01
Tonks, Adam; Hwang, Jeongwoo
(2023)
This is the data used in the paper "Assessment of spatiotemporal flood risk due to compound precipitation extremes across the contiguous United States".
Code from the Github repository https://github.com/adtonks/precip_extremes can be used with the data here to reproduce the paper's results. v1.0.0 of the code is also archived at https://doi.org/10.5281/zenodo.8104252
This dataset is derived from NOAA-CIRES-DOE 20th Century Reanalysis V3. The NOAA-CIRES-DOE Twentieth Century Reanalysis Project version 3 used resources of the National Energy Research Scientific Computing Center managed by Lawrence Berkeley National Laboratory which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231 and used resources of NOAA's Remotely Deployed High Performance Computing Systems.
keywords:
spatiotemporal; CONUS; United States; precipitation; extremes; flooding
published:
2025-06-03
White, Andrew; Lambert, John
(2025)
GIS data and geoprocessing tools associated with White and Lambert (2025) modeling paper that assesses the potential impact of development on the archaeological resources of Illinois.
keywords:
development; archaeology; climate change; GIS
published:
2019-10-15
Choi, Sang Hyun; Rao, Vikyath; Gernat, Tim; Hamilton, Adam; Robinson, Gene; Goldenfeld, Nigel
(2019)
Filtered trophallaxis interactions for two honeybee colonies, each containing 800 worker bees and one queen. Each colony consists of bees that were administered a juvenile hormone analogy, a vehicle treatment, or a sham treatment to determine the effect of colony perturbation on the duration of trophallaxis interactions. Columns one and two display the unique identifiers for each bee involved in a particular trophallaxis exchange, and columns three and four display the Unix timestamp of the beginning/end of the interaction (in milliseconds), respectively.<br /><b>Note</b>: the queen interactions were omitted from the uploaded dataset for reasons that are described in submitted manuscript. Those bees that performed poorly are also omitted from the final dataset.
keywords:
honey bee; trophallaxis; social network
published:
2020-03-14
Rhoads, Bruce ; Lindroth, Evan
(2020)
Data on bank elevations determined from lidar data for the Upper Sangamon River, Illinois, the Mission River, Texas, and the White River in Indiana
keywords:
bank elevations, rivers, meandering, lowland
published:
2020-09-25
This repository contains the datasets and corresponding results for the paper "MAGUS: Multiple Sequence Alignment using Graph Clustering".
The Datasets.zip archive contains the ROSE, balibase, Gutell, and RNASim datasets used in our experiments.
The Results.zip archive contains the outputs of running our methods against these datasets.
Datasets used:
ROSE: 10 simulated nucleotide model conditions from the SATe paper, each with 20 replicates, and with 1000 sequences per replicate.
The ROSE datasets were originally taken from <a href="https://sites.google.com/eng.ucsd.edu/datasets/alignment/sate-i">https://sites.google.com/eng.ucsd.edu/datasets/alignment/sate-i</a>
RNASim: This is a collection of simulated nucleotide datasets that were generated under a model of evolution that reflects selection due to RNA structural constraints. We sampled 20 subsets of 1000 sequences each, as well as 10 subsets of 10000 each, by randomly sampling from the original million-sequence RNASim dataset.
Gutell: 16S.M, 16S.3, 16S.T, 16S.B.ALL: Four biological nucleotide datasets from the Comparative Ribosomal Website (CRW) with cleaned reference alignments from SATe. Since PASTA is restricted to datasets without sequence length heterogeneity, these were modified to remove sequences that deviate by more than 20% from the median length. The scrubbed datasets range from 740 to 24,246 sequences. The pre-screened 16S datasets were taken from <a href="https://sites.google.com/eng.ucsd.edu/datasets/alignment/16s23s">https://sites.google.com/eng.ucsd.edu/datasets/alignment/16s23s</a>
BAliBASE: We use eight BAliBASE amino acid datasets used in the PASTA paper. As above, we remove outlier sequences, which leaves us with sizes ranging from 195 to 732 sequences. The pre-screened Balibase datasets were taken from <a href="https://sites.google.com/eng.ucsd.edu/datasets/alignment/pastaupp">https://sites.google.com/eng.ucsd.edu/datasets/alignment/pastaupp</a>
published:
2024-04-05
Sinaiko, Guy; Cao, Yanghui; Dietrich, Christopher H.
(2024)
The following files include specimen information, DNA sequence data, and additional information on the analyses used to reconstruct the phylogeny of the leafhopper genus Neoaliturus as described in the Methods section of the original paper:
1. Taxon_sampling.csv: contains data on the individual specimens from which DNA was extracted, including sample code, taxon name, collection data (locality, date and name of collector) and museum unique identifier.
2. Alignments.zip: a ZIP archive containing 432 separate FASTA files representing the aligned nucleotide sequences of individual gene loci used in the analysis.
3. Concatenated_Matrix.fa: is a FASTA file containing the concatenated individual gene alignments used for the maximum likelihood analysis in IQ-TREE.
4. Genes_and_Loci.rtf: identifies the individual genes and loci used in the analysis. The partition name is the same as the name of the individual alignment file in the zipped Alignments folder.
5. Partitions_best_scheme.nex: is a text file in the standard NEXUS format that indicates the names of the individual data partitions and their locations in the concatenated matrix, and also indicates the substitution model for each partition.
6. (New in this version 2) Scripts & Description.zip includes 8 custom shell or perl scripts used to assemble the DNA sequence data by perform reciprocal blast searches between the reference sequences and assemblies for each sample, extract the best sequences based on the blast searches, screen the hits for each locus and keep only the best result, and generate the nucleotide sequence dataset for the predicted orthologues (see the file description.txt for details).
7. (New in this version 2) Full_genetic_distances_matrix.csv shows the genetic distances between pairs of samples in the datset (proportion of nucleotides that differ between samples).
keywords:
leafhopper; phylogeny; anchored-hybrid-enrichment; DNA sequence; insect
published:
2025-12-14
Fraterrigo, Jennifer; Chen, Weile
(2025)
This dataset contains information about absorptive roots from 170 plots along a latitudinal and temperature gradient in northern Alaska, including tussock sedges and deciduous alder, birch, and willow shrubs. This dataset accompanies the paper "Impacts of Arctic Shrubs on Root Traits and Belowground Nutrient Cycles Across a Northern Alaskan Climate Gradient," which was published in Frontiers in Plant Sciences.
<b>*Note:</b> in the "patch coordinates" tab, the same coordinates/elevation ("Long", "Lat", and "Elev (m)") apply to all patches that share a number. For ex: "Patch" W1, B1, and G1 share the same "Long", "Lat", and "Elev (m)" values as "Patch" A1.
keywords:
absorptive root traits; shrub expansion; Arctic; Alaskan tundra
published:
2020-04-20
Supplemental data sets for the Manuscript entitled "Contribution of fungal and invertebrate communities to mass loss and wood depolymerization in tropical terrestrial and aquatic habitats"
keywords:
Coiba Island; wood decomposition; cellulose; hemicellulose; lignin breakdown; aquatic fungi
published:
2020-01-31
Bradshaw, Therin M.; Blake-Bradshaw, Abigail G.; Fournier, Auriel M.V.; Lancaster, Joseph D. ; O'Connell, John; Jacques, Christopher N.; Eicholtz, Michael W.; Hagy, Heath M
(2020)
Data inputs, and scripts for the analysis detailed in Bradshaw et al, published in PlosONE 2020.
keywords:
Marsh birds; wetlands
published:
2025-02-07
Huang, Annie H.; Matthews, Jeffrey W.
(2025)
These data represent the raw data from the paper “Influence of light availability and water depth on competition between Phalaris arundinacea and herbaceous vines” published in Wetlands by Annie H. Huang and Jeffrey W. Matthews. The data are archived in one file: Huang&Matthews_mesocosm_data_archive. This file includes raw data collected during a greenhouse experiment described in the paper.
published:
2025-09-17
Kamara, Shasta; Glomb, Jackson; Suski, Cory
(2025)
Data was generated from juvenile paddlefish acclimated to one of three different temperatures (13.0°C, 17.5°C, or 22.0°C) for two weeks. After which, fish were subjected to one of two experiments, one being simulated angling in which physiological parameters (stress hormones, lactate, glucose, ions, and oxygen transport parameters were evaluated in plasma or whole blood), the other experiment consisted of critical thermal maxima tests. Data set includes physiological parameters, water quality temperatures, and morphometric data generated from these experiments and fish.
keywords:
Sport fish, critical thermal maximum, exercise, recovery, conservation, fisheries, management
published:
2025-08-01
Beach, Cheyenne R.; Koop, Jennifer A.H.; Fournier, Auriel M.V.
(2025)
Data from the 2025 publication in the Wilson Journal of Ornithology with the same name.
keywords:
Lesser Scaup; Waterfowl; Transmitter Effects
published:
2025-08-04
Hartman, Theodore; Studt, Jacob; VanLoocke, Andy; McDaniel, Marshall; Howe, Adina; Masters, Michael D. ; Mitchell, Corey; DeLucia, Evan H.; Heaton, Emily
(2025)
This dataset contains the data used for the publication “Aboveground rather than belowground productivity drives variability in Miscanthus x giganteus net primary productivity”. This dataset contains Miscanthus x giganteus biomass, carbon, and nitrogen tissue data for aboveground and belowground plant parts collected in 2021 for three different sites in Iowa with three different nitrogen application rates. Data at the Iowa sites were collected via biometric hand harvesting, belowground excavations, and soil coring both in-clump and beside-clump. Data were collected at two collection timepoints to calculate the contributions of belowground parts to Miscanthus x giganteus net primary productivity. This dataset also includes Miscanthus x giganteus and Switchgrass soil coring and excavation data collected in 2012 at the University of Illinois Urbana Champaign Energy Farm.
keywords:
Miscanthus; Net Primary Productivity; Excavation; Nitrogen fertilization; Translocation; Belowground Biomass; Carbon
published:
2025-09-24
Lee, Jaewon; Kwak, Suryang; Liu, Jing-Jing; Yu, Sora; Yun, Eun Ju; Kim, Dong Hyun; Liu, Cassie; Kim, Kyoung Heon; Jin, Yong-Su
(2025)
2′-Fucosyllactose (2′-FL), a human milk oligosaccharide with confirmed benefits for infant health, is a promising infant formula ingredient. Although Escherichia coli, Saccharomyces cerevisiae, Corynebacterium glutamicum, and Bacillus subtilis have been engineered to produce 2′-FL, their titers and productivities need be improved for economic production. Glucose along with lactose have been used as substrates for producing 2′-FL, but accumulation of by-products due to overflow metabolism of glucose hampered efficient production of 2′-FL regardless of a host strain. To circumvent this problem, we used xylose, which is the second most abundant sugar in plant cell wall hydrolysates and is metabolized through oxidative metabolism, for the production of 2′-FL by engineered yeast. Specifically, we modified an engineered S. cerevisiae strain capable of assimilating xylose to produce 2′-FL from a mixture of xylose and lactose. First, a lactose transporter (Lac12) from Kluyveromyces lactis was introduced. Second, a heterologous 2′-FL biosynthetic pathway consisting of enzymes Gmd, WcaG, and WbgL from E. coli was introduced. Third, we adjusted expression levels of the heterologous genes to maximize 2′-FL production. The resulting engineered yeast produced 25.5 g/L of 2′-FL with a volumetric productivity of 0.35 g/L∙h in a fed-batch fermentation with lactose and xylose feeding to mitigate the glucose repression. Interestingly, the major location of produced 2′-FL by the engineered yeast can be changed using different culture media. While 72% of the produced 2′-FL was secreted when a complex medium was used, 82% of the produced 2′-FL remained inside the cells when a minimal medium was used. As yeast extract is already used as food and animal feed ingredients, 2′-FL enriched yeast extract can be produced cost-effectively using the 2′-FL-accumulating yeast cells.
keywords:
Conversion;Genome Engineering
published:
2022-05-20
Haselhorst, Derek; Moreno, J. Enrique; Tcheng, David K.; Punyasena, Surangi W.
(2022)
This dataset includes images and annotated counts for 150 airborne pollen samples from the Center for Tropical Forest Science 50 ha forest dynamics plot on Barro Colorado Island, Panama. Samples were collected once a year from April 1994 to June 2010.
keywords:
aerial pollen traps; automated pollen identification; Barro Colorado Island; convolutional neural networks; Neotropics; palynology; phenology
published:
2022-08-20
Jones, Todd; Ward, Michael
(2022)
Dataset associated with Jones and Ward BEAS-D-21-00106R2 submission: Parasitic cowbird development up to fledging and subsequent post-fledging survival reflect life history variation found across host species. Excel CSV files and .inp file with data used in nest survival and Brown-headed Cowbird post-fledging analyses and file with descriptions of each column. The CSV file is setup for logistic exposure models in SAS or R and the .inp file is setup to be uploaded into program MARK for multi-state recaptures only analysis. Species included in the analyses: American Robin, Blue Grosbeak, Brown Thrasher, Blue-winged Warbler, Carolina Chickadee, Chipping Sparrow, Common Yellowthroat, Dickcissel, Eastern Bluebird, Eastern Phoebe, Eastern Towhee, Field Sparrow, Gray Catbird, House Wren, Indigo Bunting, Northern Cardinal, Red-winged Blackbird, Tree Swallow, Yellow-breasted Chat, and Yellow Warbler.
keywords:
brood parasitism; cowbird; carryover effects; phenotypic plasticity; post-fledging; songbirds
published:
2024-07-11
Pelech, Elena; Long, Steve
(2024)
This dataset includes the gas exchange and TDL (tunable diode laser) files between 4 accessions of Glycine soja and 1 elite accession of Glycine max (soybean) during light induction.
In this V2, code files for Matlab and R are also included to calculate mesophyll conductance and calculate the limitation on photosynthesis, respectively.
keywords:
photosynthesis; mesophyll conductance; soybean; light induction
published:
2019-08-13
Nowak, Jennifer E.; Sweet, Andrew D.; Weckstein, Jason D.; Johnson, Kevin P.
(2019)
Multiple sequence alignments from concatenated nuclear and mitochondrial genes and resulting phylogenetic tree files of fruit doves and their close relatives. Files include: BEAST input XML file (fruit_dove_beast_input.xml); a maximum clade credibility tree from a BEAST analysis (fruit_dove_beast_mcc.tre); concatenated multiple sequence alignment NEXUS files for the novel dataset (fruit_dove_concatenated_alignment.nex, 76 taxa, 4,277 characters) and the dataset with additional sequences (fruit_dove_plus_cibois_data_concatenated_alignment.nex, 204 taxa, 4,277 characters), both of which contain a MrBayes block including partition information; and 50% majority-rule consensus trees generated from MrBayes analyses, using the NEXUS alignment files as inputs (fruit_dove_mrbayes_consensus.tre, fruit_dove_plus_cibois_data_mrbayes_consensus.tre).
keywords:
fruit doves; multiple sequence alignment; phylogeny; Aves: Columbidae
published:
2019-12-20
Wang, Yu; Burgess, Steven J. ; de Becker, Elsa ; Long, Stephen P.
(2019)
This dynamic photosynthesis model of soybean canopy is developed by Yu Wang (yuwangcn@illinois.edu), IGB, University of Illinois.
If you want to know more details, please check the following publication
Yu Wang, Steven J. Burgess, Elsa de Becker, Stephen P. Long. Photosynthesis in the fleeting shadows: An overlooked opportunity for increasing crop productivity? The Plant Journal.
keywords:
Matlab; Soybean canopy; photosynthesis model
published:
2024-02-08
Martinez, Carlos; Pena, Gisselle; Wells, Kaylee K.
(2024)
This dataset contains transcribed entries from the "Prairie Directory of North America" (Adelman and Schwartz 2013) for the Tallgrass, Mixed Grass, and Shortgrass prairie regions of the united states. We identified the historical spatial extent of the Tallgrass, Mixed Grass, and Shortgrass prairie regions using Ricketts et al. (1999), Olson et al. (2001), and Dixon et al. (2014) and selected the counties entirely or partially within these boundaries from the USDA Forest Service (2022) file. The resulting lists of counties are included as separate files. The dataset contains information on publicly accessible grasslands and prairies in these regions including acreage and amenities like hunting access, restrooms, parking, and trails.
keywords:
grasslands; prairies; prairie directory of north america; site amenities; site attributes
published:
2020-03-13
Sweet, Andrew; Johnson, Kevin; Cameron, Stephen
(2020)
Data files associated with the assembly of mitochondrial minicircles from five species of parasitic lice. This includes data from four species in the genus Columbicola and from the human louse (Pediculus humanus). The files include FASTA sequences for all five species, reference sequences for read mapping approaches, resulting contigs produced by various assembly approaches, and alignments of human louse minicircles mapped to published sequences of the same species.
keywords:
mitochondria; FASTA; nucleotide sequences; alignment; Columbicola; Pediculus
published:
2021-05-12
Clem, Scott; Harmon-Threatt, Alexandra
(2021)
These are the data sets associated with our publication "Field borders provide winter refuge for beneficial predators and parasitoids: a case study on organic farms." For this project, we compared the communities of overwintering arthropod natural enemies in organic cultivated fields and wildflower-strip field borders at five different sites in central Illinois.
Abstract:
Semi-natural field borders are frequently used in midwestern U.S. sustainable agriculture. These habitats are meant to help diversify otherwise monocultural landscapes and provision them with ecosystem services, including biological control. Predatory and parasitic arthropods (i.e., potential natural enemies) often flourish in these habitats and may move into crops to help control pests. However, detailed information on the capacity of semi-natural field borders for providing overwintering refuge for these arthropods is poorly understood. In this study, we used soil emergence tents to characterize potential natural enemy communities (i.e., predacious beetles, wasps, spiders, and other arthropods) overwintering in cultivated organic crop fields and adjacent field borders. We found a greater abundance, species richness, and unique community composition of predatory and parasitic arthropods in field borders compared to arable crop fields, which were generally poorly suited as overwintering habitat. Furthermore, potential natural enemies tended to be positively associated with forb cover and negatively associated with grass cover, suggesting that grassy field borders with less forb cover are less well-suited as winter refugia. These results demonstrate that semi-natural habitats like field borders may act as a source for many natural enemies on a year-to-year basis and are important for conserving arthropod diversity in agricultural landscapes.
keywords:
Natural enemy; wildflower strips; conservation biological control; semi-natural habitat; field border; organic farming
published:
2024-07-11
Schneider, Amy; Suski, Cory
(2024)