Illinois Data Bank Dataset Search Results
Results
published:
2023-04-02
Lee, Yuanyao; Khanna, Madhu; Chen, Luoye
(2023)
Use of cellulosic biofuels from non-feedstocks are modeled using the BEPAM (Biofuel and Environmental Policy Analysis Model) model to quantifying the uncertainties about induced land use change effects, net greenhouse gas saving potential, and economic costs. The code is in GAMS, general algebraic modeling language.
NOTE: Column 3 is titled "BAU" in "merged_BAU.gdx", "merged_RFS.gdx", and "merged_CEM.gdx", but contains "RFS" data in "merged_RFS.gdx" and "CEM" data in "merged_CEM.gdx".
keywords:
cellulosic biomass; BEPAM; economic modeling
published:
2024-08-02
Morrow Plots Data Curation Working Group
(2024)
The Morrow Plots at the University of Illinois at Urbana-Champaign are the longest-running continuous experimental plots in the Americas. In continuous operation since 1876, the plots were established to explore the impact of crop rotation and soil treatment on corn crop yields. In 2018, The Morrow Plots Data Curation Working Group began to identify, collect and curate the various data records created over the history of the experiment. The resulting data table published here includes planting, treatment and yield data for the Morrow Plots since 1888. Please see the included codebook for a detailed explanation of the data sources and their content. This dataset will be updated as new yield data becomes available.
*NOTE: While digitized and accessed through IDEALS, the physical copy of the field notebook: <a href="https://archon.library.illinois.edu/archives/index.php?p=collections/controlcard&id=11846">Morrow Plots Notebook, 1876-1913, 1967</a> is also held at the University of Illinois Archives.
keywords:
Corn; Crop Science; Experimental Fields; Crop Yields; Agriculture; Illinois; Morrow Plots
published:
2025-11-20
Ahmed, Md Wadud; Esquerre, Carlos A.; Eilts, Kristen; Allen, Dylan P.; McCoy, Scott M.; Varela, Sebastian; Singh, Vijay; Leakey, Andrew; Kamruzzaman, Mohammad
(2025)
NIR spectroscopy is a rapid and accurate green technology for high-throughput biomass characterization, including sorghum (Sorghum bicolor), a promising energy crop for the biofuel industry. This study assessed the influence of particle size on NIR spectroscopic analysis (wavelength range: 867–2535 nm) of sorghum biomass composition. Grown under field conditions, a total of 113 types of genetically diverse sorghum accessions were dried, ground, and sieved (<250, 250–600, 600–850, and > 850 µm particle size) for developing partial least square regression (PLSR) prediction models for moisture, ash, extractive, glucan, xylan, acid-soluble lignin (ASL), acid-insoluble lignin (AIL), and total lignin (ASL + AIL). Overall, smaller particle sizes provided better model performance, while no single particle size provided the best performance for all the selected components. With only 9 selected bands and 4 latent variables (LVs), the best PLSR model was obtained for moisture with particle size of 600–850 µm with the square root of the coefficient of determination (R) of 0.85, the ratio of prediction to deviation (RPD) of 2.2, and the root mean square error (RMSE) of 0.46 % in external validation. Similar model performances were also obtained for ash, extractive, glucan, and xylan. This study showed that size reduction could effectively improve NIR spectroscopic analysis for lipid-producing sorghum biomass for the biofuel industry.
keywords:
Conversion;Feedstock Production;Biomass Analytics;Modeling;Sorghum
published:
2021-07-15
Castro, Daniel; Sweedler, Jonathan
(2021)
The dataset contains the high-throughput matrix-assisted laser desorption/ionization mass spectrometry XmL files for the atrial gland and red hemiduct of Aplysia californica.
keywords:
Dense-core vesicle; High-throughput; Mass Spectrometry; MALDI; Organelle; Image-Guided; Atrial gland; red hemiduct; Lucent Vesicle
published:
2022-04-20
This is the core data for Zinnen et al., "Functional traits and responses to nutrient and mycorrhizal addition are inconsistently related to wetland plant species’ coefficients of conservatism." This is submitted to Wetlands Ecology and Management.
Two datasets are submitted here. The first is greenhouse-collected data of 9 plant traits and concurrent treatment responses of Illinois wetland plant species. The second are field-collected leaf trait data of Illinois wetland plant species. These data are analyzed in the paper. Please refer to the main manuscript to see how these data were produced and specific analyses.
keywords:
ecological indicators; Floristic Quality Assessment; Floristic Quality Index; wetland degradation
published:
2022-09-08
Hartman, Jordan; Larson, Eric
(2022)
Data associated with the manuscript "Overlooked invaders? Ecological impacts of non-game, native transplant fishes in the United States" by Jordan H. Hartman and Eric R. Larson
keywords:
freshwater; non-game; native transplant; impacts; invasive species
published:
2022-10-27
Holiman, Haley; Kitaif, J. Carson; Fournier, Auriel M.V.; Iglay, Ray; Woodrey, Mark S.
(2022)
keywords:
marsh birds; automated recording units
published:
2021-09-17
Stern, Jessica; Herman, Brook D. ; Matthews, Jeffrey
(2021)
We studied vegetation metric robustness to environmental (season, interannual, and regional) and methodological (observer) variables, as well as adequate sample size for vegetation metrics across four regions of the United States.
keywords:
coefficients of conservatism; floristic quality assessment; restoration; vegetation metric;
published:
2022-03-31
Crawford, Reed D.; Dodd, Luke E.; Tillman, Frank E.; O'Keefe, Joy M.
(2022)
This dataset contains our bi-hourly temperature recordings from 40 rocket box style artificial roosts of 5 designs deployed in Indiana and Kentucky, USA from April through September 2019. This dataset also includes our endothermic and faculatively heterothermic daily energy expenditure datasets used in our bioenergetic analysis, which were calculated from the bi-hourly rocket box temperature data. Lastly, we include our overheating counts dataset which summarizes daily overheating events (i.e., temperatures > 40 Celsius) in each rocket box style bat box over the course of the study period, these daily summaries were also calculated from the bi-hourly rocket box temperature recordings.
keywords:
artificial roost; bat box; microcllimate; temperature
published:
2025-11-06
Salmonella HilD 3'UTR GRIL-seq sequencing data
keywords:
Salmonella; SPI1; hilD
published:
2025-04-24
Smith, Rebecca; Chakraborty, Sulagna; Lyons, Lee Ann; Winata, Fikriyah; Mateus-Pinilla, Nohra
(2025)
These are the datasets underlying the figures in the manuscript "Methods of active surveillance for hard ticks and associated tick-borne pathogens of public health importance in the contiguous United States: A Comprehensive Systematic Review".
The review considered only publications reporting on active tick or tick-borne pathogen surveillance in the contiguous United States published between 1944 and 2018. For the purposes of this review, we were only concerned with studies of Ixodidae (hard ticks) and/or studies of tick-borne pathogens (in humans, animals, or hard ticks) of public health importance to humans. Study designs included cross-sectional, serological, epidemiological, ecological, or observational studies. Only peer-reviewed publications published in the English language were included. Studies were excluded if they focused on a tick that is not a vector of a human pathogen or on a pathogen that does not cause disease in humans, if the tick or tick-borne pathogen findings were incidental, or if they did not include quantitative surveillance data. For the purpose of this study, we defined surveillance data as information on ticks or pathogens provided through active sampling in natural areas; it should be noted that this does not match the strict definition used by the CDC, which requires sustained sampling efforts across time. Studies were also excluded if they: explored regions other than the contiguous US; focused on treatment, vaccine, or therapeutics development and/or diagnostics of human disease; focused on tick or pathogen genetics; focused on experimental studies with ticks or hosts; were tick control and/or management studies; performed only passive surveillance; were review articles; were not peer reviewed; were in a language other than English; the full text was not available; and if the disease was not a risk to the general public. In addition, for articles which reported data that had previously been published, we only included previously unreported information collected by the authors, and we referenced the specific period of collection for these data to ensure we were not double-recording data. Due to publication delays, we also performed a non-systematic review of the literature of articles published between 2019 – 2023 on tick and tickborne pathogen surveillance methods conducted in the contiguous United States.
Keyword search was performed in PubMed Central and Web of Science Core Collection databases. The search algorithm keywords included tick(s), Amblyomma, Dermacentor, Ixodes, Rhipicephalus, Acari Ixodidea, tick host(s), Lyme disease, Rocky Mountain Spotted Fever, Spotted Fever Group, Rickettsiosis, Ehrlichiosis, Anaplasmosis, Borreliosis, Tularemia, Babesiosis, tick-borne pathogen, Powassan, Heartland, Bourbon, Colorado tick fever, Pacific Coast tick fever, tick surveillance, surveillance, (sero)epidemiology, prevalence, distribution, ecology, United States. The search algorithm utilized is provided as follows:
TI= ((ticks OR Ixodes OR Amblyomma OR Dermacentor OR Rhipicephalus OR "Acari Ixodidi" OR "tick hosts" OR "tick host") OR ("Lyme Disease" OR "Rocky Mountain Spotted Fever" OR "Spotted Fever Group" OR Rickettsiosis OR Rickettsial OR Ehrlichiosis OR Anaplasmosis OR Borreliosis OR Tularemia OR Babesiosis OR Borrelia OR Ehrlichia OR Anaplasma OR Rickettsia OR Babesia OR "tick-borne pathogen" OR "tick borne pathogen")) AND TS= ("tick surveillance" OR surveillance OR epidemiology OR seroepidemiology OR ecology) AND CU=("United States of America" OR "USA" OR "United States" OR United-States).
These datasets are the collated data underlying the figures in the manuscript. For more details, please see the publication.
The following are explanations for variables used in all the CSV files:
Tick: Species of tick collected
Tick_Method: Method of collecting ticks
Pathogen: Species of pathogen tested for
Path_Method: Method of testing for pathogens
Decade: Decade of publication
n: Number of publications
STATE: state in which study was conducted
COUNTY: county in which study was conducted
1944 - 2018 (Was surveillance performed?): was there at least one publication included with a publication date within the 1944-2018 period in this geographic region?
2019 - 2023 (Was surveillance performed?): was there at least one publication included with a publication date within the 2019-2023 period in this geographic region?
keywords:
ticks; systematic review; surveillance
published:
2021-06-16
Warnow , Tandy; Wedell, Eleanor
(2021)
Thank you for using these datasets.
These RNAsim aligned fragmentary sequences were generated from the query sequences selected by Balaban et al. (2019) in their variable-size datasets (https://doi.org/10.5061/dryad.78nf7dq). They were created for use for phylogenetic placement with the multiple sequence alignments and backbone trees provided by Balaban et al. (2019).
The file structures included here also correspond with the data Balaban et al. (2020) provided.
This includes:
Directories for five varying backbone tree sizes, shown as 5000, 10000, 50000, 100000, and 200000. These directory names are also used by Balaban et al. (2019), and indicate the size of the backbone tree included in their data.
Subdirectories for each replicate from the backbone tree size labelled 0 through 4. For the smaller four backbone tree sizes there are five replicates, and for the largest there is one replicate.
Each replicate contains 200 text files with one aligned query sequence fragment in fasta format.
keywords:
Fragmentary Sequences; RNAsim
published:
2021-03-05
Beilke, Elizabeth; Blakey, Rachel; O'Keefe, Joy
(2021)
Datasets that accompany Beilke, Blakey, and O'Keefe 2021 publication (Title: Bats partition activity in space and time in a large, heterogeneous landscape; Journal: Ecology and Evolution).
keywords:
spatiotemporal; chiroptera
published:
2022-11-28
Zhang, Na; Sharma, Bijay P.; Khanna, Madhu
(2022)
The compiled datasets include county-level variables used for simulating miscanthus and switchgrass production in 2287 counties across the rainfed US including 5-year (2012-2016) averaged growing season degree days (GDD), 5-year (2012-2016) averaged growing season cumulative precipitation, National Commodity Crop Productivity Index (NCCPI) values, regional dummies (only for miscanthus), the regional-level random effect of the yield response function, N price, land cash rent, the first year fixed cost (only for switchgrass), and separate datasets for simulating an alternative model assuming a constant N rate.
The GAMS codes are used to run the simulation to obtain the main results including the age-varying profit-maximizing N rate, biomass yields, and annual profits for miscanthus and switchgrass production across counties in the rainfed US. The STATA codes are used to merge and analyze simulation results and create summary statistics tables and key figures.
keywords:
Age; Miscanthus; Net present value; Nitrogen; Optimal lifespan; Profit maximization; Switchgrass; Yield; Center for Advanced Bioenergy and Bioproducts Innovation
published:
2025-04-30
This dataset represents the results of targeted eDNA assays via quantitative PCR for two imperiled freshwater species.
keywords:
Environmental DNA, Freshwater Mussel, Salamander, Conventional Surveys, Endangered Species, Habitat Use, Artificial Structures
published:
2023-05-30
Clem, C. Scott; Hart, Lily V.; McElrath, Thomas C.
(2023)
Primary occurrence data for Clem, Hart, & McElrath. 2023. A century of Illinois hover flies (Diptera: Syrphidae): Museum and citizen science data reveal recent range expansions, contractions, and species of potential conservation significance. Included are a license.txt file, the cleaned occurrences from each of the six merged datasets, and a cleaned, merged dataset containing all occurrence records in one spreadsheet, formatted according to Darwin Core standards, with a few extra fields such as GBIF identifiers that were included in some of the original downloads.
keywords:
csv; occurrences; syrphidae; hover flies; flies; biodiversity; darwin core; darwin-core; GBIF; citizen science; iNaturalist
published:
2025-06-16
Blanc-Betes, Elena; Gomez-Casanovas, Nuria; Bernacchi, Carl; Boughton, Elizabeth; Yang, Wendy; DeLucia, Evan
(2025)
Biometric, and ground-based and eddy covariance flux data to investigate the impact of sugarcane expansion across subtropical Florida on the carbon (C) budget over a three-year rotation.
Dataset includes: three-year record of daily fluxes, NPP and SOC input measurements, and estimates of carbon use efficiency and net ecosystem carbon balance in sugarcane and improved and semi-native pastures following pasture conversion to sugarcane.
keywords:
land use change; sugarcane expansion; bioenergy; carbon budget; CUE; NECB
published:
2019-07-11
Daniels, Melissa; Larson, Eric
(2019)
We studied the effect of windstorm disturbance on forest invasive plants in southern Illinois. This data includes raw data on plant abundance at survey points, compiled data used in statistical analyses, and spatial data for surveyed plots and units. This file package also includes a readme.doc file that describes the data in detail, including attribute descriptions.
keywords:
tornado, blowdowns, derecho, invasive plants, Shawnee National Forest, southern Illinois
published:
2025-05-15
Macleod, Brandi M.; Wilkins, Pamela A.; McCoy, Annette; Bishop, Rebecca C.
(2025)
Coagulation testing (VCM Vet™) was performed on 57 horses with acute abdominal pain at admission to the University of Illinois Veterinary Teaching Hospital. Additional clinical data were recorded retrospectively. ROC analysis was performed to determine the optimal number of abnormal coagulation parameters for coagulopathy diagnosis based on survival. General linear regression (GLM) and random forest (RF) classification models were developed to predict short-term survival. A training cohort of 40 horses was used for model development, and model performance was determined using the remaining 17 horses.
keywords:
horse; coagulation; colic; abdominal pain; survival; machine learning; blood clotting; viscoelastic testing
published:
2022-03-30
Tiemann, Jeremy S.; Stodola, Alison P.; Douglass, Sarah A.; Vinsel, Rachel M.; Cummings, Kevin S.
(2022)
This dataset is associated with a larger manuscript published in 2022 in the Illinois Natural History Survey Bulletin to summarize all known records for nonindigenous aquatic mollusks in Illinois, and full sources are referenced within the manuscript. We examined museum holdings, literature accounts, publicly available databases sponsored by the U.S. Geological Survey (USGS) - Nonindigenous Aquatic Species program (http://nas.er.usgs.gov/.) and InvertEBase (invertebase.org). We also included sporadic field survey data of encounters of nonindigenous aquatic species from colleagues within the Illinois Natural History Survey, Illinois Department of Natural Resources, U.S. Fish and Wildlife Service, county forest preserve districts, and other natural resource agencies about their encounters with nonindigenous aquatic mollusk species. Lastly, we examined the role and utility of citizen-science data to document occurrences of nonindigenous aquatic mollusk species. We queried iNaturalist (www.inaturalist.org) for all available nonindigenous freshwater mollusk data for Illinois.
Table heading descriptions (if not intuitive) are: “INHS verified” is whether an INHS staff member verified the record by observing vouchered specimen or photograph; “Source” is where a record was accessed or obtained; “individualCount” is number collected or observed in a record; “MuseumCode” is standard museum abbreviation or acronym; “Institution” is source that housed or reported a record, and this also includes the spelled-out museum code; “Collectors” typically indicates who collected the specimen or voucher; “Lat_Long determined by” denotes whether collection coordinates were stated by the collector or by a curator (using inference from data available); “fieldNumber” typically indicates a unique field number that a collector may have used in the field; “identifiedBy” typically explains who identified a specimen or verified a specimen identification.
keywords:
Illinois; Exotic species; Non-native aquatic species; NAS; Aquatic Invasive Species; AIS; Mollusk
published:
2024-10-16
Smith, Rebecca; Huang, Conghui
(2024)
School testing data were provided by Shield Illinois (ShieldIL), which conducted weekly in-school testing on behalf of the Illinois Department of Public Health (IDPH) for all participating schools in the state excluding Chicago Public Schools. The populations and proportions of students and employees in the studied school districts are reported by Elementary/Secondary Information System (ElSi) database.
keywords:
COVID-19; school testing
published:
2024-04-15
Belmont, Andrew; Gholamalamdari, Omid; Kumar, Pradeep
(2024)
The immunofluorescence and segmented images of three nuclear locales, (nuclear periphery, nuclear speckles, and nucleolus) in four human cells lines (H1-hESC, HCT116, HFFc6, and K562). For each of the cell lines, this dataset includes original, cropped, and binary 4D images (3D + antibody) in addition to max projected thumbnails of cell nuclei.
keywords:
microscopy; immunostaining; segmentation; human nuclei
published:
2024-12-20
Stuchiner, Emily; Xu, Jiacheng; Eddy, William C.; DeLucia, Evan H.; Yang, Wendy
(2024)
All data presented in the manuscript published in the Journal of Geophysical Research-Biogeosciences by Stuchiner et al. 2025, "Hot or not? An evaluation of methods for identifying hot moments of nitrous oxide emissions from soils." This includes hourly N2O flux measurements from 20 autochambers from May 2022 to April 2023 in a maize field in central Illinois, and various metrics used to assess hot moments that are evaluated in the manuscript. Note that chamber 5 for each sampling node is sampled from a deep soil collar (50 cm depth) that excludes roots for the purpose of measuring heterotrophic respiration rates.
keywords:
nitrous oxide; maize; hot moments; outlier detection; soil emissions
published:
2024-06-11
This dataset contains weather data taken at the University of Illinois Urbana-Champaign Energy Farm using automatic sensors and averaged every 15 minutes. Measurements include average air temperature, average relative humidity, average wind speed, maximum wind speed, average wind direction, average photosynthetically active radiation, total precipitation, and average air pressure.
keywords:
air temperature; relative humidity; wind speed; wind direction; photosynthetically active radiation; precipitation; air pressure
published:
2023-12-19
Bush, Daniel; Calla, Bernarda; Berenbaum, May
(2023)
Data for the Appendices of Bush et al. article published in Ecology and Evolution. Contains genomic analysis information for a strain of Aspergillus flavus isolated from bee bread in East Central Illinois.
keywords:
Excel; UIUC; Evolution and Ecology; Aspergillus flavus; genome