Illinois Data Bank Dataset Search Results
Results
published:
2023-12-18
Johnson, Claire A.; Benson, Thomas J.
(2023)
Data in this publication were used to examine the effects of habitat and landscape-level covariates on occupancy and interannual dynamics and the effects of environmental factors on detection of Black-billed Cuckoos and Yellow-billed Cuckoos. Data were collected between 2019-2020 in northern Illinois, USA. Procedures were approved by the Illinois Institutional Animal Care and Use Committee (IACUC), protocol no. 19086.
keywords:
Black-billed Cuckoo; habitat use; multi-scale; occupancy dynamics; turnover; Yellow-billed Cuckoo
published:
2023-01-01
Cao, Yanghui; Dietrich, Christopher H.; Kits, Joel; Dmitriev, Dmitry A.; Xu, Ye; Huang, Min
(2023)
The following files were used to reconstruct the phylogeny of the leafhopper subfamily Typhlocybinae, using IQ-TREE v1.6.12 and ASTRAL v 4.10.5.
<b>1) Taxon_sampling.csv:</b> contains the sample IDs (1st column) and the taxonomic information (2nd column). Sample IDs were used in the alignment files and partition files.
<b>2) concatenated_nt_complete.phy:</b> a complete concatenated nucleotide dataset used for the maximum likelihood analysis by IQ-TREE v1.6.12. The file lists the sequences of 248 samples with 154,992 nucleotide positions (intron included) from 665 loci. Hyphens are used to represent gaps.
<b>3) concatenated_nt_complete_partition.nex:</b> the partitioning schemes for concatenated_nt_complete.phy. The file partitions the 154,992 nucleotide characters into 426 character sets, and defines the best substitution model for each character set.
<b>4) concatenated_cds_complete.phy:</b> a complete concatenated coding DNA sequence dataset used for the maximum likelihood analysis by IQ-TREE v1.6.12. The file lists the sequences of 248 samples with 153,525 nucleotide positions (intron excluded) from 665 loci. Hyphens are used to represent gaps.
<b>5) concatenated_cds_complete_partition.nex:</b> the partitioning schemes for concatenated_cds_complete.phy. The file partitions the 153,525 nucleotide characters into 426 character sets, and defines the best substitution model for each character set.
<b>6) concatenated_nt_reduced.phy:</b> a reduced concatenated nucleotide dataset used for the maximum likelihood analysis by IQ-TREE v1.6.12. The file lists the sequences of 248 samples with 95,076 nucleotide positions (intron included) from 374 loci. Hyphens are used to represent gaps.
<b>7) concatenated_nt_reduced_partition.nex:</b> the partitioning schemes for concatenated_nt_reduced.phy. The file partitions the 95,076 nucleotide characters into 312 character sets, and defines the best substitution model for each character set.
<b>8) concatenated_aa_complete.phy:</b> a complete concatenated amino acid dataset used for the maximum likelihood analysis by IQ-TREE v1.6.12, corresponding to concatenated_cds_complete.phy. The file lists the sequences of 248 samples with 51,175 amino acid positions from 665 loci. Hyphens are used to represent gaps.
<b>9) concatenated_aa_complete_partition.nex:</b> the partitioning schemes for concatenated_aa_complete.phy. The file partitions the 51,175 amino acid characters into 426 character sets, and defines the best substitution model for each character set.
<b>10) concatenated_aa_reduced.phy:</b> a reduced concatenated amino acid dataset used for the maximum likelihood analysis by IQ-TREE v1.6.12, corresponding to concatenated_nt_reduced.phy. The file lists the sequences of 248 samples with 31,384 amino acid positions from 374 loci. Hyphens are used to represent gaps.
<b>11) concatenated_aa_reduced_partition.nex:</b> the partitioning schemes for concatenated_aa_reduced.phy. The file partitions the 31,384 amino acid characters into 312 character sets, and defines the best substitution model for each character set.
<b>12) Individual_gene_alignment.zip:</b> contains 426 FASTA files, each one is an alignment for a gene. Hyphens are used to represent gaps. These files were used to construct gene trees using IQ-TREE v1.6.12, followed by multispecies coalescent analysis using ASTRAL v 4.10.5 based the consensus trees with a minimum average bootstrap value of 70.
keywords:
Auchenorrhyncha, Cicadomorpha, Membracoidea, anchored hybrid enrichment
published:
2023-07-28
Njuguna, Joyce; Clark, Lindsay; Lipka , Alexander; Anzoua, Kossonou; Bagmet, Larisa; Chebukin, Pavel; Dwiyanti, Maria; Dzyubenko, Elena; Dzyubenko, Nicolay; Ghimire, Bimal; Jin, Xiaoli; Johnson, Douglas; Nagano, Hironori; Peng, Junhua; Petersen, Karen; Sabitov, Andrey; Seong, Eun; Yamada, Toshihiko; Yoo, Ji; Yu, Chang; Zhao, Hu; Long, Stephen; Sacks, Erik
(2023)
The dataset is for a study conducted to understand genome-wide association (GWA) and genomic prediction of biomass yield and 14 yield-components traits in Miscanthus sacchariflorus. We evaluated a diversity panel with 590 accessions of M. sacchariflorus grown across four years in one subtropical and three temperate locations and genotyped with 268,109 single nucleotide polymorphisms (SNPs).
keywords:
Miscanthus sacchariflorus; genome-wide association analysis; genomic prediction; bioenergy; biomass
published:
2023-07-26
Kantola, Ilsa B; Blanc-Betes, Elena; Masters, Michael; Chang, Elliot; Marklein, Alison; Moore, Caitlin; von Haden, Adam; Bernacchi, Carl; Wolf, Adam; Epihov, Dimitar; Beerling, David; DeLucia, Evan
(2023)
This data set contains data used for “Improved Net Carbon Budgets in the US Midwest through Direct Measured Impacts of Enhanced Weathering.” Data include biomass, soil bulk densities, soil respiration measurements, soil lanthanide element analysis, plant tissue analysis for major cations, and eddy covariance fluxes.
keywords:
agriculture; bioenergy crop; carbon budget; eddy covariance; net ecosystem carbon balance; net primary production; soil respiration; enhanced weathering; carbon dioxide removal; Illinois
published:
2022-12-11
The data are original electron micrographs from the lab of the late Dr. Burt Endo of the USDA. These data were digitized from photographic prints and glass plate negatives at 600 DPI as 16 bit TIFF files. This fourth version added 6 new ZIP files from the Endo data collection. "Endo folder database.xlsx" is updated to reflect the addition. Information in "Readme_FileNameFormatting.docx" remains the same as in V3.
keywords:
Heterodera glycines; Meloidogyne incognita; Burt Endo; nematode
published:
2023-12-06
Starbuck, Clarissa; DeSchepper, Logan; Hoggatt, Meredith; O'Keefe, Joy
(2023)
This dataset accompanies an article published in the journal Bioacoustics: "Tradeoffs in sound quality and cost for passive acoustic devices", https://doi.org/10.1080/09524622.2023.2290715. The dataset contains measurements for acoustic call files for free-flying bats simultaneously recorded on both Audiomoth and Anabat Swift passive acoustic recording devices in a conservation area in northeastern Missouri, USA. We paired calls from the two devices and compared indicators of recording quality measured in a proprietary program (Bat Call Identification Software). The dataset also contains a file enumerating the proportions of calls classified as low frequency, mid frequency, or Myotis (three phonic groups) for each type of recording device. The data were used to compare the quality and sensitivity of the two devices. The scripts for modeling procedures and figures are included in the dataset.
keywords:
Bats; echolocation; passive acoustic monitoring; sensors
published:
2023-12-08
Preza Fontes, Giovani; Greer, Kristin; Pittelkow, Cameron
(2023)
A two-year field study was conducted to test the hypothesis that biochar application increases inorganic soil N availability during maize growth, leading to higher grain yields and N recovery efficiency while reducing the risk of N leaching following harvest. Four N fertilizer rates (0, 90, 179, and 269 kg ha-1 as urea ammonium nitrate solution) were applied with or without biochar (10 Mg ha-1) before maize planting each year. This dataset contains selected summary statistics (average and standard deviation) on soil and plant measurements. This file package also includes a readme.txt file that describes the data in detail, including attribute descriptions.
keywords:
biochar; nitrogen fertilizer; nitrogen use efficiency; corn yield, soil inorganic nitrogen; nitrate leaching
published:
2023-07-05
Dalling, James William; Norden, Natalia
(2023)
Complete soils dataset for the La Planada forest dynamics plot associated with publication: John et al. (2007) "Soil nutrients influence the spatial distributions of tropical tree species" PNAS 104:864-869 www.pnas.org/cgi/doi/10.1073/pnas.0604666104
keywords:
tropical forest soil; montane forest; cation availability; spatial distribution of tree species
published:
2024-02-25
Coshic, Kush; Maffeo, Christopher; Winogradoff, David; Aksimentiev, Aleksei
(2024)
Simulation trajectory data and scripts for Nature manuscript "The structure and physical properties of a packaged bacteriophage particle" that reports the all-atom structure of a complete HK97 virion, including its entire 39,732 base pair genome, obtained through multi-resolution simulations.
keywords:
Virus capsid; Bacteriophage packaging; Multiresolution simulations; all-atom MD simulation
published:
2024-01-31
Wang, Xiudan; Dietrich, Christopher; Zhang, Yalin
(2024)
The included files were used to reconstruct the phylogeny of Coelidiinae using combined morphological and molecular data, estimate divergence times and reconstruct ancestral biogeographic areas as described in the manuscript submitted for publication. The file “Coelidiinae_dna_morph_combined.nex” is a text file in standard NEXUS format used by various phylogenetic analysis programs. This file includes the aligned and concatenated nucleotide sequences or five gene regions (mitochondrial COI and 16S, and nuclear 28S D-2, histone H3, histone H2A and wingless) indicated by standard “ACGT” nucleotide symbols with missing data indicated by “?”, and morphological character data as defined in Table S3 used in the analyses. The data partitions are indicated toward the end of the file by ranges of numbers (“charset Subset 1 – 4” for the DNA data and “charset morph” for the morphological characters) followed by commands for the phylogenetic analysis program MrBayes that specify the model settings for each data partition. Detailed data on species included (as rows) in the dataset, including collection localities and GenBank accession numbers are provided in the Table_S1_Specimen_information.csv file. The file "TablesS2-S4.pdf" lists the primers used for polymerase chain reaction amplification, the list of morphological character definitions, and the morphological character matrix. The file “RASP_Distribution.csv” contains a list of the species included in the phylogenetic dataset (first column) and a code (second column) indicating their distributions as follows: (A) Oriental, (B) Palaearctic, (C) Australian, (D) Afrotropical, (E) Neotropical, and (F) Nearctic. More than one letter indicates that the species occurs in more than one region. The file "infile_for_BEAST.txt" is the input file in XML format used for the molecular divergence time analysis using the program BEAST (Bayesian Evolutionary Analysis by Sampling Trees) as described in the Methods section of the manuscript. This file includes comments that document the steps of the analysis.
keywords:
leafhopper; phylogeny; DNA sequence; insect; timetree; biogeography
published:
2020-12-29
Viana, Jéssica; Turner, Benjamin; Dalling, James
(2020)
Three datasets: species_abundance_data, species_traits, and environmental_data. The three datasets were collected in the Fortuna Forest Reserve (8°45′ N, 82°15′ W) and Palo Seco Protected Forest (8°45′ N, 82°13′ W) located in western Panama. The two reserves support humid to super-humid rainforests, according to Holdridge (1947). The species_abundance_data and species_traits datasets were collected across 15 subplots of 25 m2 in 12 one-hectare permanent plots distributed across the two reserves. The subplots were spaced 20 m apart along three 5 m wide transects, each 30 m apart. Please read Prada et al. (2017) for details on the environmental characteristics of the study area.
Prada CM, Morris A, Andersen KM, et al (2017) Soils and rainfall drive landscape-scale changes in the diversity and functional composition of tree communities in a premontane tropical forest. J Veg Sci 28:859–870. https://doi.org/10.1111/jvs.12540
keywords:
functional traits; plants; ferns; environmental data; Fortuna; species data; community ecology
published:
2021-06-08
Todd, Jones; Michael, Ward
(2021)
Dataset associated with Jones and Ward JAE-2020-0031.R1 submission: Pre-to post-fledging carryover effects and the adaptive significance of variation in wing development for juvenile songbirds. Excel CSV files with data used in analyses and file with descriptions of each column. The flight ability variable in this dataset was derived from fledgling drop tests, examples of which can be found in the related dataset: Jones, Todd M.; Benson, Thomas J.; Ward, Michael P. (2019): Flight Ability of Juvenile Songbirds at Fledgling: Examples of Fledgling Drop Tests. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2044905_V1.
keywords:
fledgling; wing development; life history; adaptive significance; post-fledging; songbirds
published:
2023-05-08
Stickley, Samuel; Fraterrigo, Jennifer
(2023)
This dataset includes microclimate species distribution models at a ~3 m2 spatial resolution and free-air temperature species distribution models at ~0.85 km2 spatial resolution for three plethodontid salamander species (Demognathus wrighti, Desmognathus ocoee, and Plethodon jordani) across Great Smoky Mountains National Park. We also include heatmaps representing the differences between microclimate and free-air species distribution models and polygon layers representing the fragmented habitat for each species' predicted range. All datasets include predictions for 2010, 2030, and 2050.
keywords:
Ecological niche modeling, microclimate, species distribution model, spatial resolution, range loss, suitable habitat, plethodontid salamanders, montane ecosystems
published:
2020-12-02
Yang, Pan; Cai, Ximing; Khanna, Madhu
(2020)
The dataset includes the survey results about farmers’ perceptions of marginal land availability and the likelihood of a land pixel being marginal based on a machine learning model trained from the survey.
Two spreadsheet files are the farmer and farm characteristics (marginal_land_survey_data_shared.xlsx), and the existing land use of marginal lands (land_use_info_sharing.xlsx).
<b>Note:</b> the blank cells in these two spreadsheets mean missing values in the survey response.
The GeoTiff file includes two bands, one the marginal land likelihood in the Midwestern states (0-1), the other the dominant reason of land marginality (0-5; 0 for farm size, 1 for growing season precipitation, 2 for root zone soil water capacity, 3 for average slope, 4 for growing season mean temperature, and 5 for growing season diurnal range of temperature). To read the data, please use a GIS software such as ArcGIS or QGIS.
keywords:
marginal land; survey
published:
2022-09-29
Merrill, Loren; Jones, Todd; Brawn, Jeffrey; Ward, Michael
(2022)
Dataset associated with Merrill et al. ECE-2021-05-00793.R1 submission: Early life patterns of growth are linked to levels of phenotypic trait covariance and post-fledging mortality across avian species. Excel CSV files with all of the data used in analyses and file with descriptions of each column.
keywords:
canalization; developmental flexibility; early-life stress; nest predation; phenotypic correlation; trait covariance
published:
2022-11-07
Jones, Todd; Di Giovanni, Alexander; Hauber, Mark; Ward, Michael
(2022)
Dataset associated with Jones et al. ECY22-0118.R3 submission: Ontogenetic effects of brood parasitism by the Brown-headed Cowbird on host offspring. Excel CSV files with all of the data used in analyses and file with descriptions of each column.
keywords:
brood parasitism; cowbirds; host-parasite systems; ontogeny; post-fledging; songbirds
published:
2022-08-25
Souza-Cole, Ian; Ward, Michael; Rebecca, Mau; Jeffrey, Foster; Benson, Thomas
(2022)
Data in this publication were used to analyze the factors that influence the abundance of eastern whip-poor-wills in the Midwest and to describe the diet of this species. These data were collected in Illinois in 2019 and 2020. Procedures were approved by the Illinois Institutional Animal Care and Use Committee (IACUC), protocol no. 19006
keywords:
eastern whip-poor-will; Antrostomus vociferus; abundance; moths; nightjars; Lepidoptera; metabarcoding
published:
2020-11-06
Sashittal, Palash; Zhang, Chuanyi; El-Kebir, Mohammed
(2020)
This data contains bam files and transcripts in the simulated instances generated for the paper 'JUMPER: Discontinuous Transcript Assembly in SARS-CoV-2' submitted for RECOMB 2021. The folder 'bam' contained the simulated bam files aligned using STAR wile the reads were generated using the method polyester
Note: in the readme file, close to the end of the document, please ignore this sentence: 'Those files can be opened by using [name of software].'
keywords:
transcript assembly; SARS-CoV-2; discontinuous transcription; coronaviruses
published:
2022-08-23
Seyfried, Georgia; Corrales, Adriana; Kent, Angela; Dalling, James; Yang, Wendy
(2022)
This dataset contains soil chemical properties used to variation in soil fungal communities beneath Oreomunnea mexicana trees in the manuscript "Watershed-scale variation in potential fungal community contributions to ectomycorrhizal biogeochemical syndromes"
keywords:
Acid-base chemistry; Ectomycorrhizal fungi; Exploration type; Nitrogen cycling; Nitrogen isotopes; Plant-soil (below-ground) interactions; Saprotrophic fungi; Tropical forest
published:
2022-11-07
Sweedler, Jonathan; Castro, Daniel
(2022)
The dataset contains the data and code for Single-cell and Subcellular Analysis of freshly isolated cultured, uncultured P1 cells and uncultured Old cells. The .csv file named 'MagLab20220721' contains the sample and intensity information with the columns referring to the m/z values and the rows being the samples. The 'MagLabNameINdex.csv' file contains all the index information. The file named '20220721_MagLab.spydata' contains the loaded data of both the two previous files in Spyder. The .mat file contains the aligned data for the three groups.
keywords:
Single-cell; Subcellular; Mass Spectrometry; MALDI; Lipidomics; FTICR; 21 T
published:
2023-12-23
Rodriguez-Zas, Sandra
(2023)
Supplemental document corresponding to a submission to Physiological Genomics (Data supplements and source materials must now be deposited in a community-recognized data repository or to a generalist public access repository if no community resource is available. See "Author/Production Requirements" for more information.) https://pg.msubmit.net/
keywords:
Supplemental, Physiological Genomics
published:
2022-09-07
Long, Stephen P.; Wang, Yu; Stutz, Samantha S.
(2022)
We developed a new application of isotopic gas exchange which couples a tunable diode laser absorption spectroscope (TDL) with a leaf gas exchange system, analyzing leakiness through induction of C4 photosynthesis on dark to high-light transitions. The youngest fully expanded leaf was measured on 40-45 day-old maize(B73) and sorghum (Tx430).
Detail definition of each variable in raw Li-6400XT and Li-6800 (in "Original_data_AND_Data_processing_code.zip") is summarized in: <a href="https://www.licor.com/env/support/LI-6800/topics/symbols.html#const">https://www.licor.com/env/support/LI-6800/topics/symbols.html#const</a>
keywords:
leakiness; bundle sheath leakage; C4 photosynthesis; photosynthetic induction; non-steady-state photosynthesis; carbon isotope discrimination; photosynthetic efficiency; corn
published:
2022-11-28
Zhang, Na; Sharma, Bijay P.; Khanna, Madhu
(2022)
The compiled datasets include county-level variables used for simulating miscanthus and switchgrass production in 2287 counties across the rainfed US including 5-year (2012-2016) averaged growing season degree days (GDD), 5-year (2012-2016) averaged growing season cumulative precipitation, National Commodity Crop Productivity Index (NCCPI) values, regional dummies (only for miscanthus), the regional-level random effect of the yield response function, N price, land cash rent, the first year fixed cost (only for switchgrass), and separate datasets for simulating an alternative model assuming a constant N rate.
The GAMS codes are used to run the simulation to obtain the main results including the age-varying profit-maximizing N rate, biomass yields, and annual profits for miscanthus and switchgrass production across counties in the rainfed US. The STATA codes are used to merge and analyze simulation results and create summary statistics tables and key figures.
keywords:
Age; Miscanthus; Net present value; Nitrogen; Optimal lifespan; Profit maximization; Switchgrass; Yield; Center for Advanced Bioenergy and Bioproducts Innovation
published:
2022-08-06
Carson, Dawn; Kopsco, Heather; Gronemeyer, Peg; Mateus-Pinilla, Nohra; Smith, Genee; Sandstrom, Emma; Smith, Rebecca
(2022)
An online knowledge, attitudes, and practices survey on ticks and tick-borne diseases was distributed to medical professionals in Illinois during summer 2020 to fall 2021. These are the raw data associated with that survey and the survey questions used. Age, gender, and county of practice have been removed for identifiability. We have added calculated values (columns 165 to end), including: the tick knowledge score, TBD knowledge score, and total knowledge score, which are the sum of the total number of correct answers in each category, and score percent, which are the proportion of correct answers in each category; region, which is determined from the county of practice; TBD relevant practice, which separates the practice variable into TBD primary, secondary, and non-responders; and several variables which group categories.
keywords:
ticks; medicine; tick-borne disease; survey