Illinois Data Bank Dataset Search Results
Results
published:
2024-07-11
Schneider, Amy; Suski, Cory
(2024)
published:
2020-08-25
Allan, Brian; Fredericks, Lisa
(2020)
The Allan Lab has published a Fluidigm pipeline online. This is the url: https://github.com/HPCBio/allan-fluidigm-pipeline.
This url includes a tutorial for running the pipeline. However it does not have test datasets yet.
This tarball hosted at the Illinois Data Bank is the dataset that completes the github tutorial.
It includes inputs (custom database of tick pathogens and fluidigm raw reads) and output files (tables of samples with taxonomic classifications).
keywords:
custom database of tick pathogens; fluidigm pipeline; fluidigm paired reads; fluidigm tutorial
published:
2020-04-02
Parker, Christine; Meador, Morgan; Hoover, Jeffrey
(2020)
Automatic and manual counts of black flies captured in Illinois.
keywords:
black flies; simuliids; ImageJ; count method
published:
2020-12-01
This is the data set from the published manuscript 'Vertebrate scavenger guild composition and utilization of carrion in an East Asian temperate forest' by Inagaki et al.
keywords:
Japan;Sika Deer
published:
2020-09-27
Data extracted from Text, Tables and Figures of publications in summarizing crop responses to Free-Air CO2 Elevation (FACE)
keywords:
Free Air CO2 Elevation; FACE; wheat, rice, soybean, cassava;
published:
2021-10-15
Jianhao, Peng; Idoia, Ochoa
(2021)
This is the 5 states 5000 cells synthetic expression file we used for validation of SimiC, a single cell gene regulatory network inference method with similarity constraints. Ground truth GRNs are stored in Numpy array format, and expression profiles of all states combined are stored in Pandas DataFrame in format of Pickle files.
keywords:
Numpy array; GRNs; Pandas DataFrame;
published:
2020-02-01
Williams, Benjamin R.; Benson, Thomas J.
(2020)
This data describes habitat use, availability, landscape level influences, and daily movement of dabbling ducks in the Wabash River Valley of southeastern Illinois and southwestern Indiana. It contains triangulated locations of individual ducks, associated habitat assignments of those locations, flood survey data to determine water availability, and randomly generated points to assess landscape level questions.
keywords:
waterfowl; ducks; dabbling; mallard; teal; habitat
published:
2020-06-01
Hoover, Jeffrey P; Davros, Nicole M; Schelsky, Wendy; Brawn, Jeffry D
(2020)
Dataset associated with Hoover et al AUK-19-093 submission: Local conspecific density does not influence reproductive output in a secondary cavity-nesting songbird. Excel CSV with all of the data used in analyses.
Description of variables
YEARS: year
ORDINAL_DATE: number for what day of the year it is with 1 January = 1,……30 December = 365
SITE: acronym for each study site
BOX: unique nest box identifier on each study site
TREAT: designates whether nest box was in a high- or low- nest box density area within each study site
ACTUAL_NO_NEIGHBORS: number of pairs of warblers using a nest box within 200 m of a given pair’s nest box
CLUTCH_SIZE: number of warbler eggs in nest at the onset of incubation
PROWN: number of warbler nestlings once eggs have hatched
PROWF: number of warbler nestlings that fledged out of the nest box
HATCH_SUCCESS: proportion of eggs in the nest that hatched
FLEDG_SUCCESS: proportion of the nestlings that fledged from the nest box
HATCH_SUCCESS2: binary category where “0” indicates there was some, and “1” indicates there was no hatching failure
FLEDG_SUCCESS2: binary category where “0” indicates there was some, and “1” indicates there was no nestling failure (i.e. nestling death)
BHCO_PARASIT2: binary category where “0” indicates no cowbird parasitism, and “1” indicates there was cowbird parasitism
BHCOE: number of cowbird eggs in clutch
BHCOF: number of cowbird nestlings that fledged from the nest
PAIRID: unique number that identifies a male and female warbler that are together at a nest box and this number is the same in a subsequent nesting attempt or year if the same male and female are together again
FEMALE_ID: unique identifier for each female which represents her leg band combination. Each letter represents a band with letters preceding the hyphen being on the right leg and after the hyphen the left leg
FEM_AGE: binary category where “0” indicates a 1-year-old bird and “1” indicates a >1-year-old bird
FEMALE_BREEDING_ATTEMPT: “1” indicates first, “2” indicates second,……..breeding attempt within a given year
SECOND_ATTEMPT: for any female that fledged a brood in a given year, binary category where “0” represents that they did not, and “1” indicates that they did attempt a second brood that year
F_TOT_PROWF: total reproductive output (number of warbler fledglings produced) for a given female in a given year
MALE_ID: unique identifier for each male which represents his leg band combination. Each letter represents a band with letters preceding the hyphen being on the right leg and after the hyphen the left leg
MALE_AGE2: binary category where “0” indicates a 1-year-old bird and “1” indicates a >1-year-old bird
Provisioning_rate: total number of food provisions per nestling per hour by male and female warbler combined
BROOD_MASS: average nestling mass (g) for the brood
BROOD_TARSUS: average nestling tarsus length (mm) for the brood
Brood_condition: unit-less index of nestling condition that uses the residuals of the BROOD_MASS/BROOD_TARSUS relationship
A period (“.”) represents where data were not collected, not available, or because individual nest or female did not qualify for consideration of a category assignment.
An empty cell represents no data available for this particular cell.
keywords:
conspecific density; density dependence; food limitation; hatching success; nestling body condition; nestling provisioning; Prothonotary Warbler; reproductive output
published:
2016-05-16
This dataset contains the protein sequences and trees used to compare Non-Ribosomal Peptide Synthetase (NRPS) condensation domains in the AMB gene cluster and was used to create figure S1 in Rojas et al. 2015. Instead of having to collect representative sequences independently, this set of condensation domain sequences may serve as a quick reference set for coarse classification of condensation domains.
keywords:
NRPS; biosynthetic gene cluster; antimetabolite; Pseudomonas; oxyvinylglycine; secondary metabolite; thiotemplate; toxin
published:
2022-10-13
Xue, Qingquan; Xue, Qingquan; Dietrich, Christopher H.; Dietrich, Christopher H.; Zhang, Yalin; Zhang, Yalin
(2022)
The text file contains the original DNA nucleotide sequence data used in the phylogenetic analyses of Xue et al. (in review), comprising the 13 protein-coding genes and 2 ribosomal gene subunits of the mitochondrial genome. The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The first six lines of the file identify the file as NEXUS, indicate that the file contains data for 30 taxa (species) and 13078 characters, indicate that the characters are DNA sequence, that gaps inserted into the DNA sequence alignment are indicated by a dash, and that missing data are indicated by a question mark. The positions of data partitions are indicated in the mrbayes block of commands for the phylogenetic program MrBayes (version 3.2.6) beginning near the end of the file. The mrbayes block also contains instructions for MrBayes on various non-default settings for that program. These are explained in the Methods section of the submitted manuscript. Two supplementary tables in the provided PDF file provide additional information on the species in the dataset, including the GenBank accession numbers for the sequence data (Table S1) and the DNA substitution models used for each of the individual mitochondrial genes and for different codon positions of the protein-coding genes used for analyses in the programs MrBayes and IQ-Tree (version 1.6.8) (Table S2). Full citations for references listed in Table S1 can be found by searching GenBank using the corresponding accession number. The supplemental tables will also be linked to the article upon publication at the journal website.
keywords:
Hemiptera; phylogeny; mitochondrial genome; morphology; leafhopper
published:
2023-05-08
Stickley, Samuel; Fraterrigo, Jennifer
(2023)
This dataset includes microclimate species distribution models at a ~3 m2 spatial resolution and free-air temperature species distribution models at ~0.85 km2 spatial resolution for three plethodontid salamander species (Demognathus wrighti, Desmognathus ocoee, and Plethodon jordani) across Great Smoky Mountains National Park. We also include heatmaps representing the differences between microclimate and free-air species distribution models and polygon layers representing the fragmented habitat for each species' predicted range. All datasets include predictions for 2010, 2030, and 2050.
keywords:
Ecological niche modeling, microclimate, species distribution model, spatial resolution, range loss, suitable habitat, plethodontid salamanders, montane ecosystems
published:
2019-09-17
Fraebel, David T.; Kuehn, Seppe
(2019)
BAM files for evolved strains from migration rate selection experiments conducted in low viscosity (0.2% w/v) agar plates containing M63 minimal medium with 1mM of mannose, melibiose, N-acetylglucosamine or galactose
published:
2022-03-09
Rapti, Zoi; Rivera Quinones, Vanessa; Stewart Merrill, Tara
(2022)
MATLAB files for the analysis of an ODE model for disease transmission. The codes may be used to find equilibrium points, study transient dynamics, evaluate the basic reproductive number (R0), and simulate the model when parameters depend on the independent variables. In addition, the codes may be used to perform local sensitivity analysis of R0 on the model parameters.
published:
2025-02-06
Ward, Michael; Tyndel, Stephen; Sperry, Jinelle; Katz, Aron
(2025)
Data from a study on the behavior of blue-winged and golden-winged warblers. We were investigating vocalizations and how the species reconizes each other. There are banding, behavioral data from a playback study, and song data.
keywords:
warblers; songs; species recognition
published:
2025-10-10
Clark, Teresa J.; Schwender, Jorg
(2025)
Upregulation of triacylglycerols (TAGs) in vegetative plant tissues such as leaves has the potential to drastically increase the energy density and biomass yield of bioenergy crops. In this context, constraint-based analysis has the promise to improve metabolic engineering strategies. Here we present a core metabolism model for the C4 biomass crop Sorghum bicolor (iTJC1414) along with a minimal model for photosynthetic CO2 assimilation, sucrose and TAG biosynthesis in C3 plants. Extending iTJC1414 to a four-cell diel model we simulate C4 photosynthesis in mature leaves with the principal photo-assimilatory product being replaced by TAG produced at different levels. Independent of specific pathways and per unit carbon assimilated, energy content and biosynthetic demands in reducing equivalents are about 1.3 to 1.4 times higher for TAG than for sucrose. For plant generic pathways, ATP- and NADPH-demands per CO2 assimilated are higher by 1.3- and 1.5-fold, respectively. If the photosynthetic supply in ATP and NADPH in iTJC1414 is adjusted to be balanced for sucrose as the sole photo-assimilatory product, overproduction of TAG is predicted to cause a substantial surplus in photosynthetic ATP. This means that if TAG synthesis was the sole photo-assimilatory process, there could be an energy imbalance that might impede the process. Adjusting iTJC1414 to a photo-assimilatory rate that approximates field conditions, we predict possible daily rates of TAG accumulation, dependent on varying ratios of carbon partitioning between exported assimilates and accumulated oil droplets (TAG, oleosin) and in dependence of activation of futile cycles of TAG synthesis and degradation. We find that, based on the capacity of leaves for photosynthetic synthesis of exported assimilates, mature leaves should be able to reach a 20% level of TAG per dry weight within one month if only 5% of the photosynthetic net assimilation can be allocated into oil droplets. From this we conclude that high TAG levels should be achievable if TAG synthesis is induced only during a final phase of the plant life cycle.
keywords:
Feedstock Production;Modeling
published:
2019-07-04
Sashittal, Palash; El-Kebir, Mohammed
(2019)
Results generated using SharpTNI on data collected from the 2014 Ebola outbreak in Sierra Leone.
published:
2019-12-03
These are the alignments of transcriptome data used for the analysis of members of Heteroptera. This dataset is analyzed in "Deep instability in the phylogenetic backbone of Heteroptera is only partly overcome by transcriptome-based phylogenomics" published in Insect Systematics and Diversity.
keywords:
Heteroptera; Hemiptera; Phylogenomics; transcriptome
published:
2020-10-20
Romero, Ingrid; Urban, Michael A.; Punyasena, Surangi
(2020)
This dataset includes a total of 501 images of 42 fossil specimens of Striatopollis and 459 specimens of 45 extant species of the tribe Amherstieae-Fabaceae. These images were taken using Airyscan confocal superresolution microscopy at 630X magnification (63x/NA 1.4 oil DIC). The images are in the CZI file format. They can be opened using Zeiss propriety software (Zen, Zen lite) or in ImageJ. More information on how to open CZI files can be found here: [https://www.zeiss.com/microscopy/us/products/microscope-software/zen/czi.html#microscope---image-data].
keywords:
Striatopollis catatumbus; superresolution microscopy; Cenozoic; tropics; Zeiss; CZI; striate pollen.
published:
2019-10-18
Supporting secondary data used in a manuscript currently in submission regarding the invasion dynamics of the asian tiger mosquito, Aedes albopictus, in the state of Illinois
keywords:
albopictus;mosquito
published:
2025-01-15
Suski, Cory; Hay, Allison
(2025)
Data was generated from acoustic transmitters implanted in tournament caught and non-angled control largemouth bass across multiple seasons. This data was used to quantify post-release movement, behavior, and mortality in response to angling tournaments at different times of year and varying water temperatures.
published:
2019-07-08
Krichels, Alexander
(2019)
These files contain the data presented in the manuscript entitles "Iron redox reactions can drive microtopographic variation in upland soil carbon dioxide and nitrous oxide emissions".
keywords:
Iron; redox; carbon dioxide; nitrous oxide; chemodenitrification; Feammox; dissimilatory iron reduction; upland soils; flooding; global change
published:
2020-10-15
Khanna, Madhu; Wang, Weiwei; Wang, Michael
(2020)
This dataset consists of various input data that are used in the GAMS model. All the data are in the format of .inc which can be read within GAMS or Notepad. Main data sources include: acreage data (acre), crop budget data ($/acre), crop yield data (e.g. bushel/acre), Soil carbon sequestration data (KgCO2/ha/yr). Model details can be found in the "Assessing the Additional Carbon Savings with Biofuel" and GAMS model package.
## File Description
(1) GAMS Model.zip: This includes all the input files and scripts for running the model
(2) Table*.csv: These files include the data from the tables in the manuscript
(3) Figure2_3_4.csv: This contains the data used to create the figures in the manuscript
(4) BaselineResults.csv: This includes a summary of the model results.
(5) SensitivityResults_*.csv: Model results from the various sensitivity analyses performed
(6) LUC_emission.csv: land use change emissions by crop reporting district for changes of pasturelands to annual crops.
keywords:
Biogenic carbon intensity; Corn ethanol; Economic model; Dynamic optimization; Anticipated baseline approach; Life cycle carbon intenisty
published:
2024-03-25
Xia, Yushu; Kwon, Hoyoung; Wander, Michelle
(2024)
This accompanying study is published under the title "Estimating soil N2O emissions induced by organic and inorganic fertilizer inputs using a Tier-2, regression-based meta-analytic approach for U.S. agricultural lands" at Science of the Total Environment. The study is authored by Dr. Yushu Xia, Dr. Hoyoung Kwon, and Dr. Michelle Wander. The DOI for this study is <a href="https://doi.org/10.1016/j.scitotenv.2024.171930">https://doi.org/10.1016/j.scitotenv.2024.171930</a>.
keywords:
soil; nitrous oxide; agriculture; fertilizers; meta-analysis
published:
2025-07-30
Skorupa, A. J.; Bried, J. T.
(2025)
This dataset includes three data files for linking species' climate sensitivity, trait combinations, and listing status. It contains species occurrence data within Hydrologic Unit Code 12 (HUC12) watersheds, along with trait information and Rarity and Climate Sensitivity (RCS) index scores for lotic caddisflies, stoneflies, mussels, dragonflies, and crayfish across all Midwest Climate Adaptation Science Center states: Minnesota, Iowa, Missouri, Wisconsin, Illinois, Indiana, Michigan, and Ohio. For mussels, the geographic scope is expanded to include all Midwest Regional Species of Greatest Conservation Need (RSGCN) states—North Dakota, South Dakota, Nebraska, Kansas, and Kentucky. However, occurrence data for mussels is not included due to data-sharing agreements. Metadata are included with each data file. Please refer to the associated manuscript for original data sources, trait references, and details on the RCS index calculation.
keywords:
climate sensitivity; conservation status; traits; aquatic invertebrates; Midwest
published:
2019-08-05
Skinner, Rachel; Dietrich, Christopher; Walden, Kimberly; Gordon, Eric; Sweet, Andrew; Podsiadlowski, Lars; Petersen, Malte; Simon, Chris; Takiya, Daniela; Johnson, Kevin
(2019)
The data in this directory corresponds to:
Skinner, R.K., Dietrich, C.H., Walden, K.K.O., Gordon, E., Sweet, A.D., Podsiadlowski, L., Petersen, M., Simon, C., Takiya, D.M., and Johnson, K.P.
Phylogenomics of Auchenorrhyncha (Insecta: Hemiptera) using Transcriptomes: Examining Controversial Relationships via Degeneracy Coding and Interrogation of Gene Conflict.
Systematic Entomology.
Correspondance should be directed to: Rachel K. Skinner, rskinn2@illinois.edu
If you use these data, please cite our paper in Systematic Entomology.
The following files can be found in this dataset:
Amino_acid_concatenated_alignment.phy: the amino acid alignment used in this analysis in phylip format.
Amino_acid_raxml_partitions.txt (for reference only): the partitions for the amino acid alignment, but a partitioned amino acid analysis was not performed in this study.
Amino_acid_concatenated_tree.newick: the best maximum likelihood tree with bootstrap values in newick format.
ASTRAL_input_gene_trees.tre: the concatenated gene tree input file for ASTRAL
README_pie_charts.md: explains the the scripts and data needed to recreate the pie charts figure from our paper. There is also another
Corresponds to the following files:
ASTRAL_species_tree_EN_only.newick: the species tree with only effective number (EN) annotation
ASTRAL_species_tree_pp1_only.newick: the species tree with only the posterior probability 1 (main topology) annotation
ASTRAL_species_tree_q1_only.newick: the species tree with only the quartet scores for the main topology (q1)
ASTRAL_species_tree_q2_only.newick: the species tree with only the quartet scores for the first alternative topology (q2)
ASTRAL_species_tree_q3_only.newick: the species tree with only the quartet scores for the second alternative topology (q3)
print_node_key_files.py: script needed to create the following files:
node_keys.key: text file with node IDs and topologies
complete_q_scores.key: text file with node IDs multiplied q scores
EN_node_vals.key: text file with node IDs and EN values
create_pie_charts_tree.py: script needed to visualize the tree with pie charts, pp1, and EN values plotted at nodes
ASTRAL_species_tree_full_annotation.newick: the species tree with full annotation from the ASTRAL analysis.
NOTE: It may be more useful to examine individual value files if you want to visualize the tree,
e.g., in figtree, since the full annotations are extensive and can make viewing difficult.
Complete_NT_concatenated_alignment.phy: the nucleotide alignment that includes unmodified third codon positions. The alignment is in phylip format.
Complete_NT_raxml_partitions.txt: the raxml-style partition file of the nucleotide partitions
Complete_NT_concatenated_tree.newick: the best maximum likelihood tree from the concatenated complete analysis NT with bootstrap values in newick format
Complete_NT_partitioned_tree.newick: the best maximum likelihood tree from the partitioned complete NT analysis with bootstrap values in newick format
Degeneracy_coded_nt_concatenated_alignment.phy: the degeneracy coded nucleotide alignment in phylip format
Degeneracy_coded_nt_raxml_partitions.txt: the raxml-style partition file for the degeneracy coded nucleotide alignment
Degeneracy_coded_nt_concatenated_tree.newick: the best maximum likelihood tree from the degeneracy-coded concatenated analysis with bootstrap values in newick format
Degeneracy_coded_nt_partitioned_tree.newick: the best maximum likelihood tree from the degeneracy-coded partitioned analysis with bootstrap values in newick format
count_ingroup_taxa.py: script that counts the number of ingroup and/or outgroup taxa present in an alignment
keywords:
Auchenorrhyncha; Hemiptera; alignment; trees