Illinois Data Bank
Deposit Dataset
Find Data
Policies
Guides
Contact Us
Log in with NetID
Toggle navigation
Illinois Data Bank
Deposit Dataset
Find Data
Policies
Guides
Contact Us
Log in with NetID
Displaying 76 - 100 of 738 in total
<
1
2
3
4
5
6
7
8
9
…
29
30
>
25 per page
50 per page
Show All
Go
Clear Filters
Generate Report from Search Results
Subject Area
Life Sciences (395)
Social Sciences (139)
Physical Sciences (106)
Technology and Engineering (68)
Uncategorized
Arts and Humanities (1)
Funder
Other (227)
U.S. National Science Foundation (NSF) (211)
U.S. Department of Energy (DOE) (76)
U.S. National Institutes of Health (NIH) (73)
U.S. Department of Agriculture (USDA) (50)
Illinois Department of Natural Resources (IDNR) (20)
U.S. Geological Survey (USGS) (7)
U.S. National Aeronautics and Space Administration (NASA) (6)
Illinois Department of Transportation (IDOT) (4)
U.S. Army (2)
Publication Year
2024 (110)
2021 (108)
2022 (106)
2020 (96)
2023 (75)
2019 (72)
2018 (61)
2025 (39)
2017 (36)
2016 (30)
2009 (1)
2011 (1)
2012 (1)
2014 (1)
2015 (1)
License
CC0 (403)
CC BY (312)
custom (23)
Illinois Data Bank Dataset Search Results
Dataset Search Results
published: 2022-07-25
Jett, Jacob (2022): SBKS - Species Noisy Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7146216_V1
This dataset is derived from the raw dataset (https://doi.org/10.13012/B2IDB-4950847_V1) and collects entity mentions that were manually determined to be noisy, non-species entities.
keywords:
synthetic biology; NERC data; species mentions, noisy entities
published: 2022-07-25
Jett, Jacob (2022): SBKS - Species Not Found Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5491578_V1
This dataset is derived from the raw entity mention dataset (https://doi.org/10.13012/B2IDB-4950847_V1) for species entities and represents those that were determined to be species (i.e., were not noisy entities) but for which no corresponding concept could be found in the NCBI taxonomy database.
keywords:
synthetic biology; NERC data; species mentions, not found entities
published: 2022-07-25
Jett, Jacob (2022): SBKS - Chemical - Cleaned & Grounded Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3396059_V1
This dataset represents the results of manual cleaning and annotation of the entity mentions contained in the raw dataset (https://doi.org/10.13012/B2IDB-4163883_V1). Each mention has been consolidated and linked to an identifier for a matching concept from the NCBI's taxonomy database.
keywords:
synthetic biology; NERC data; chemical mentions; cleaned data; ChEBI ontology
published: 2022-07-25
Jett, Jacob (2022): SBKS - Chemical Noisy Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7228767_V1
This dataset is derived from the raw dataset (https://doi.org/10.13012/B2IDB-4163883_V1) and collects entity mentions that were manually determined to be noisy, non-chemical entities.
keywords:
synthetic biology; NERC data; chemical mentions, noisy entities
published: 2022-07-25
Jett, Jacob (2022): SBKS - Chemical Not Found Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4570128_V1
This dataset is derived from the raw entity mention dataset (https://doi.org/10.13012/B2IDB-4163883_V1) for checmical entities and represents those that were determined to be chemicals (i.e., were not noisy entities) but for which no corresponding concept could be found in the ChEBI ontology.
keywords:
synthetic biology; NERC data; chemical mentions, not found entities
published: 2022-07-25
Jett, Jacob (2022): SBKS - Genes Raw Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3887275_V1
A set of gene and gene-related entity mentions derived from an NERC dataset analyzing 900 synthetic biology articles published by the ACS. This data is associated with the Synthetic Biology Knowledge System repository (https://web.synbioks.org/). The data in this dataset are raw mentions from the NERC data.
keywords:
synthetic biology; NERC data; gene mentions
published: 2022-09-16
Zhong, Jia; Khanna, Madhu (2022): Model Code and Data for "Assessing the Efficiency Implications of Renewable Fuel Policy Design in the United States". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6803176_V1
This dataset contains model code (including input data) to replicate the outcomes for "Assessing the Efficiency Implications of Renewable Fuel Policy Design in the United States". The model consists of: (1) The replication codes and data for the model. To run the model, using GAMS to run the "Models.gms" file.
keywords:
Renewable Fuel Standard; Nested structure; cellulosic waiver credit; RIN
published: 2022-08-22
Pastrana-Otero, Isamar; Majumdar, Sayani; Kraft, Mary L. (2022): Raman spectra of individual, living hematopoietic stem and progenitor cells. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9950442_V1
This dataset contains Raman spectra, each acquired from an individual, living, primary murine cell belonging to one of the six most immature hematopoietic cell populations found in the body: hematopoietic stem cell (HSC), mutipotent progenitor 1 (MPP1), multipotent progenitor 2 (MPP2), multipotent progenitor 3 (MPP3), common lymphoid progenitor, common myeloid progenitor (CLP). These spectra are useful for identifying spectral signatures that are characteristic of each hematopoietic stem or early progenitor cell population. *NOTE: __MACOSX folder and files start with “._[file name]” found in "Raman spectra of single cells text files.zip" were created by the computer operation system, in unreadable format, which are not part of the data and can be removed/ignored when using the data.
keywords:
Raman spectroscopy; single-cell spectrum; hematopoietic cell; hematopoietic stem cell; multipotent progenitor cell; common myeloid progenitor; common lymphoid progenitor
published: 2022-09-07
Jiang, Chongya; Guan, Kaiyu; Khanna, Madhu; Chen, Luoye; Peng, Jian (2022): Data for Assessing Marginal Land Availability Based on Land Use Change Information in the Contiguous United States. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6395937_V1
The availability of economically marginal land for energy crops is identified using the Cropland Data Layer and other soil, wind, climate data resources. All data are recognized on a 30m spatial resolution across the continental United States.
keywords:
marginal land; biofuel production; remote sensing; land use change; Cropland Data Layer
published: 2023-01-12
Mischo, William; Schlembach, Mary C. (2023): Processing and Pearson Correlation Scripts for the C&RL Article on the Relationships between Publication, Citation, and Usage Metrics at the University of Illinois at Urbana-Champaign Library . University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0931140_V1
These processing and Pearson correlational scripts were developed to support the study that examined the correlational relationships between local journal authorship, local and external citation counts, full-text downloads, link-resolver clicks, and four global journal impact factor indices within an all-disciplines journal collection of 12,200 titles and six subject subsets at the University of Illinois at Urbana-Champaign (UIUC) Library. This study shows strong correlations in the all-disciplines set and most subject subsets. Special processing scripts and web site dashboards were created, including Pearson correlational analysis scripts for reading values from relational databases and displaying tabular results. The raw data used in this analysis, in the form of relational database tables with multiple columns, is available at <a href="https://doi.org/10.13012/B2IDB-6810203_V1">https://doi.org/10.13012/B2IDB-6810203_V1</a>.
keywords:
Pearson Correlation Analysis Scripts; Journal Publication; Citation and Usage Data; University of Illinois at Urbana-Champaign Scholarly Communication
published: 2022-09-19
Detmer, Thomas (2022): ShelbyvilleZooplankton. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2467544_V1
Data characterize zooplankton in Shelbyville Reservoir, Illinois, United States of America. Zooplankton were sampled with a conical zooplankton net (0.5m diameter mouth) when water was deeper than 2 m and by grab sample when water was shallower. Zooplankton samples were concentrated and subsampled with a Hensen-Stempel pipette following protocols described in Detmer et al. (2019). Zooplankton were identified to the lowest feasible taxonomic unit according to Pennak (1989) and Thorp and Covich (2001) and were enumerated in a 1 mL Sedgewick-Rafter cell. Subsamples were analyzed until at least 200 individuals were enumerated from each site.were counted across for each of the three main taxonomic groups (cladocerans, copepods, and rotifers). Given the variation in zooplankton concentrations at each site, this process often lead to far more than 200 individuals being counted (x̄ = 269, min = 200, max = 487). A summary of the sample size from each site can be found in Supplementary Table S2. Abundances were corrected for volume of water filtered. For rare taxa (< 20 individuals per sample), all individuals were measured for length. For abundant taxa, length measurements were collected on the first 20 organisms of each abundant taxon encountered in a subsample. Dry mass was calculated from equations for microcrustaceans, rotifers, and Chaoborus sp. (Rosen ,1981; Botrell et al., 1976; Dumont and Balvay, 1979).
keywords:
Reservoir; Zooplankton
published: 2022-10-13
Xue, Qingquan; Xue, Qingquan; Dietrich, Christopher H.; Dietrich, Christopher H.; Zhang, Yalin; Zhang, Yalin (2022): NEXUS file for Phylogenetic analysis of the Idiocerus genus group (Hemiptera: Cicadellidae). University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5026417_V1
The text file contains the original DNA nucleotide sequence data used in the phylogenetic analyses of Xue et al. (in review), comprising the 13 protein-coding genes and 2 ribosomal gene subunits of the mitochondrial genome. The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The first six lines of the file identify the file as NEXUS, indicate that the file contains data for 30 taxa (species) and 13078 characters, indicate that the characters are DNA sequence, that gaps inserted into the DNA sequence alignment are indicated by a dash, and that missing data are indicated by a question mark. The positions of data partitions are indicated in the mrbayes block of commands for the phylogenetic program MrBayes (version 3.2.6) beginning near the end of the file. The mrbayes block also contains instructions for MrBayes on various non-default settings for that program. These are explained in the Methods section of the submitted manuscript. Two supplementary tables in the provided PDF file provide additional information on the species in the dataset, including the GenBank accession numbers for the sequence data (Table S1) and the DNA substitution models used for each of the individual mitochondrial genes and for different codon positions of the protein-coding genes used for analyses in the programs MrBayes and IQ-Tree (version 1.6.8) (Table S2). Full citations for references listed in Table S1 can be found by searching GenBank using the corresponding accession number. The supplemental tables will also be linked to the article upon publication at the journal website.
keywords:
Hemiptera; phylogeny; mitochondrial genome; morphology; leafhopper
published: 2022-03-31
Crawford, Reed D.; Dodd, Luke E.; Tillman, Frank E.; O'Keefe, Joy M. (2022): Data for Evaluating bat boxes: Design and placement alter bioenergetic costs and overheating risk. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3592866_V1
This dataset contains our bi-hourly temperature recordings from 40 rocket box style artificial roosts of 5 designs deployed in Indiana and Kentucky, USA from April through September 2019. This dataset also includes our endothermic and faculatively heterothermic daily energy expenditure datasets used in our bioenergetic analysis, which were calculated from the bi-hourly rocket box temperature data. Lastly, we include our overheating counts dataset which summarizes daily overheating events (i.e., temperatures > 40 Celsius) in each rocket box style bat box over the course of the study period, these daily summaries were also calculated from the bi-hourly rocket box temperature recordings.
keywords:
artificial roost; bat box; microcllimate; temperature
published: 2022-04-29
Wedell, Eleanor; Warnow, Tandy (2022): Biological and Simulated datasets for testing the SCAMPP framework for phylogenetic placement methods. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9257957_V1
Thank you for using these datasets! These files contain trees and reference alignments, as well as the selected query sequences for testing phylogenetic placement methods against and within the SCAMPP framework. There are four datasets from three different sources, each containing their source alignment and "true" tree, any estimated trees that may have been generated, and any re-estimated branch lengths that were created to be used with their requisite phylogenetic placement method. Three biological datasets (16S.B.ALL, PEWO/LTP_s128_SSU, and PEWO/green85) and one simulated dataset (nt78) is contained. See README.txt in each file for more information.
keywords:
Phylogenetic Placement; Phylogenetics; Maximum Likelihood; pplacer; EPA-ng
published: 2022-07-25
Jett, Jacob (2022): SBKS - Celllines Raw Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8851803_V1
A set of cell-line entity mentions derived from an NERC dataset analyzing 900 synthetic biology articles published by the ACS. This data is associated with the Synthetic Biology Knowledge System repository (https://web.synbioks.org/). The data in this dataset are raw mentions from the NERC data.
keywords:
synthetic biology; NERC data; cell-line mentions
published: 2022-08-31
Seyfried, Georgia; Midgley, Meghan; Phillips, Richard; Yang, Wendy (2022): Data for Refining the role of nitrogen mineralization in mycorrhizal nutrient syndromes. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5586647_V2
This dataset includes data on soil properties, soil N pools, and soil N fluxes presented in the manuscript, "Refining the role of nitrogen mineralization in mycorrhizal nutrient syndromes". Please refer to that publication for details about methodologies used to generate these data and for the experimental design. For this verison 2, we added specific gross nitrogen mineralization rates (ugN/gOM/d), microbial biomass carbon (ugC/gdw), microbial biomass nitrogen (ugN/gdw) and microbial biomass C:N ratios to the newest version of the data set. Additionally, we updated values for gross nitrogen mineralization, microbial NO3 assimilation and microbial NH4 assimilation to reflect slight changes in data processing. Those changes are reflected in "220829_All data_repository.csv". "220829_nitrogen_mineralization_readme.txt " is updated readme for the new file. The other 2 files begin with “220426_” are older version and same as in V1.
keywords:
Nitrogen cycling; Ectomycorrhizal fungi; Arbuscular mycorrhizal fungi; Nitrogen fertilization; Gross mineralization
published: 2024-01-01
Edmonds, Devin; Bach, Elizabeth; Colton, Andrea; Jaquet, Izabelle; Kessler, Ethan; Dreslik, Michael (2024): Data for Ornate Box Turtle (Terrapene ornata) Emergence. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7298951_V1
These data were used to make a predictive model of when ornate box turtles (Terrapene ornata) are likely to be above ground and at risk from fire. The data were generated using shell temperatures, soil temperatures at 0.35 m deep from known overwintering sites, and the spring and fall soil temperature inversion dates during 2019–2022 to infer if 26 individual radio-tracked turtles were above or below ground at three sites in Illinois.
keywords:
turtle; conservation; controlled burn; fire management; ectotherm; hibernation; brumation; reptile
published: 2021-09-03
Clark, Lindsay V.; Mays, Wittney; Lipka, Alexander E.; Sacks, Erik J. (2021): Dataset for evaluating the Hind/He statistic in polyRAD. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4814898_V1
All of the files in this dataset pertain to the evaluation of a novel statistic, Hind/He, for distinguishing Mendelian loci from paralogs. They are derived from a RAD-seq genotyping dataset of diploid and tetraploid Miscanthus sacchariflorus.
published: 2021-10-15
Perez, Sierra; Dalling, James; Fraterrigo, Jennifer (2021): Trelease and Brownfields Woods tree decay dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4547091_V1
Information on the location, dimensions, time of treefall or death, decay state, wood nutrient, wood pH and wood density data, and soil moisture, slope, distance from forest edge and soil nutrient data associated with the publication "Interspecific wood trait variation predicts decreased carbon residence time in changing forests" authored by Sierra Perez, Jennifer Fraterrigo, and James Dalling. ** <b>Note:</b> Blank cells indicate that no data were collected.
keywords:
wood decay; carbon residence time; coarse woody debris; decomposition, temperate forests
published: 2021-10-22
Carney, Sean; Ma, Wen; Chemla , Yann (2021): Source data for Kinetic and structural mechanism for DNA unwinding by a non-hexameric helicase. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5556865_V1
This dataset includes the source data for Figures 1-4 and supplementary figures 1-10 for the manuscript "Kinetic and structural mechanism for DNA unwinding by a non-hexameric helicase".
published: 2021-11-18
Pan, Chao; Tabatabaei, S Kasra; Tabatabaei Yazdi, S. M. Hossein; Hernandez, Alvaro; Schroeder, Charles; Milenkovic, Olgica (2021): Rewritable Two-Dimensional DNA-Based Data Storage System (2DDNA) Sequencing Dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2308557_V1
This dataset contains sequencing data obtained from Illumina MiSeq device to prove the concept of the proposed 2DDNA framework. Please refer to README.txt for detailed description of each file.
keywords:
machine learning;image processing;computer vision;rewritable storage system;2D DNA-based data storage
published: 2022-01-30
Bakken, George; Tillman, Francis; O'Keefe, Joy (2022): Data for "Methods for assessing artificial thermal refuges: spatiotemporal analysis more informative than averages". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9980397_V1
This dataset contains temperature measurements in four different bat box designs deployed in central Indiana, USA from May to September 2018. Hourly environmental data (temperature, solar radiation, and wind speed) are also included for days and hours sampled. Bat box temperature data were used as inputs in a free program, GNU Octave, to assess design performance with respect to suitability indices for endothermic metabolism and pup development. Scripts are included in the dataset.
keywords:
bats;thermal refuge;reproduction;conservation;bat box;microclimate
published: 2023-06-01
Pan, Chao; Peng, Jianhao; Chien, Eli; Milenkovic, Olgica (2023): Embedded dataset in Poincare Balls. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6901251_V1
This dataset contains four real-world sub-datasets with data embedded into Poincare ball models, including Olsson's single-cell RNA expression data, CIFAR10, Fashion-MNIST and mini-ImageNet. Each sub-dataset has two corresponding files: one is the data file, the other one is the pre-computed reference points for each class in the sub-dataset. Please refer to our paper (https://arxiv.org/pdf/2109.03781.pdf) and codes (https://github.com/thupchnsky/PoincareLinearClassification) for more details.
keywords:
Hyperbolic space; Machine learning; Poincare ball models; Perceptron algorithm; Support vector machine
published: 2022-02-04
Addepalli, Amulya; Ann Subin, Karen; Schneider, Jodi (2022): Dataset for Testing the Keystone Framework by Analyzing Positive Citations to Wakefield's 1998 Paper. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2532850_V1
keywords:
retracted papers; knowledge maintenance; keystone citations, Wakefield; misinformation in science; Information Quality Lab
published: 2022-02-11
Lu, Yiyang; Bohn-Wippert, Kathrin; Pazerunas, Patrick J.; Moy, Jennifer M.; Singh, Harpal; Dar, Roy D. (2022): Time-lapse Fluorescence Microscopy Images and Gene Expression Data of Single T-Cells Infected with a Minimal HIV Feedback Circuit under 1,806 Drug Treatments. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8103861_V1
Upon treatment removal, spontaneous and random reactivation of latently infected T cells remains a major barrier toward curing HIV. Due to its stochastic nature, fluctuations in gene expression (or “noise”) can bias HIV reactivation from latency, and conventional drug screens for mean gene expression neglect compounds that modulate noise. Here we present a time-lapse fluorescence microscopy image set obtained from a Jurkat T-cell line, infected with a minimal HIV gene circuit, treated with 1,806 small molecule compounds, and imaged for 48 hours. In addition, the single-cell time-dependent reporter dynamics (single-cell gene expression intensity and noise trajectories) extracted from the image dataset are included. Based on this dataset, a total of 5 latency promoting agents of HIV was found through further experimentation in Lu et al., PNAS 2021 (doi: 10.1073/pnas.2012191118). For a detailed description of the dataset, please refer to the readme file.
keywords:
HIV; latency; drug screen; fluorescence microscopy; time-lapse; microscopy; single-cell data; noise; gene expression fluctuation;