Illinois Data Bank
Log in with NetID
University Library, University of Illinois at Urbana-Champaign
Illinois Data Bank
Log in with NetID
25 per page
50 per page
Displaying datasets 201 - 225 of 397 in total
Generate Report from Search Results
Life Sciences (221)
Social Sciences (88)
Physical Sciences (53)
Technology and Engineering (34)
Arts and Humanities (1)
U.S. National Science Foundation (NSF) (104)
U.S. National Institutes of Health (NIH) (41)
U.S. Department of Energy (DOE) (35)
U.S. Department of Agriculture (USDA) (21)
Illinois Department of Natural Resources (IDNR) (9)
U.S. National Aeronautics and Space Administration (NASA) (4)
U.S. Geological Survey (USGS) (4)
U.S. Army (1)
CC BY (162)
Bradshaw, Therin M.; Blake-Bradshaw, Abigail G.; Fournier, Auriel M.V.; Lancaster, Joseph D. ; O'Connell, John; Jacques, Christopher N.; Eicholtz, Michael W.; Hagy, Heath M (2020): Marsh bird occupancy of wetlands managed for waterfowl in the Midwestern USA - Analysis Inputs. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5152821_V1
Data inputs, and scripts for the analysis detailed in Bradshaw et al, published in PlosONE 2020.
Marsh birds; wetlands
XSEDE-Extreme Science and Engineering Discovery Environment (2020): XSEDE: Allocations Awards for the NSF Cyberinfrastructure Portfolio, 2004-2019. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0757799_V1
The XSEDE program manages the database of allocation awards for the portfolio of advanced research computing resources funded by the National Science Foundation (NSF). The database holds data for allocation awards dating to the start of the TeraGrid program in 2004 to present, with awards continuing through the end of the second XSEDE award in 2021. The project data include lead researcher and affiliation, title and abstract, field of science, and the start and end dates. Along with the project information, the data set includes resource allocation and usage data for each award associated with the project. The data show the transition of resources over a fifteen year span along with the evolution of researchers, fields of science, and institutional representation.
allocations; cyberinfrastructure; XSEDE
de Moya, Robert (2019): Heteroptera Transcriptome Set. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7784896_V1
These are the alignments of transcriptome data used for the analysis of members of Heteroptera. This dataset is analyzed in "Deep instability in the phylogenetic backbone of Heteroptera is only partly overcome by transcriptome-based phylogenomics" published in Insect Systematics and Diversity.
Heteroptera; Hemiptera; Phylogenomics; transcriptome
Zhang, Jun; Wuebbles, Donald; Kinnison, Douglas; Saiz López, Alfonso (2020): Data for: Revising the Ozone Depletion Potentials for Short-Lived Chemicals such as CF3I and CH3I. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5952573_V1
This datasets provide basis of our analysis in the paper - Revising the Ozone Depletion Potentials for Short-Lived Chemicals such as CF3I and CH3I. All datasets here are from the model output (CAM4-chem). All the simulations (background and perturbation) were run to steady-state and only the last year outputs used in analysis are archived here.
Illinois Data Bank; NetCDF; Ozone Depletion Potential; CF3I and CH3I
Zahniser, James; Dietrich, Christopher (2020): NEXUS data file for phylogenetic analysis of Deltocephalinae (Hemiptera: Cicadellidae) . University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7290912_V1
The Delt_Comb.NEX text file contains the original data used in the phylogenetic analyses of Zahniser & Dietrich, 2013 (European Journal of Taxonomy, 45: 1-211). The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The first nine lines of the file indicate the file type (Nexus), that 152 taxa were analyzed, that a total of 3971 characters were analyzed, the format of the data, and specification for two symbols used in the dataset. There are four datasets separated into blocks, one each for: 28S rDNA gene, Histone H3 gene, morphology, and insertion/deletion characters scored based on the alignment of the 28S rDNA dataset. Descriptions of the morphological characters and more details on the species and specimens included in the dataset are provided in the publication using this dataset. A text file, Delt_morph_char.txt, is available here that states the morphological characters and characters states that were scored in the Delt_Comb.NEX dataset. The original DNA sequence data are available from NCBI GenBank under the accession numbers indicated in publication. Chromatogram files for each sequencing read are available from the first author upon request.
phylogeny; DNA sequence; morphology; parsimony analysis; Insecta; Hemiptera; Cicadellidae; leafhopper; evolution; 28S rDNA; histone H3; bayesian analysis
Williams, Benjamin R.; Benson, Thomas J. (2020): Habitat Use of Spring Migrating Dabbling Ducks in the Wabash River Valley. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7017235_V1
This data describes habitat use, availability, landscape level influences, and daily movement of dabbling ducks in the Wabash River Valley of southeastern Illinois and southwestern Indiana. It contains triangulated locations of individual ducks, associated habitat assignments of those locations, flood survey data to determine water availability, and randomly generated points to assess landscape level questions.
waterfowl; ducks; dabbling; mallard; teal; habitat
Miao, Guofang; Guan, Kaiyu (2020): Sun-induced chlorophyll fluorescence of two Nebraska maize sites in 2017. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5893373_V1
This dataset includes two data files that provide the time series (Jul. - Sep. 2017) data of sun-induced chlorophyll fluorescence (SIF_760) collected under sunny conditions at two maize sites (one rainfed and the other irrigated) in Nebraska in 2017. Data contain 392 SIF_760 records at the rainfed site and 707 records at the irrigated site. The timestamp uses local standard time. Data are available for the sunny conditions from 8 am to 5 pm (corresponding to 9 am to 6 pm local time) throughout the study period.
sun-induced chlorophyll fluorescence (SIF); maize; gross primary production(GPP); light use efficiency(LUE); SIF yield
Cisneros, Julia (2020): Data for: Dunes in the world’s big rivers are characterised by low-angle leeside slopes and a complex shape. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7525764_V2
Morphologic data of dunes in the World's big rivers. Morphologic descriptors for large dunes include: dune height, dune mean leeside angle, dune maximum leeside angle, dune wavelength, dune flow depth (at the crest), and the fractional height of the maximum slope on the leeside for each dune. Morphologic descriptors for small dunes include: dune height, dune mean leeside angle, dune maximum leeside angle, dune wavelength, and dune flow depth (at the crest).
dune; bedform; rivers; morphology;
Zachwieja, Alexandra (2019): Data for: Climate, competition, and environment shaped human land use in Late Pleistocene Southeast Asia. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8284108_V1
Dataset providing calculation of a Competition Index (CI) for Late Pleistocene carnivore guilds in Laos and Vietnam and their relationship to humans. Prey mass spectra, Prey focus masses, and prey class raw data can be used to calculate the CI following Hemmer (2004). Mass estimates were calculated for each species following Van Valkenburgh (1990). Full citations to methodological papers are included as relationships with other resources
competition; Southeast Asia; carnivores; humans
Wang, Yu; Burgess, Steven J. ; de Becker, Elsa ; Long, Stephen P. (2019): Data and code for: Photosynthesis in the fleeting shadows: An overlooked opportunity for increasing crop productivity?. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9453481_V1
This dynamic photosynthesis model of soybean canopy is developed by Yu Wang (email@example.com), IGB, University of Illinois. If you want to know more details, please check the following publication Yu Wang, Steven J. Burgess, Elsa de Becker, Stephen P. Long. Photosynthesis in the fleeting shadows: An overlooked opportunity for increasing crop productivity? The Plant Journal.
Matlab; Soybean canopy; photosynthesis model
Zhang, Yujie; Araiza Bravo, Rodrigo; Chitambar, Eric; Lorenz, Virginia (2019): Dataset for "Channel Activation of CHSH Nonlocality". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3686727_V1
This dataset provides the raw data, code and related figures for the paper, "Channel Activation of CHSH Nonlocality"
Super-activation; Non-locality breaking channel
Kamuda, Mark; Huff, Kathryn (2019): Automated Isotope Identification and Quantification Using Artificial Neural Networks. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4860767_V1
This dataset contains gamma-ray spectra templates for a source interdiction and uranium enrichment measurement task. This dataset also contains Keras machine learning models trained using datasets created using these templates.
gamma-ray spectroscopy; neural networks; machine learning; isotope identification; uranium enrichment; sodium iodide; NaI(Tl)
Yang, Pan; Zhao, Qiankun; Cai, Ximing (2019): Land productivity and land availability for growing bioenergy crop in the Contiguous US. Center for Advanced Bioenergy and Bioproducts Innovation (CABBI). University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4584681_V1
The dataset consists of two types of data: the estimate of land productivity (the maximum productivity, MP) and the estimate of land that has low productivity for any major crops planted in the Contiguous United States and then may be available for growing bioenergy crops (the marginal land, ML). All data items are in GeoTiff format, under the World Geodetic System (WGS) 84 project, and with a resolution of 0.0020810045 degree (~250 m). The MP values are calculated based on machine learning model estimated yields of major crops in the CONUS, and its expected value (MP_mean.tif), and associated uncertainty (MP_IDP.tif). The ML availability data have two versions: a deterministic version and a version with uncertainty. The deterministic MLs are determined as the land pixels with expected MP values falling in the range defined in the following criteria, and the MLs with uncertainty are determined as the probability that the MP value of a land pixel falls in the range defined in the following criteria: Criteria_____Description S1________ Current crop and pasture land with MP <= P50 S2________ Current crop and pasture land with MP <= P25 S3________ S1 + current grass and shrub land with P25 < MP < P50 S4________ S2 + current grass and shrub land with P10 < MP < P25 Economic__ Current crop and pasture land with potential profitability < 0 Here P10, P25 and P50 are the 10th, 25th and 50th percentile of crop MP values
Land productivity;marginal land;land use
de Moya, Robert (2019): Feather Louse Orthology set. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0440388_V1
This is the data set associated with the manuscript titled "Extensive host-switching of avian feather lice following the Cretaceous-Paleogene mass extinction event." Included are the gene alignments used for phylogenetic analyses and the cophylogenetic input files.
phylogenomics, cophylogenetics, feather lice, birds
Zhang, Chuanyi; Ochoa, Idoia (2019): VCF files used for VEF: a Variant Filtering tool based on Ensemble methods. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9401259_V1
VCF files used to analyze a novel filtering tool VEF, presented in the article "VEF: a Variant Filtering tool based on Ensemble methods".
VCF files; filtering; VEF
Snyder, Corey; Do, Minh (2019): Data for STREETS: A Novel Camera Network Dataset for Traffic Flow. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3671567_V1
This dataset accompanies the paper "STREETS: A Novel Camera Network Dataset for Traffic Flow" at Neural Information Processing Systems (NeurIPS) 2019. Included are: *Over four million still images form publicly accessible cameras in Lake County, IL. The images were collected across 2.5 months in 2018 and 2019. *Directed graphs describing the camera network structure in two communities in Lake County. *Documented non-recurring traffic incidents in Lake County coinciding with the 2018 data. *Traffic counts for each day of images in the dataset. These counts track the volume of traffic in each community. *Other annotations and files useful for computer vision systems. Refer to the accompanying "readme.txt" or "readme.pdf" for further details.
camera network; suburban vehicular traffic; roadways; computer vision
Sherman, Garrick (2019): TREC document topic annotations. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9822674_V1
Human annotations of randomly selected judged documents from the AP 88-89, Robust 2004, WT10g, and GOV2 TREC collections. Seven annotators were asked to read documents in their entirety and then select up to ten terms they felt best represented the main topic(s) of the document. Terms were chosen from among a set sampled from the document in question and from related documents.
TREC; information retrieval; document topicality; document description
Saurabh, Jha; Archit, Patke; Mike, Showerman; Jeremy, Enos; Greg, Bauer; Zbigniew, Kalbarczyk; Ravishankar, Iyer; William , Kramer (2019): Monet - Blue Waters Network Dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2921318_V1
This dataset contains collected and aggregated network information from NCSA’s Blue Waters system, which is comprised of 27,648 nodes connected via Cray Gemini* 3D torus (dimension 24x24x24) interconnect, from Jan/01/2017 to May/31/2017. Network performance counters for links are exposed via Cray's gpcdr (<a href="https://github.com/ovis-hpc/ovis/wiki/gpcdr-kernel-module">https://github.com/ovis-hpc/ovis/wiki/gpcdr-kernel-module</a>) kernel module. Lightweight Distributed Metric Service ([LDMS](<a href="https://github.com/ovis-hpc/ovis">https://github.com/ovis-hpc/ovis</a>)) is used to sampled the performance counters at 60 second intervals. Please read "README.md" file. <b>Acknowledgement:</b> This dataset is collected as a part of the Blue Waters sustained-petascale computing project, which is supported by the National Science Foundation and the state of Illinois. Blue Waters is a joint effort of the University of Illinois at Urbana-Champaign and its National Center for Supercomputing Applications.
HPC; Interconnect; Network; Congestion; Blue Waters; Dataset
Rezapour, Rezvaneh (2019): Data for: How do moral values differ in tweets on social movements?. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9614170_V1
We are sharing the tweet IDs of four social movements: #BlackLivesMatter, #WhiteLivesMatter, #AllLivesMatter, and #BlueLivesMatter movements. The tweets are collected between May 1st, 2015 and May 30, 2017. We eliminated the location to the United States and focused on extracting the original tweets, excluding the retweets. Recommended citations for the data: Rezapour, R. (2019). Data for: How do Moral Values Differ in Tweets on Social Movements?. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9614170_V1 and Rezapour, R., Ferronato, P., and Diesner, J. (2019). How do moral values differ in tweets on social movements?. In 2019 Computer Supported Cooperative Work and Social Computing Companion Publication (CSCW’19 Companion), Austin, TX.
Twitter; social movements; black lives matter; blue lives matter; all lives matter; white lives matter
Ouldali, Hadjer; Sarthak, Kumar; Ensslen, Tobias; Piguet, Fabien; Manivet, Philippe; Pelta, Juan; Behrends, Jan C.; Aksimentiev, Aleksei; Oukhaled, Abdelghani (2019): Experiment and simulation raw data for "Electrical recognition of the twenty proteinogenic amino acids using an aerolysin nanopore". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4905767_V1
Raw MD simulation trajectory, input and configuration files, SEM current data, and experimental raw data accompanying the publication, "Electrical recognition of the twenty proteinogenic amino acids using an aerolysin nanopore". README.md contains a description of all associated files.
molecular dynamics; protein sequencing; aerolysin; nanopore sequencing
Xia, Yushu; Wander, Michelle (2019): Correlation Between Soil Quality Indictors including β-glucosidase, Fluorescein Diacetate Hydrolysis and Permanganate Oxidizable Carbon, and Ecosystem Functions represented by Crop Productivity and Greenhouse Gas Emissions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4693684_V2
We summarize peer reviewed literature reporting associations between for three soil quality indicators (SQIs) (β-glucosidase (BG), fluorescein diacetate (FDA) hydrolysis, and permanganate oxidizable carbon (POXC)) and crop yield and greenhouse gas emissions. Peer-reviewed articles published between January of 1990 and May 2018 were searched using the Thomas Reuters Web of Science database (Thomas Reuters, Philadelphia, Pennsylvania) and Google Scholar to identify studies reporting results for: “β-glucosidase”, “permanganate oxidizable carbon”, “active carbon”, “readily oxidizable carbon”, or “fluorescein diacetate hydrolysis”, together with one or more of the following: “crop yield”, “productivity”, “greenhouse gas’, “CO2”, “CH4”, or “N2O”. Meta-data for records include the following descriptor variables and covariates useful for scoring function development: 1) identifying factors for the study site (location, duration of the experiment), 2) soil textural class, pH, and SOC, 3) depth of soil sampling, 4) units used in published works (i.e.: equivalent mass, concentration), 5) SQI abundances and measured ecosystem functions, and 6) summary statistics for correlation between SQIs and functions (yield and greenhouse gas emissions). *Note: Blank values in tables are considered unreported data.
Soil health promoting practices; Soil quality indicators; β-glucosidase; fluorescein diacetate hydrolysis; Permanganate oxidizable carbon; Greenhouse gas emissions; Scoring curves; Soil Management Assessment Framework
Corey, Ryan M.; Skarha, Matthew D.; Singer, Andrew C. (2019): Massive Distributed Microphone Array Dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6216881_V1
Large, distributed microphone arrays could offer dramatic advantages for audio source separation, spatial audio capture, and human and machine listening applications. This dataset contains acoustic measurements and speech recordings from 10 loudspeakers and 160 microphones spread throughout a large, reverberant conference room. The distributed microphone system contains two types of array: four wearable microphone arrays of 16 sensors each placed near the ears and across the upper body, and twelve tabletop arrays of 8 microphones each in enclosures designed to resemble voice-assistant speakers. The dataset includes recordings of chirps that can be used to measure impulse responses and of speech clips derived from the CSTR VCTK corpus. The speech clips are recorded both individually and as a mixture to support source separation experiments. The uncompressed files are about 13.4 GB.
microphone arrays; audio source separation; augmented listening; wireless sensor networks
Smith, Rebecca (2019): Spatial and Temporal Invasion Dynamics of Aedes albopictus (Diptera: Culicidae) in Illinois. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7540359_V1
Supporting secondary data used in a manuscript currently in submission regarding the invasion dynamics of the asian tiger mosquito, Aedes albopictus, in the state of Illinois
Choi, Sang Hyun; Rao, Vikyath; Gernat, Tim; Hamilton, Adam; Robinson, Gene; Goldenfeld, Nigel (2019): Honeybee trophallaxis event data for The origin of heavy tails in honeybee and human interaction times. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2712449_V1
Filtered trophallaxis interactions for two honeybee colonies, each containing 800 worker bees and one queen. Each colony consists of bees that were administered a juvenile hormone analogy, a vehicle treatment, or a sham treatment to determine the effect of colony perturbation on the duration of trophallaxis interactions. Columns one and two display the unique identifiers for each bee involved in a particular trophallaxis exchange, and columns three and four display the Unix timestamp of the beginning/end of the interaction (in milliseconds), respectively.<br /><b>Note</b>: the queen interactions were omitted from the uploaded dataset for reasons that are described in submitted manuscript. Those bees that performed poorly are also omitted from the final dataset.
honey bee; trophallaxis; social network
Choi, Sang Hyun; Rao, Vikyath D.; Gernat, Tim; Hamilton, Adam R.; Robinson, Gene E.; Goldenfeld, Nigel (2019): Honeybee F2F event data for The origin of heavy tails in honeybee and human interaction times. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4021786_V1
Dataset for F2F events of honeybees. F2F events are defined as face-to-face encounters of two honeybees that are close in distance and facing each other but not connected by the proboscis, thus not engaging in trophallaxis. The first and the second columns show the unique id's of honeybees participating in F2F events. The third column shows the time at which the F2F event started while the fourth column shows the time at which it ended. Each time is in the Unix epoch timestamp in milliseconds.