Displaying 126 - 150 of 781 in total

Illinois Data Bank Dataset Search Results

Dataset Search Results

published: 2018-09-06

XSEDE-Extreme Science and Engineering Discovery Environment (2018): XSEDE: Allocations Awards for the NSF Cyberinfrastructure Portfolio, 2004-2017. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4817808_V1

The XSEDE program manages the database of allocation awards for the portfolio of advanced research computing resources funded by the National Science Foundation (NSF). The database holds data for allocation awards dating to the start of the TeraGrid program in 2004 to present, with awards continuing through the end of the second XSEDE award in 2021. The project data include lead researcher and affiliation, title and abstract, field of science, and the start and end dates. Along with the project information, the data set includes resource allocation and usage data for each award associated with the project. The data show the transition of resources over a fifteen year span along with the evolution of researchers, fields of science, and institutional representation.

keywords: allocations; cyberinfrastructure; XSEDE

published: 2018-11-21

Clark, Lindsay V.; Lipka, Alexander E.; Sacks, Erik J. (2018): Scripts for testing the error rate of polyRAD. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9729830_V2

This set of scripts accompanies the manuscript describing the R package polyRAD, which uses DNA sequence read depth to estimate allele dosage in diploids and polyploids. Using several high-confidence SNP datasets from various species, allelic read depth from a typical RAD-seq dataset was simulated, then genotypes were estimated with polyRAD and other software and compared to the true genotypes, yielding error estimates.

keywords: R programming language; genotyping-by-sequencing (GBS); restriction site-associated DNA sequencing (RAD-seq); polyploidy; single nucleotide polymorphism (SNP); Bayesian genotype calling; simulation

published: 2023-06-10

Cheng, Xi; Kontou, Eleftheria (2023): Data for Estimating the Electric Vehicle Charging Demand of Multi-Unit Dwelling Residents in the United States. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4230392_V1

Data and code supporting the paper titled "Estimating the Electric Vehicle Charging Demand of Multi-Unit Dwelling Residents in the United States" by Xi Cheng and Eleftheria Kontou at the University of Illinois Urbana-Champaign. The data and the code enable analytics and assessment of multi-unit dwelling residents travel patterns and their electric vehicle charging demand.

keywords: multi-unit residents; electric vehicles; home charging; travel patterns; energy use

published: 2023-01-12

Mischo, William; Schlembach, Mary C.; Cabada, Elisandro (2023): Data for: Relationships between Journal Publication, Citation, and Usage Metrics within a Carnegie R1 University Collection: A Correlation Analysis. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6810203_V1

This dataset was developed as part of a study that examined the correlational relationships between local journal authorship, local and external citation counts, full-text downloads, link-resolver clicks, and four global journal impact factor indices within an all-disciplines journal collection of 12,200 titles and six subject subsets at the University of Illinois at Urbana-Champaign (UIUC) Library. While earlier investigations of the relationships between usage (downloads) and citation metrics have been inconclusive, this study shows strong correlations in the all-disciplines set and most subject subsets. The normalized Eigenfactor was the only global impact factor index that correlated highly with local journal metrics. Some of the identified disciplinary variances among the six subject subsets may be explained by the journal publication aspirations of UIUC researchers. The correlations between authorship and local citations in the six specific subject subsets closely match national department or program rankings. All the raw data used in this analysis, in the form of relational database tables with multiple columns. Can be opned using MS Access. Description for variables can be viewed through "Design View" (by right clik on the selected table, choose "Design View"). The 2 PDF files provide an overview of tables are included in each MDB file. In addition, the processing scripts and Pearson correlation code is available at <a href="https://doi.org/10.13012/B2IDB-0931140_V1">https://doi.org/10.13012/B2IDB-0931140_V1</a>.

keywords: Usage and local citation relationships; publication; citation and usage metrics; publication; citation and usage correlation analysis; Pearson correlation analysis

published: 2022-09-29

Levine, Nathaniel (2022): 3DIFICE: A Synthetic Dataset for Training Computer Vision Algorithms to Recognize Earthquake Damage to Reinforced Concrete Structures. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6415287_V1

3DIFICE: 3-dimensional Damage Imposed on Frame structures for Investigating Computer vision-based Evaluation methods This dataset contains 1,396 synthetic images and label maps with various types of earthquake damage imposed on reinforced concrete frame structures. Damage includes: cracking, spalling, exposed transverse rebar, and exposed longitudinal rebar. Each image has an associated label map that can be used for training machine learning algorithms to recognize the various types of damage.

keywords: computer vision; earthquake engineering; structural health monitoring; civil engineering; structural engineering;

published: 2023-06-01

Storms, Suzanna (2023): RT-LAMP as diagnostic tool for Influenza-A Virus detection in swine. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2079467_V1

Results of RT-LAMP reactions for influenza A virus diagnostic development.

keywords: swine influenza; LAMP; gBlock

published: 2022-07-25

Jett, Jacob (2022): SBKS - Species Noisy Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7146216_V1

This dataset is derived from the raw dataset (https://doi.org/10.13012/B2IDB-4950847_V1) and collects entity mentions that were manually determined to be noisy, non-species entities.

keywords: synthetic biology; NERC data; species mentions, noisy entities

published: 2022-07-25

Jett, Jacob (2022): SBKS - Species Not Found Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5491578_V1

This dataset is derived from the raw entity mention dataset (https://doi.org/10.13012/B2IDB-4950847_V1) for species entities and represents those that were determined to be species (i.e., were not noisy entities) but for which no corresponding concept could be found in the NCBI taxonomy database.

keywords: synthetic biology; NERC data; species mentions, not found entities

published: 2022-07-25

Jett, Jacob (2022): SBKS - Chemical - Cleaned & Grounded Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3396059_V1

This dataset represents the results of manual cleaning and annotation of the entity mentions contained in the raw dataset (https://doi.org/10.13012/B2IDB-4163883_V1). Each mention has been consolidated and linked to an identifier for a matching concept from the NCBI's taxonomy database.

keywords: synthetic biology; NERC data; chemical mentions; cleaned data; ChEBI ontology

published: 2022-07-25

Jett, Jacob (2022): SBKS - Chemical Noisy Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7228767_V1

This dataset is derived from the raw dataset (https://doi.org/10.13012/B2IDB-4163883_V1) and collects entity mentions that were manually determined to be noisy, non-chemical entities.

keywords: synthetic biology; NERC data; chemical mentions, noisy entities

published: 2022-07-25

Jett, Jacob (2022): SBKS - Chemical Not Found Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4570128_V1

This dataset is derived from the raw entity mention dataset (https://doi.org/10.13012/B2IDB-4163883_V1) for checmical entities and represents those that were determined to be chemicals (i.e., were not noisy entities) but for which no corresponding concept could be found in the ChEBI ontology.

keywords: synthetic biology; NERC data; chemical mentions, not found entities

published: 2022-07-25

Jett, Jacob (2022): SBKS - Genes Raw Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3887275_V1

A set of gene and gene-related entity mentions derived from an NERC dataset analyzing 900 synthetic biology articles published by the ACS. This data is associated with the Synthetic Biology Knowledge System repository (https://web.synbioks.org/). The data in this dataset are raw mentions from the NERC data.

keywords: synthetic biology; NERC data; gene mentions

published: 2022-09-16

Zhong, Jia; Khanna, Madhu (2022): Model Code and Data for "Assessing the Efficiency Implications of Renewable Fuel Policy Design in the United States". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6803176_V1

This dataset contains model code (including input data) to replicate the outcomes for "Assessing the Efficiency Implications of Renewable Fuel Policy Design in the United States". The model consists of: (1) The replication codes and data for the model. To run the model, using GAMS to run the "Models.gms" file.

keywords: Renewable Fuel Standard; Nested structure; cellulosic waiver credit; RIN

published: 2022-08-22

Pastrana-Otero, Isamar; Majumdar, Sayani; Kraft, Mary L. (2022): Raman spectra of individual, living hematopoietic stem and progenitor cells. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9950442_V1

This dataset contains Raman spectra, each acquired from an individual, living, primary murine cell belonging to one of the six most immature hematopoietic cell populations found in the body: hematopoietic stem cell (HSC), mutipotent progenitor 1 (MPP1), multipotent progenitor 2 (MPP2), multipotent progenitor 3 (MPP3), common lymphoid progenitor, common myeloid progenitor (CLP). These spectra are useful for identifying spectral signatures that are characteristic of each hematopoietic stem or early progenitor cell population. *NOTE: __MACOSX folder and files start with “._[file name]” found in "Raman spectra of single cells text files.zip" were created by the computer operation system, in unreadable format, which are not part of the data and can be removed/ignored when using the data.

keywords: Raman spectroscopy; single-cell spectrum; hematopoietic cell; hematopoietic stem cell; multipotent progenitor cell; common myeloid progenitor; common lymphoid progenitor

published: 2022-09-07

Jiang, Chongya; Guan, Kaiyu; Khanna, Madhu; Chen, Luoye; Peng, Jian (2022): Data for Assessing Marginal Land Availability Based on Land Use Change Information in the Contiguous United States. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6395937_V1

The availability of economically marginal land for energy crops is identified using the Cropland Data Layer and other soil, wind, climate data resources. All data are recognized on a 30m spatial resolution across the continental United States.

keywords: marginal land; biofuel production; remote sensing; land use change; Cropland Data Layer

published: 2023-01-12

Mischo, William; Schlembach, Mary C. (2023): Processing and Pearson Correlation Scripts for the C&RL Article on the Relationships between Publication, Citation, and Usage Metrics at the University of Illinois at Urbana-Champaign Library . University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0931140_V1

These processing and Pearson correlational scripts were developed to support the study that examined the correlational relationships between local journal authorship, local and external citation counts, full-text downloads, link-resolver clicks, and four global journal impact factor indices within an all-disciplines journal collection of 12,200 titles and six subject subsets at the University of Illinois at Urbana-Champaign (UIUC) Library. This study shows strong correlations in the all-disciplines set and most subject subsets. Special processing scripts and web site dashboards were created, including Pearson correlational analysis scripts for reading values from relational databases and displaying tabular results. The raw data used in this analysis, in the form of relational database tables with multiple columns, is available at <a href="https://doi.org/10.13012/B2IDB-6810203_V1">https://doi.org/10.13012/B2IDB-6810203_V1</a>.

keywords: Pearson Correlation Analysis Scripts; Journal Publication; Citation and Usage Data; University of Illinois at Urbana-Champaign Scholarly Communication

published: 2022-09-19

Detmer, Thomas (2022): ShelbyvilleZooplankton. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2467544_V1

Data characterize zooplankton in Shelbyville Reservoir, Illinois, United States of America. Zooplankton were sampled with a conical zooplankton net (0.5m diameter mouth) when water was deeper than 2 m and by grab sample when water was shallower. Zooplankton samples were concentrated and subsampled with a Hensen-Stempel pipette following protocols described in Detmer et al. (2019). Zooplankton were identified to the lowest feasible taxonomic unit according to Pennak (1989) and Thorp and Covich (2001) and were enumerated in a 1 mL Sedgewick-Rafter cell. Subsamples were analyzed until at least 200 individuals were enumerated from each site.were counted across for each of the three main taxonomic groups (cladocerans, copepods, and rotifers). Given the variation in zooplankton concentrations at each site, this process often lead to far more than 200 individuals being counted (x̄ = 269, min = 200, max = 487). A summary of the sample size from each site can be found in Supplementary Table S2. Abundances were corrected for volume of water filtered. For rare taxa (< 20 individuals per sample), all individuals were measured for length. For abundant taxa, length measurements were collected on the first 20 organisms of each abundant taxon encountered in a subsample. Dry mass was calculated from equations for microcrustaceans, rotifers, and Chaoborus sp. (Rosen ,1981; Botrell et al., 1976; Dumont and Balvay, 1979).

keywords: Reservoir; Zooplankton

published: 2022-10-13

Xue, Qingquan; Xue, Qingquan; Dietrich, Christopher H.; Dietrich, Christopher H.; Zhang, Yalin; Zhang, Yalin (2022): NEXUS file for Phylogenetic analysis of the Idiocerus genus group (Hemiptera: Cicadellidae). University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5026417_V1

The text file contains the original DNA nucleotide sequence data used in the phylogenetic analyses of Xue et al. (in review), comprising the 13 protein-coding genes and 2 ribosomal gene subunits of the mitochondrial genome. The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The first six lines of the file identify the file as NEXUS, indicate that the file contains data for 30 taxa (species) and 13078 characters, indicate that the characters are DNA sequence, that gaps inserted into the DNA sequence alignment are indicated by a dash, and that missing data are indicated by a question mark. The positions of data partitions are indicated in the mrbayes block of commands for the phylogenetic program MrBayes (version 3.2.6) beginning near the end of the file. The mrbayes block also contains instructions for MrBayes on various non-default settings for that program. These are explained in the Methods section of the submitted manuscript. Two supplementary tables in the provided PDF file provide additional information on the species in the dataset, including the GenBank accession numbers for the sequence data (Table S1) and the DNA substitution models used for each of the individual mitochondrial genes and for different codon positions of the protein-coding genes used for analyses in the programs MrBayes and IQ-Tree (version 1.6.8) (Table S2). Full citations for references listed in Table S1 can be found by searching GenBank using the corresponding accession number. The supplemental tables will also be linked to the article upon publication at the journal website.

keywords: Hemiptera; phylogeny; mitochondrial genome; morphology; leafhopper

published: 2022-03-31

Crawford, Reed D.; Dodd, Luke E.; Tillman, Frank E.; O'Keefe, Joy M. (2022): Data for Evaluating bat boxes: Design and placement alter bioenergetic costs and overheating risk. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3592866_V1

This dataset contains our bi-hourly temperature recordings from 40 rocket box style artificial roosts of 5 designs deployed in Indiana and Kentucky, USA from April through September 2019. This dataset also includes our endothermic and faculatively heterothermic daily energy expenditure datasets used in our bioenergetic analysis, which were calculated from the bi-hourly rocket box temperature data. Lastly, we include our overheating counts dataset which summarizes daily overheating events (i.e., temperatures > 40 Celsius) in each rocket box style bat box over the course of the study period, these daily summaries were also calculated from the bi-hourly rocket box temperature recordings.

keywords: artificial roost; bat box; microcllimate; temperature

published: 2022-04-29

Wedell, Eleanor; Warnow, Tandy (2022): Biological and Simulated datasets for testing the SCAMPP framework for phylogenetic placement methods. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9257957_V1

Thank you for using these datasets! These files contain trees and reference alignments, as well as the selected query sequences for testing phylogenetic placement methods against and within the SCAMPP framework. There are four datasets from three different sources, each containing their source alignment and "true" tree, any estimated trees that may have been generated, and any re-estimated branch lengths that were created to be used with their requisite phylogenetic placement method. Three biological datasets (16S.B.ALL, PEWO/LTP_s128_SSU, and PEWO/green85) and one simulated dataset (nt78) is contained. See README.txt in each file for more information.

keywords: Phylogenetic Placement; Phylogenetics; Maximum Likelihood; pplacer; EPA-ng

published: 2022-07-25

Jett, Jacob (2022): SBKS - Celllines Raw Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8851803_V1

A set of cell-line entity mentions derived from an NERC dataset analyzing 900 synthetic biology articles published by the ACS. This data is associated with the Synthetic Biology Knowledge System repository (https://web.synbioks.org/). The data in this dataset are raw mentions from the NERC data.

keywords: synthetic biology; NERC data; cell-line mentions

published: 2022-08-31

Seyfried, Georgia; Midgley, Meghan; Phillips, Richard; Yang, Wendy (2022): Data for Refining the role of nitrogen mineralization in mycorrhizal nutrient syndromes. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5586647_V2

This dataset includes data on soil properties, soil N pools, and soil N fluxes presented in the manuscript, "Refining the role of nitrogen mineralization in mycorrhizal nutrient syndromes". Please refer to that publication for details about methodologies used to generate these data and for the experimental design. For this verison 2, we added specific gross nitrogen mineralization rates (ugN/gOM/d), microbial biomass carbon (ugC/gdw), microbial biomass nitrogen (ugN/gdw) and microbial biomass C:N ratios to the newest version of the data set. Additionally, we updated values for gross nitrogen mineralization, microbial NO3 assimilation and microbial NH4 assimilation to reflect slight changes in data processing. Those changes are reflected in "220829_All data_repository.csv". "220829_nitrogen_mineralization_readme.txt " is updated readme for the new file. The other 2 files begin with “220426_” are older version and same as in V1.

keywords: Nitrogen cycling; Ectomycorrhizal fungi; Arbuscular mycorrhizal fungi; Nitrogen fertilization; Gross mineralization

published: 2024-01-01

Edmonds, Devin; Bach, Elizabeth; Colton, Andrea; Jaquet, Izabelle; Kessler, Ethan; Dreslik, Michael (2024): Data for Ornate Box Turtle (Terrapene ornata) Emergence. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7298951_V1

These data were used to make a predictive model of when ornate box turtles (Terrapene ornata) are likely to be above ground and at risk from fire. The data were generated using shell temperatures, soil temperatures at 0.35 m deep from known overwintering sites, and the spring and fall soil temperature inversion dates during 2019–2022 to infer if 26 individual radio-tracked turtles were above or below ground at three sites in Illinois.

keywords: turtle; conservation; controlled burn; fire management; ectotherm; hibernation; brumation; reptile

published: 2021-09-03

Clark, Lindsay V.; Mays, Wittney; Lipka, Alexander E.; Sacks, Erik J. (2021): Dataset for evaluating the Hind/He statistic in polyRAD. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4814898_V1

All of the files in this dataset pertain to the evaluation of a novel statistic, Hind/He, for distinguishing Mendelian loci from paralogs. They are derived from a RAD-seq genotyping dataset of diploid and tetraploid Miscanthus sacchariflorus.

published: 2021-10-15

Perez, Sierra; Dalling, James; Fraterrigo, Jennifer (2021): Trelease and Brownfields Woods tree decay dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4547091_V1

Information on the location, dimensions, time of treefall or death, decay state, wood nutrient, wood pH and wood density data, and soil moisture, slope, distance from forest edge and soil nutrient data associated with the publication "Interspecific wood trait variation predicts decreased carbon residence time in changing forests" authored by Sierra Perez, Jennifer Fraterrigo, and James Dalling. ** <b>Note:</b> Blank cells indicate that no data were collected.

keywords: wood decay; carbon residence time; coarse woody debris; decomposition, temperate forests