Displaying 101 - 125 of 781 in total

Illinois Data Bank Dataset Search Results

Dataset Search Results

published: 2020-05-13

Althaus, Scott; Bajjalieh, Joseph; Jungblut, Marc; Shalmon, Dan; Ghosh, Subhankar; Joshi, Pradnyesh (2020): Responsible Terrorism Coverage (ResTeCo) Project New York Times (NYT) Dataset . University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-4638196_V1

Terrorism is among the most pressing challenges to democratic governance around the world. The Responsible Terrorism Coverage (or ResTeCo) project aims to address a fundamental dilemma facing 21st century societies: how to give citizens the information they need without giving terrorists the kind of attention they want. The ResTeCo hopes to inform best practices by using extreme-scale text analytic methods to extract information from more than 70 years of terrorism-related media coverage from around the world and across 5 languages. Our goal is to expand the available data on media responses to terrorism and enable the development of empirically-validated models for socially responsible, effective news organizations. This particular dataset contains information extracted from terrorism-related stories in the New York Times published between 1945 and 2018. It includes variables that measure the relative share of terrorism-related topics, the valence and intensity of emotional language, as well as the people, places, and organizations mentioned. This dataset contains 3 files: 1. "ResTeCo Project NYT Dataset Variable Descriptions.pdf" <ul> <li>A detailed codebook containing a summary of the Responsible Terrorism Coverage (ResTeCo) Project New York Times (NYT) Dataset and descriptions of all variables. </li> </ul> 2. "resteco-nyt.csv" <ul><li>This file contains the data extracted from terrorism-related media coverage in the New York Times between 1945 and 2018. It includes variables that measure the relative share of topics, sentiment, and emotion present in this coverage. There are also variables that contain metadata and list the people, places, and organizations mentioned in these articles. There are 53 variables and 438,373 observations. The variable "id" uniquely identifies each observation. Each observation represents a single news article. </li> <li> Please note that care should be taken when using "resteco-nyt.csv". The file may not be suitable to use in a spreadsheet program like Excel as some of the values get to be quite large. Excel cannot handle some of these large values, which may cause the data to appear corrupted within the software. It is encouraged that a user of this data use a statistical package such as Stata, R, or Python to ensure the structure and quality of the data remains preserved.</li> </ul> 3. "README.md" <ul><li>This file contains useful information for the user about the dataset. It is a text file written in mark down language</li> </ul> Citation Guidelines 1) To cite this codebook please use the following citation: Althaus, Scott, Joseph Bajjalieh, Marc Jungblut, Dan Shalmon, Subhankar Ghosh, and Pradnyesh Joshi. 2020. Responsible Terrorism Coverage (ResTeCo) Project New York Times (NYT) Dataset Variable Descriptions. Responsible Terrorism Coverage (ResTeCo) Project New York Times Dataset. Cline Center for Advanced Social Research. May 13. University of Illinois Urbana-Champaign. doi: 10.13012/B2IDB-4638196_V1 2) To cite the data please use the following citation: Althaus, Scott, Joseph Bajjalieh, Marc Jungblut, Dan Shalmon, Subhankar Ghosh, and Pradnyesh Joshi. 2020. Responsible Terrorism Coverage (ResTeCo) Project New York Times Dataset. Cline Center for Advanced Social Research. May 13. University of Illinois Urbana-Champaign. doi: 10.13012/B2IDB-4638196_V1

keywords: Terrorism, Text Analytics, News Coverage, Topic Modeling, Sentiment Analysis

published: 2025-01-26

Liu, Shanshan; Vlachokostas, Alex; Kontou, Eleftheria (2025): Data for Leveraging electric vehicles as a resiliency solution for residential backup power during outages. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-2525945_V1

Data and code supporting the paper titled "Leveraging electric vehicles as a resiliency solution for residential backup power during outages" by Shanshan Liu, Alex Vlachokostas, and Eleftheria Kontou. The data and the code enable spatiotemporal analytics and assessment of electric vehicle charging demand, remaining driving range, residential energy use, and vehicle-to-home (V2H) energy system resilience metrics.

keywords: Electric vehicles; Power outages; Vehicle-to-home energy system; Residential loads; Bidirectional energy exchange

published: 2025-01-27

Shen, Chengze; Wedell, Eleanor; Pop, Mihai; Warnow, Tandy (2025): TIPP3 Benchmark Data and Simulated Reads. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-5467027_V1

The zip file contains the benchmark data used for the TIPP3 simulation study. See the README file for more information.

keywords: TIPP3;abundance profile;reference database;taxonomic identification;simulation

published: 2025-01-27

Zinnen, Jack; Chase, Marissa; Charles, Brian; Meissen, Justin; Matthews, Jeffrey (2025): RELIX: a dataset of vascular plant species presence for 353 prairie remnants in the Midwestern United States, with prairie remnant metadata. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-5653911_V1

This is the core data for RELIX, a dataset of vascular plant species presence for 353 prairie remnants in the Midwestern United States and associated dataset of prairie remnant metadata. The primary data file contains a list of the vascular plant species observed in the prairie remnants, as well as a metadata table with more information about the prairie remnant in question and the species list itself. The data was compiled from a variety of written sources, private and published, chronicling observations made between the mid-twentieth century and 2021. It also contains a supplementary data table of vascular plant species observed in at least 8 of the prairie remnants in RELIX, as well as a list of acknowledgements for the associated manuscript.

keywords: prairie peninsula; prairie relict; prairie soil; species inventories; tallgrass prairie

published: 2024-04-10

Konar, Megan; Ruess, Paul J.; Wanders, Niko; Bierkens, Marc F.P. (2024): Data for Total irrigation by crop in the Continental United States from 2008 to 2020. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-2656127_V1

This dataset provides estimates of total Irrigation Water Use (IWU) by crop, county, water source, and year for the Continental United States. Total irrigation from Surface Water Withdrawals (SWW), total Groundwater Withdrawals (GWW), and nonrenewable Groundwater Depletion (GWD) is provided for 20 crops and crop groups from 2008 to 2020 at the county spatial resolution. In total, there are nearly 2.5 million data points in this dataset (3,142 counties; 13 years; 3 water sources; and 20 crops). This dataset supports the paper by Ruess et al (2024) "Total irrigation by crop in the Continental United States from 2008 to 2020", Scientific Data, doi: 10.1038/s41597-024-03244-w When using, please cite as: Ruess, P.J., Konar, M., Wanders, N., and Bierkens, M.F.P. (2024) Total irrigation by crop in the Continental United States from 2008 to 2020, Scientific Data, doi: 10.1038/s41597-024-03244-w

keywords: water use; irrigation; surface water; groundwater; groundwater depletion; counties; crops; time series

published: 2024-10-31

Liu, Shanshan; Vlachokostas, Alex; Kontou, Eleftheria (2024): Data for Resilience and environmental benefits of electric school buses as backup power for educational functions continuation during outages. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-4925630_V1

School buses transport 20 million students annually and are currently undergoing electrification in the US. With Vehicle-to-Building (V2B) technology, electric school buses (ESBs) can supply energy to school buildings during power outages, ensuring continued operation and safety. This study proposes assessing the resilience of secondary schools during outages by leveraging ESB fleets as backup power across various US climate regions. The findings indicate that the current fleet of ESBs in representative cities across different climate regions in the US is insufficient to meet the power demands of an entire school or even its HVAC system. However, we estimated the number of ESBs required to support the school's power needs, and we showed that the use of V2B technology significantly reduces carbon emissions compared to backup diesel generators. While adjusting HVAC setpoints and installing solar panels have limited impacts on enhancing school resilience, gathering students in classrooms during outages significantly improved resilience in our case study in Houston, Texas. Given the ongoing electrification of school buses, it is essential for schools to complement ESBs with stationary batteries and other backup power sources, such as solar and/or diesel generators, to effectively address prolonged outages. Determining the deployment of direct current fast and Level 2 chargers can reduce infrastructure costs while maintaining the resilience benefits of ESBs. This dataset includes the simulation process and results of this study.

keywords: Electric school bus; Power outages,;Vehicle-to-Building technology; Carbon emission reduction; Backup power source

published: 2025-01-23

Smith, Rebecca; Mateus-Pinilla, Nohra (2025): Assessing contact between humans and white-tailed deer in Illinois: a cross-sectional survey. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-1661924_V1

These are the responses to an open, convenience sample survey of residents of Illinois to understand their interactions with wild deer. The survey was available on REDCap between December 19, 2022 and December 19, 2023, and was publicized through listserves, Facebook groups, and media reporting. The file "COVID Deer Survey _ REDCap.pdf" contains the codebook for the survey, including the questions; all factor variables have ".factor" added to their name in the dataset. The file "DeerSurveyData.csv" contains the dataset. The file "Score_calculation_for_sharing.R" is the code to create the cleaned dataset used for analysis from the raw survey responses. Throughout, NA is used to represent null/not available/not applicable; this is most likely either a failure to answer the question or, in some cases, a question that was not presented as it is not relevant based on answers to previous questions.

keywords: deer; survey

published: 2023-07-14

Punyasena, Surangi W.; Urban, Michael A.; Adaime, Marc-Elie; Romero, Ingrid; Jaramillo, Carlos (2023): Pollen of Podocarpus (Podocarpaceae): Airyscan confocal superresolution images. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-8817604_V1

This dataset includes a total of 300 images of 45 extant species of Podocarpus (Podocarpaceae) and nine images of fossil specimens of the morphogenus Podocarpidites. The goal of this dataset is to capture the diversity of morphology within the genus and create an image database for training machine learning models. The images were taken using Airyscan confocal superresolution microscopy at 630x magnification (63x/NA 1.4 oil DIC). The images are in the CZI file format. They can be opened using Zeiss propriety software (Zen, Zen lite) or open microscopy software, such as ImageJ. More information on how to open CZI files can be found here: [https://www.zeiss.com/microscopy/us/products/software/zeiss-zen/czi-image-file-format.html] Please cite this dataset and listed publications when using these images.

keywords: optical superresolution microscopy; Zeiss Airyscan; CZI images; conifer; saccate pollen; Podocarpus; Podocarpidites; Smithsonian Tropical Research Institute

published: 2025-01-15

Suski, Cory; Hay, Allison (2025): Seasonal Variation in Responses of Largemouth Bass Caught During Live-Release Angling Tournaments. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0025771_V1

Data was generated from acoustic transmitters implanted in tournament caught and non-angled control largemouth bass across multiple seasons. This data was used to quantify post-release movement, behavior, and mortality in response to angling tournaments at different times of year and varying water temperatures.

published: 2023-05-02

Lee, Jou; Schneider, Jodi (2023): Crossref data for Assessing the agreement in retraction indexing across 4 multidisciplinary sources: Crossref, Retraction Watch, Scopus, and Web of Science. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-9099305_V1

Tab-separated value (TSV) file. 14745 data rows. Each data row represents publication metadata as retrieved from Crossref (http://crossref.org) 2023-04-05 when searching for retracted publications. Each row has the following columns: Index - Our index, starting with 0. DOI - Digital Object Identifier (DOI) for the publication Year - Publication year associated with the DOI. URL - Web location associated with the DOI. Title - Title associated with the DOI. May be blank. Author - Author(s) associated with the DOI. Journal - Publication venue (journal, conference, ...) associated with the DOI RetractionYear - Retraction Year associated with the DOI. May be blank. Category - One or more categories associated with the DOI. May be blank. Our search was via the Crossref REST API and searched for: Update_type=( 'retraction', 'Retraction', 'retracion', 'retration', 'partial_retraction', 'withdrawal','removal')

keywords: retraction; metadata; Crossref; RISRS

published: 2020-04-22

Endres, A. Bryan; Endres, Renata; Krstinić Nižić, Marinela (2020): Croatian Restaurant Allergy Disclosures. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-9891298_V1

Data on Croatian restaurant allergen disclosures on restaurant websites, on-line menus and social media comments

keywords: restaurant; allergen; disclosure; tourism

published: 2024-11-12

Zinnen, Jack; Chase, Marissa; Charles, Brian; Harmon-Threatt, Alexandra; Matthews, Jeffrey (2024): Data for Pollinator seed mixes are phenologically dissimilar to prairie remnants. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-0185914_V1

This is the data set for the article entitled "Pollinator seed mixes are phenologically dissimilar to prairie remnants," a manuscript pending publication in Restoration Ecology. This represents the core phenology data of prairie remnant and pollinator seed mixes that were used for the main analyses. Note that additional data associated with the manuscript are intended to be published as a supplement in the journal.

keywords: native plants; ecological restoration; tallgrass prairie; native plant materials

published: 2024-08-16

Halligan, Susannah; Schummer, Michael; Fournier, Auriel; Musni, Vergie; Davis, J. Brian; Downs, Cynthia; Lavretsky, Philip (2024): Morphological differences between wild and game-farm Mallards in North America. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-3363781_V1

Dataset used for the paper entitled "Morphological differences between wild and game-farm Mallards in North America". Large-scale releases of domesticated, game-farm Mallards to supplement wild populations have resulted in wide-spread introgressive hybridization that changed the genetic constitution of wild populations in eastern North America. The resulting gene flow is well-documented between game-farm and wild Mallards, but the mechanistic consequences from such interactions remain unknown in North America. We provide the first study to characterize and investigate potential differences in morphology between genetically known, wild and game-farm Mallards in North America. We used nine morphological measurements to discriminate between wild and game-farm Mallards with 96% accuracy. Compared to their wild counterparts, game-farm Mallards had longer bodies and tarsi, shorter heads and wings, and shorter, wider, and taller bills. The nail on the end of the bill of game-farm Mallards was longer, and game-farm Mallard bills had a greater lamellae:bill length ratio than wild Mallards. Differences in body morphologies between wild and game-farm Mallards are consistent with an artificial, terrestrial life whereby game-farm Mallards are fed pelleted foods resulting in artificial selection for a more “goose-like” bill. We posit that 1) game-farm Mallards have diverged from their wild ancestral traits of flying and filter feeding towards becoming optimized to run and peck for food; 2) game-farm morphological traits optimized over the last 400 years in domestic environments are likely to be maladaptive in the wild; and 3) the introgression of such traits into wild populations is likely to reduce fitness. Understanding effects of game-farm Mallard introgression requires analysis of various game-farm × wild hybrid generations to determine how domestically-derived traits persist or diminish with each generation.

keywords: Mallard; Game Farm; Morphology; Waterfowl; Duck

published: 2023-07-01

Tonks, Adam; Hwang, Jeongwoo (2023): Data for the paper "Assessment of spatiotemporal flood risk due to compound precipitation extremes across the contiguous United States". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6626437_V1

This is the data used in the paper "Assessment of spatiotemporal flood risk due to compound precipitation extremes across the contiguous United States". Code from the Github repository https://github.com/adtonks/precip_extremes can be used with the data here to reproduce the paper's results. v1.0.0 of the code is also archived at https://doi.org/10.5281/zenodo.8104252 This dataset is derived from NOAA-CIRES-DOE 20th Century Reanalysis V3. The NOAA-CIRES-DOE Twentieth Century Reanalysis Project version 3 used resources of the National Energy Research Scientific Computing Center managed by Lawrence Berkeley National Laboratory which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231 and used resources of NOAA's Remotely Deployed High Performance Computing Systems.

keywords: spatiotemporal; CONUS; United States; precipitation; extremes; flooding

published: 2023-07-05

Njuguna, Joyce; Clark, Lindsay; Lipka, Alexander; Anzoua, Kossonou; Bagmet, Larisa; Chebukin, Pavel; Dwiyanti, Maria; Dzyubenko, Elena; Dzyubenko, Nicolay; Ghimire, Bimal; Jin, Xiaoli; Johnson, Douglas; Kjeldsen, Jens; Nagano, Hironori; Oliveira, Ivone; Peng, Junhua; Petersen, Karen; Sabitov, Andrey; Seong, Eun; Yamada, Toshihiko; Yoo, Ji; Yu, Chang; Zhao, Hu; Munoz, Patricio; Long, Stephen; Sacks, Erik (2023): Impact of genotype-calling methodologies on genome-wide association and genomic prediction in polyploids. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4829913_V2

This dataset contains all data used in the paper "Impact of genotype-calling methodologies on genome-wide association and genomic prediction in polyploids". The dataset includes genotypes and phenotypic data from two autotetraploid species Miscanthus sacchariflorus and Vaccinium corymbosum that was used used for genome wide association studies and genomic prediction and the scripts used in the analysis. In this V2, 2 files have the raw data are added: "Miscanthus_sacchariflorus_RADSeq.vcf" is the VCF file with the raw SNP calls of the Miscanthus sacchariflorus data used for genotype calling using the 6 genotype calling methods. "Blueberry_data_read_depths.RData" is the a RData file with the read depth data that was used for genotype calling in the Blueberry dataset.

keywords: Polyploid; allelic dosage; Bayesian genotype-calling; Genome-wide association; Genomic prediction

published: 2023-07-11

Parulian, Nikolaus (2023): Data for A Conceptual Model for Transparent, Reusable, and Collaborative Data Cleaning. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6827044_V1

The dissertation_demo.zip contains the base code and demonstration purpose for the dissertation: A Conceptual Model for Transparent, Reusable, and Collaborative Data Cleaning. Each chapter has a demo folder for demonstrating provenance queries or tools. The Airbnb dataset for demonstration and simulation is not included in this demo but is available to access directly from the reference website. Any updates on demonstration and examples can be found online at: https://github.com/nikolausn/dissertation_demo

published: 2023-10-22

Davidson, Ruth; Vachaspati, Pranjal; Mirarab, Siavash; Warnow, Tandy (2023): Data from: Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6670066_V1

HGT+ILS datasets from Davidson, R., Vachaspati, P., Mirarab, S., & Warnow, T. (2015). Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer. BMC genomics, 16(10), 1-12. Contains model species trees, true and estimated gene trees, and simulated alignments.

keywords: evolution; computational biology; bioinformatics; phylogenetics

published: 2024-10-12

Langeslay, Blake; Juarez, Gabriel (2024): Data for "Strain rate controls alignment in growing bacterial monolayers". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3501541_V1

Simulation data used to generate plots in the associated paper ("Strain rate controls alignment in growing bacterial monolayers").

published: 2024-11-19

Salami, Malik Oyewale; McCumber, Corinne (2024): Dataset for Reassessment of the agreement in retraction indexing across 4 multidisciplinary sources: Crossref, Retraction Watch, Scopus, and Web of Science. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-8457537_V1

This project investigates retraction indexing agreement among data sources: Crossref, Retraction Watch, Scopus, and Web of Science. As of July 2024, this reassesses the April 2023 union list of Schneider et al. (2023): https://doi.org/10.55835/6441e5cae04dbe5586d06a5f. As of April 2023, over 1 in 5 DOIs had discrepancies in retraction indexing among the 49,924 DOIs indexed as retracted in at least one of Crossref, Retraction Watch, Scopus, and Web of Science (Schneider et al., 2023). Here, we determine what changed in 15 months. Pipeline code to get the results files can be found in the GitHub repository https://github.com/infoqualitylab/retraction-indexing-agreement in the iPython notebook 'MET-STI2024_Reassessment_of_retraction_indexing_agreement.ipynb' Some files have been redacted to remove proprietary data, as noted in README.txt. Among our sources, data is openly available only for Crossref and Retraction Watch. FILE FORMATS: 1) unionlist_completed_2023-09-03-crws-ressess.csv - UTF-8 CSV file 2) unionlist_completed-ria_2024-07-09-crws-ressess.csv - UTF-8 CSV file 3) unionlist-15months-period_sankey.png - Portable Network Graphics (PNG) file 4) unionlist_ria_proportion_comparison.png - Portable Network Graphics (PNG) file 5) README.txt - text file FILE DESCRIPTION: Description of the files can be found in README.txt

keywords: retraction status; data quality; indexing; retraction indexing; metadata; meta-science; RISRS

published: 2023-01-05

Tonks, Adam (2023): Data for the paper "Forecasting West Nile Virus with Graph Neural Networks: Harnessing Spatial Dependence in Irregularly Sampled Geospatial Data". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3628170_V1

This is the data used in the paper "Forecasting West Nile Virus with Graph Neural Networks: Harnessing Spatial Dependence in Irregularly Sampled Geospatial Data". A preprint may be found at https://doi.org/10.48550/arXiv.2212.11367 Code from the Github repository https://github.com/adtonks/mosquito_GNN can be used with the data here to reproduce the paper's results. v1.0.0 of the code is also archived at https://doi.org/10.5281/zenodo.7897830

keywords: west nile virus; machine learning; gnn; mosquito; trap; graph neural network; illinois; geospatial

published: 2023-04-12

Towns, John; Hart, David (2023): XSEDE: Allocations Awards and Usage for the NSF Cyberfrastructure Portfolio, 2004-2022. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3731847_V1

The XSEDE program manages the database of allocation awards for the portfolio of advanced research computing resources funded by the National Science Foundation (NSF). The database holds data for allocation awards dating to the start of the TeraGrid program in 2004 through the XSEDE operational period, which ended August 31, 2022. The project data include lead researcher and affiliation, title and abstract, field of science, and the start and end dates. Along with the project information, the data set includes resource allocation and usage data for each award associated with the project. The data show the transition of resources over a fifteen year span along with the evolution of researchers, fields of science, and institutional representation. Because the XSEDE program has ended, the allocation_award_history file includes all allocations activity initiated via XSEDE processes through August 31, 2022. The Resource Providers and successor program to XSEDE agreed to honor all project allocations made during XSEDE. Thus, allocation awards that extend beyond the end of XSEDE may not reflect all activity that may ultimately be part of the project award. Similarly, allocation usage data only reflects usage reported through August 31, 2022, and may not reflect all activity that may ultimately be conducted by projects that were active beyond XSEDE.

keywords: allocations; cyberinfrastructure; XSEDE

published: 2023-05-30

Clem, C. Scott; Hart, Lily V.; McElrath, Thomas C. (2023): Primary Occurrence Data for "Clem, Hart, & McElrath. 2023. A century of Illinois hover flies (Diptera: Syrphidae): Museum and citizen science data reveal recent range expansions, contractions, and species of potential conservation significance". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1613645_V1

Primary occurrence data for Clem, Hart, & McElrath. 2023. A century of Illinois hover flies (Diptera: Syrphidae): Museum and citizen science data reveal recent range expansions, contractions, and species of potential conservation significance. Included are a license.txt file, the cleaned occurrences from each of the six merged datasets, and a cleaned, merged dataset containing all occurrence records in one spreadsheet, formatted according to Darwin Core standards, with a few extra fields such as GBIF identifiers that were included in some of the original downloads.

keywords: csv; occurrences; syrphidae; hover flies; flies; biodiversity; darwin core; darwin-core; GBIF; citizen science; iNaturalist

published: 2024-02-08

Martinez, Carlos; Pena, Gisselle; Wells, Kaylee K. (2024): "Prairie Directory of North America" (2013) Entries for the Tallgrass, Mixed Grass, and Shortgrass Prairie Regions of the United States. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0421892_V1

This dataset contains transcribed entries from the "Prairie Directory of North America" (Adelman and Schwartz 2013) for the Tallgrass, Mixed Grass, and Shortgrass prairie regions of the united states. We identified the historical spatial extent of the Tallgrass, Mixed Grass, and Shortgrass prairie regions using Ricketts et al. (1999), Olson et al. (2001), and Dixon et al. (2014) and selected the counties entirely or partially within these boundaries from the USDA Forest Service (2022) file. The resulting lists of counties are included as separate files. The dataset contains information on publicly accessible grasslands and prairies in these regions including acreage and amenities like hunting access, restrooms, parking, and trails.

keywords: grasslands; prairies; prairie directory of north america; site amenities; site attributes

published: 2018-05-21

Karigerasi, Manohar H.; Wagner, Lucas K.; Shoemaker, Daniel P. (2018): Geometric analysis of magnetic dimensionality. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3897093_V1

This dataset contains bonding networks and tolerance ranges for geometric magnetic dimensionality. The data can be searched in the html frontend above, code obtained at the GitHub repository, or the raw data can be downloaded as csv below. The csv data contains the results of 42520 compounds (unique icsd_code) from ICSD FindIt v3.5.0. The csv is semicolon-delimited since some fields contain multiple comma-separated values.

keywords: materials science; physics; magnetism; crystallography

published: 2018-07-25

Scannapieco, Frank; Hoang, Linh; Schneider, Jodi (2018): Expert assessment of RobotReviewer data extraction performance on 10 articles. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8274875_V1

The PDF describes the process and data used for the heuristic user evaluation described in the related article “Evaluating an automatic data extraction tool based on the theory of diffusion of innovation” by Linh Hoang, Frank Scannapieco, Linh Cao, Yingjun Guan, Yi-Yun Cheng, and Jodi Schneider (under submission). Frank Scannapieco assessed RobotReviewer data extraction performance on ten articles in 2018-02. Articles are included papers from an update review: Sabharwal A., G.-F.I., Stellrecht E., Scannapeico F.A. Periodontal therapy to prevent the initiation and/or progression of common complex systemic diseases and conditions. An update. Periodontol 2000. In Press. The form was created in consultation with Linh Hoang and Jodi Schneider. To do the assessment, Frank Scannapieco entered PDFs for these ten articles into RobotReviewer and then filled in ten evaluation forms, based on the ten Robot Reviewer automatic data extraction reports. Linh Hoang analyzed these ten evaluation forms and synthesized Frank Scannapieco’s comments to arrive at the evaluation results for the heuristic user evaluation.

keywords: RobotReviewer; systematic review automation; data extraction