Datasets

published: 2021-05-14
 
This document contains the Supplemental Materials for Chapter 4: Climate Change Impacts on Agriculture from the report "An Assessment of the Impacts of Climate Change in Illinois" published in 2021.
keywords: Illinois; climate change; agriculture; impacts; adaptation; crop yield; ISAM; econometrics; days suitable for fieldwork
published: 2021-10-13
 
Drainage network analysis is fundamental to understanding the characteristics of surface hydrology. Based on elevation data, drainage network analysis is often used to extract key hydrological features like drainage networks and streamlines. Limited by raster-based data models, conventional drainage network algorithms typically allow water to flow in 4 or 8 directions (surrounding grids) from a raster grid. To resolve this limitation, this paper describes a new vector-based method for drainage network analysis that allows water to flow in any direction around each location. The method is enabled by rapid advances in Light Detection and Ranging (LiDAR) remote sensing and high-performance computing. The drainage network analysis is conducted using a high-density point cloud instead of Digital Elevation Models (DEMs) at coarse resolutions. Our computational experiments show that the vector-based method can better capture water flows without limiting the number of directions due to imprecise DEMs. Our case study applies the method to the Rowan County watershed in North Carolina, US. After comparing the detected drainage networks and streamlines with corresponding reference data from the US Geological Survey generated with the Geonet software, we find that the new method performs well in capturing the characteristics of water flows on landscape surfaces and forms an accurate drainage network. This dataset contains all the code, notebooks, and datasets used in the study conducted for the research publication titled "A Vector-Based Method for Drainage Network Analysis Based on LiDAR Data".
## What's Inside
A quick explanation of the components:
* `A Vector Approach to Drainage Network Analysis Based on LiDAR Data.ipynb` is a notebook for finding the drainage network based on LiDAR data
* `Picture1.png` is a picture representing the pseudocode of our new algorithm
* `HPC` folder contains code for running the algorithm with sbatch on HPC
** `execute.sh` is a bash script that uses sbatch to conduct large-scale analysis for the algorithm
** `run.sh` is a bash script that calls `execute.sh` for large-scale calculation for the algorithm
** `run.py` includes the code implementing the algorithm
* `Rowan Creek Data` includes the data used in the study
** `3_1.las` and `3_2.las` are the LiDAR data files used in the analysis presented in the paper. Users may use these files to reproduce our results and may replace them with their own LiDAR files to run this method over different areas
** `reference` folder includes reference data from USGS
*** `reference_3_1.tif` and `reference_3_2.tif` are reference data for the drainage system analysis retrieved from USGS.
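For context, the conventional 8-direction raster rule that the description contrasts with the vector-based method can be sketched as follows. This is only an illustration of the raster baseline, not the code shipped in this dataset; the function name `d8_direction` and the small `dem` grid are hypothetical.

```python
import math

# Offsets (row, col) to the 8 neighbours of a raster cell.
NEIGHBOURS = [(-1, -1), (-1, 0), (-1, 1), (0, -1), (0, 1), (1, -1), (1, 0), (1, 1)]


def d8_direction(dem, row, col):
    """Return the (d_row, d_col) offset of steepest descent, or None for a pit or flat cell."""
    best_offset, best_slope = None, 0.0
    for d_row, d_col in NEIGHBOURS:
        r, c = row + d_row, col + d_col
        if 0 <= r < len(dem) and 0 <= c < len(dem[0]):
            # Distance is 1 for edge neighbours and sqrt(2) for diagonal neighbours.
            distance = math.hypot(d_row, d_col)
            slope = (dem[row][col] - dem[r][c]) / distance
            if slope > best_slope:
                best_offset, best_slope = (d_row, d_col), slope
    return best_offset


# Hypothetical 3x3 elevation grid (arbitrary units) used only for illustration.
dem = [
    [9.0, 8.0, 7.0],
    [8.0, 6.0, 5.0],
    [7.0, 5.0, 1.0],
]
centre_flow = d8_direction(dem, 1, 1)
corner_flow = d8_direction(dem, 2, 2)
```

Because each cell can drain to only one of eight neighbours, flow paths are quantised to 45-degree steps, which is the limitation the vector-based method is described as removing.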
keywords: CyberGIS; Drainage System Analysis; LiDAR
published: 2022-03-25
 
Ground based radar data sets collected during the 2013 NASA EVEX Campaign conducted in Roi-Namur island of the Kwajalein Atoll in the Republic of Marshall Islands are deposited in this databank. Radar data were collected with IRIS VHF and ALTAIR VHF/UHF systems.
published: 2022-06-22
 
This dataset helps to investigate the Spatial Accessibility to HIV Testing, Treatment, and Prevention Services in Illinois and Chicago, USA. The main components are: population data, healthcare data, GTFS feeds, and road network data. The core components are: 1) `GTFS` which contains GTFS (<a href="https://gtfs.org/">General Transit Feed Specification</a>) data which is provided by Chicago Transit Authority (CTA) from <a href="https://developers.google.com/transit/gtfs">Google's GTFS feeds</a>. Documentation defines the format and structure of the files that comprise a GTFS dataset: <a href="https://developers.google.com/transit/gtfs/reference?csw=1">https://developers.google.com/transit/gtfs/reference?csw=1</a>. 2) `HealthCare` contains shapefiles describing HIV healthcare providers in Chicago and Illinois respectively. The services come from <a href="https://locator.hiv.gov/">Locator.HIV.gov</a>. 3) `PopData` contains population data for Chicago and Illinois respectively. Data come from The American Community Survey and <a href="https://map.aidsvu.org/map">AIDSVu</a>. AIDSVu (https://map.aidsvu.org/map) provides data on PLWH in Chicago at the census tract level for the year 2017 and in the State of Illinois at the county level for the year 2016. The American Community Survey (ACS) provided the number of people aged 15 to 64 at the census tract level for the year 2017 and at the county level for the year 2016. The ACS provides annually updated information on demographic and socio economic characteristics of people and housing in the U.S. 4) `RoadNetwork` contains the road networks for Chicago and Illinois respectively from <a href="https://www.openstreetmap.org/copyright">OpenStreetMap</a> using the Python <a href="https://osmnx.readthedocs.io/en/stable/">osmnx</a> package. 
<b>The abstract for our paper is:</b> Accomplishing the goals outlined in “Ending the HIV (Human Immunodeficiency Virus) Epidemic: A Plan for America Initiative” will require properly estimating and increasing access to HIV testing, treatment, and prevention services. In this research, a computational spatial method for estimating access was applied to measure distance to services from all points of a city or state while considering the size of the population in need for services as well as both driving and public transportation. Specifically, this study employed the enhanced two-step floating catchment area (E2SFCA) method to measure spatial accessibility to HIV testing, treatment (i.e., Ryan White HIV/AIDS program), and prevention (i.e., Pre-Exposure Prophylaxis [PrEP]) services. The method considered the spatial location of MSM (Men Who have Sex with Men), PLWH (People Living with HIV), and the general adult population 15-64 depending on what HIV services the U.S. Centers for Disease Control (CDC) recommends for each group. The study delineated service- and population-specific accessibility maps, demonstrating the method’s utility by analyzing data corresponding to the city of Chicago and the state of Illinois. Findings indicated health disparities in the south and the northwest of Chicago and particular areas in Illinois, as well as unique health disparities for public transportation compared to driving. The methodology details and computer code are shared for use in research and public policy.
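As a rough illustration of the E2SFCA method named in the abstract, the two steps can be sketched as below. This is a minimal sketch, not the authors' shared code: the function names, the travel-time zones, and the zone weights are all assumptions chosen for the example.

```python
def zone_weight(travel_minutes, zones):
    """Return the weight of the first zone whose upper bound covers the travel time, else 0."""
    for upper_bound, weight in zones:
        if travel_minutes <= upper_bound:
            return weight
    return 0.0


def e2sfca(supply, population, travel_time, zones):
    """Compute an accessibility score per population location.

    supply[j]         capacity of provider j
    population[i]     population at location i
    travel_time[i][j] travel time from location i to provider j
    zones             list of (upper_bound, weight) pairs, sorted by upper_bound
    """
    # Step 1: supply-to-demand ratio of each provider over its weighted catchment.
    ratios = []
    for j, capacity in enumerate(supply):
        demand = sum(
            pop * zone_weight(travel_time[i][j], zones)
            for i, pop in enumerate(population)
        )
        ratios.append(capacity / demand if demand > 0 else 0.0)
    # Step 2: each location sums the weighted ratios of the providers it can reach.
    return [
        sum(ratios[j] * zone_weight(travel_time[i][j], zones) for j in range(len(supply)))
        for i in range(len(population))
    ]


# Hypothetical inputs: one provider, two locations, two travel-time zones (minutes).
zones = [(10, 1.0), (30, 0.5)]
access = e2sfca(
    supply=[10.0],
    population=[100.0, 100.0],
    travel_time=[[5], [20]],
    zones=zones,
)
```

Running the sketch once per travel mode (driving versus public transportation) with mode-specific travel times is one way the comparison described in the abstract could be set up.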
keywords: HIV;spatial accessibility;spatial analysis;public transportation;GIS
published: 2021-10-28
 
Bigheaded carp were collected from the Illinois and Des Plaines Rivers, parts of the Illinois Waterway, from May to November 2018. A total of 93 fish were collected during sampling: 40 females, 41 males, and 12 unsexed fish. GC/MS metabolite profiling analysis detected 180 compounds. Livers from carp at the leading edge had differences in energy use and metabolism, and suppression of protective mechanisms relative to downstream fish; differences were consistent across time. This body of work provides evidence that water quality is linked to carp movement in the Illinois River. As water quality in this region continues to improve, consideration of this impact on carp spread is essential to protect the Great Lakes.
keywords: water quality; metabolites; range expansion; energy; contaminants
published: 2023-10-26
 
This dataset contains MRI data and Imaris modeling analysis of CLARITY-cleared, immunostained tissue associated with a study that assessed the effects of lipid blends containing various levels of a hydrolyzed fat system on myelin development in healthy neonatal piglets. Data are from thirty-two piglets of mixed sexes across four diet treatment groups and includes a sow-fed reference group. MRI data (presented in Figure 2 of the associated article) consists of volumetric data from Voxel-Based Morphometry analysis in brain grey matter and white matter, as well as mean fractional anisotropy and mean orientation dispersion index data from Tract-Based Spatial Statistics analysis. Imaris data (presented in Figure 3 of the associated article) consists of twenty-one select output measures from 3D modeling analysis of PLP-stained prefrontal cortex tissue. All methods used for collection/generation/processing of data are described in the associated article: Louie AY, Rund LA, Komiyama-Kasai KA, Weisenberger KE, Stanke KL, Larsen RJ, Leyshon BJ, Kuchan MJ, Das T, Steelman AJ. A hydrolyzed lipid blend diet promotes myelination in neonatal piglets in a region and concentration-dependent manner. J Neurosci Res. 2023.
keywords: myelin; dietary lipid; white matter; CLARITY; Imaris; voxel-based morphometry; diffusion tensor imaging
published: 2021-06-24
 
This dataset contains EEG and temperature data acquired from inside the bore of an MRI scanner during scanning with two different types of fMRI sequences: single-band and multi-band. The EEG data were acquired from the heads of adult humans undergoing scanning, and can be used to assess differences in EEG data quality due to sequence type. The temperature data were acquired from a watermelon phantom and can be used to assess heating differences due to sequence type.
keywords: Simultaneous EEG-fMRI, Multi-band fMRI, Safety, Heating
published: 2020-09-27
 
This dataset contains the R code used to produce the figures submitted in the manuscript titled "Understanding the multifaceted geospatial software ecosystem: a survey approach". The raw survey data used to populate these charts cannot be shared due to the survey consent agreement.
keywords: R; figures; geospatial software
published: 2021-06-17
 
Model output dataset (6-hourly) from Weather Research and Forecasting (WRF) model simulations over South America with the added capability of water vapor tracers to track the moisture that originates over the Amazon and La Plata river basins. The simulations were performed for the period 2003-2013 at 20-km horizontal resolution, fully coupled with the Noah-MP land surface model. A limited number of original output variables, sufficient for reproducing the analyses in papers that cite this dataset, are included here. The attached wrfout_southamerica_readme.txt contains detailed information about the file format and variables. For the complete model dataset, contact francina@illinois.edu.
keywords: WRF; Amazon; La Plata; South America; Numerical tracers
published: 2023-09-01
 
An online and paper knowledge, attitudes, and practices survey on ticks and tick-borne diseases (TBD) was distributed to farmers in Illinois from summer 2020 to spring 2022 (paper version titled Final Draft Farmer KAP_v.SoftCopy_Revised.docx). These are the raw data associated with that survey and the survey questions used (FarmerTickKAPdata.csv, data dictionary in Data Description.docx). We have added calculated values (columns 286 to end, code for calculation in FarmerKAPvariableCalculation.R), including: the tick knowledge score, TBD knowledge score, and total knowledge score, each of which is the number of correct answers in its category, and the score percent, which is the proportion of correct answers in each category.
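The score definitions above (count of correct answers and proportion correct per category) can be illustrated with a short sketch. The dataset's own calculation is in FarmerKAPvariableCalculation.R; the Python below is only an illustration, and the question identifiers and answers are hypothetical, not taken from the survey.

```python
def knowledge_scores(responses, answer_key):
    """Return (score, score_percent) for one respondent in one category.

    score         number of questions answered correctly
    score_percent proportion of the category's questions answered correctly
    """
    correct = sum(
        1 for question, answer in answer_key.items()
        if responses.get(question) == answer
    )
    return correct, correct / len(answer_key)


# Hypothetical answer key and one respondent's answers for a 4-question category.
tick_key = {"q1": "yes", "q2": "no", "q3": "yes", "q4": "no"}
respondent = {"q1": "yes", "q2": "yes", "q3": "yes"}  # q4 left unanswered

tick_score, tick_percent = knowledge_scores(respondent, tick_key)
```

In this sketch an unanswered question counts as incorrect; whether the published scores treat missing answers the same way is not stated in the description.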
keywords: ticks; survey; tick-borne disease; farmer
published: 2021-10-22
 
This dataset includes the source data for Figures 1-4 and supplementary figures 1-10 for the manuscript "Kinetic and structural mechanism for DNA unwinding by a non-hexameric helicase".
published: 2021-12-01
 
An online knowledge, attitudes, and practices survey on ticks and tick-borne diseases was distributed to veterinary professionals in Southern and Central Illinois during summer and fall 2020. These are the raw data associated with that survey and the survey questions used. * NOTE: "age" and "gender" variables were removed from the data to protect participants.
keywords: ticks; veterinary medicine; tick-borne disease; survey
published: 2021-08-14
 
1. Rice H2 - Destructive Harvest - These data are for the destructive harvest (above-ground biomass) of 30 diverse indica rice genotypes that were grown to evaluate natural variation as well as the heritability of photosynthesis-related traits. Traits measured include: plant height, leaf area, plant fresh and dry weights, and tiller number. 2. Rice H2 - ACi Response Summary - These data characterize the response of CO2 uptake to change in intercellular CO2 concentration in 30 diverse indica rice genotypes. These measurements were taken to evaluate natural variation and the heritability of photosynthesis-related traits in rice. 3. Rice H2 - Survey Style Gas Exchange Measurements - These data document steady-state survey style gas exchange measurements in 30 diverse indica rice genotypes. These measurements were taken to evaluate natural variation and the heritability of photosynthesis-related traits in rice.
keywords: photosynthesis, photosynthetic capacity, natural variation, heritability, food security, rice
published: 2021-11-18
 
This dataset contains sequencing data obtained from Illumina MiSeq device to prove the concept of the proposed 2DDNA framework. Please refer to README.txt for detailed description of each file.
keywords: machine learning;image processing;computer vision;rewritable storage system;2D DNA-based data storage
published: 2021-01-27
 
*This is the third version of the dataset*. New changes in this 3rd version: <i>1. replaces simulations where the initial condition consists of a sinusoidal channel with topographic perturbations with simulations where the initial condition consists of a sinusoidal channel without topographic perturbations. These simulations better illustrate the transformation of a nondendritic network into a dendritic one. 2. contains two additional simulations showing how total domain size affects the landscape's dynamism. 3. changes dataset title to reflect the publication's title</i>
This dataset contains data from 18 simulations using a landscape evolution model. A landscape evolution model simulates how uplift and rock incision shape the Earth's (or other planets') surface. To date, most landscape evolution models exhibit "extreme memory" (paper: https://doi.org/10.1029/2019GL083305 and dataset: https://doi.org/10.13012/B2IDB-4484338_V1). Extreme memory in landscape evolution models causes initial conditions to be unrealistically preserved. This dataset contains simulations from a new landscape evolution model that incorporates a sub-model that allows bedrock channels to erode laterally. With this addition, the landscapes no longer exhibit extreme memory. Initial conditions are erased over time, and the landscapes tend towards a dynamic steady state instead of a static one. The model with lateral erosion is named LEM-wLE (Landscape Evolution Model with Lateral Erosion) and the model without lateral erosion is named LEM-woLE (Landscape Evolution Model without Lateral Erosion).
There are 16 folders in total. Here are the descriptions:
<i>>LEM-woLE_simulations:</i> This folder contains simulations using LEM-woLE. Inside the folder are 5 subfolders containing 100 elevation rasters, 100 drainage area rasters, and 100 plots showing the slope-area relationship. Elevation depicts the height of the landscape, and drainage area represents a contributing area that is upslope. Each folder corresponds to a different initial condition. Driver files and code for these simulations can be found at https://github.com/jeffskwang/LEM-wLE.
<i>>MOVIE_S#_data:</i> There are 13 data folders that contain raster data for 13 simulations using LEM-wLE. Inside each folder are 1000 elevation rasters, 1000 drainage area rasters, and 1000 plots showing the slope-area relationship. Driver files and code for these simulations can be found at https://github.com/jeffskwang/LEM-wLE.
<i>>movies_mp4_format:</i> For each data folder there are 3 movies generated that show elevation (a), drainage area (b), and erosion rates (c). These files are formatted in the mp4 format and are best viewed using VLC media player (https://www.videolan.org/vlc/index.html).
<i>>movies_wmv_format:</i> This folder contains the same movies as the "movies_mp4_format" folder, but they are in a wmv format. These movies can be viewed using Windows media player or other Windows platform movie software.
Here are the captions for the 13 movies:
Movie S1. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Sinusoidal channel without randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 1.
Movie S2. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Inclined with small, randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 1.
Movie S3. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Inclined with large, randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 1.
Movie S4. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: V-shaped valley with randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 1.
Movie S5. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Sinusoidal channel with randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 1.
Movie S6. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Sinusoidal channel without randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 0.25.
Movie S7. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Sinusoidal channel without randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 0.5.
Movie S8. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Sinusoidal channel without randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 0.75.
Movie S9. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Flat with randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 1.
Movie S10. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Flat with randomized perturbations. Boundary Condition: 2 open boundaries at the top and bottom of the domain, and 2 closed boundaries on the left and right sides. KL/KV = 1.
Movie S11. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Flat with randomized perturbations. Boundary Condition: 4 open boundaries. KL/KV = 1.
Movie S12. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Flat with randomized perturbations. Boundary Condition: 4 open boundaries. KL/KV = 1. Compared to Movie S11, the length of the domain is 50% shorter, decreasing the total domain area.
Movie S13. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Flat with randomized perturbations. Boundary Condition: 4 open boundaries. KL/KV = 1. Compared to Movie S11, the length of the domain is 50% longer, increasing the total domain area.
The associated publication for this dataset has not yet been published, and we will update this description with a link when it is.
keywords: landscape evolution; drainage networks; lateral migration; geomorphology
published: 2021-05-10
 
UAV-based high-resolution multispectral time-series orthophotos used to understand the relation between growth dynamics, imagery temporal resolution, and end-of-season biomass productivity of biomass sorghum as a bioenergy crop. The sensor used is a RedEdge Micasense, flown at 40 meters above ground level at the Energy Farm, UIUC, in 2019.
keywords: Unmanned aerial vehicles; High throughput phenotyping; Machine learning; Bioenergy crops
published: 2021-07-30
 
This data comes from a scoping review associated with the project called Reducing the Inadvertent Spread of Retracted Science. The data summarizes the fields that have been explored by existing research on retraction, a list of studies comparing retraction in different fields, and a list of studies focused on retraction of COVID-19 articles.
keywords: retraction; fields; disciplines; research integrity
published: 2021-04-30
 
This repository includes scripts and datasets for the paper, "Accurate Large-scale Phylogeny-Aware Alignment using BAli-Phy" submitted to Bioinformatics.
keywords: BAli-Phy;Bayesian co-estimation;multiple sequence alignment
published: 2021-05-26
 
Steady-state and dynamic gas exchange data for maize (B73), sugarcane (CP88-1762) and sorghum (Tx430)
keywords: C4 plants; gas exchange
published: 2022-03-23
 
This dataset is an estimate of county-to-county commodity delivery through the cold chain in 2017. For each county pair, our model estimates the weight [kg] and value [$] of the cold chain flow between origin and destination for SCTG 5 and SCTG 7 commodities.
- SCTG 5 - Meat, poultry, fish, seafood, and their preparations
- SCTG 7 - Other prepared foodstuffs, fats, and oils
keywords: food flows; cold chain; county-scale; United States; carbon footprint
published: 2024-01-01
 
Supplementary data tables for the dissertation "Hybridization dynamics and population genomics of a Manacus hybrid zone." This work focuses on the dynamics of hybridization over time in two species of tropical birds, the golden-collared manakin (Manacus vitellinus) and white-collared manakin (Manacus candei) comparing data from historical museum samples and contemporary wild-caught birds. Table A1 contains the sample metadata for the Manacus Restriction site-associated DNA sequencing dataset used in the dissertation with associated NCBI Biosample Accession numbers, Smithsonian Museum of Natural History number (where applicable), sample IDs, sampling site locations, and sample information of year the sample was taken, age, and sex. Table A6 contains phenotypic measurements of male plumage traits of manakins used in cline analyses to assess hybrid zone movement over time in historical and contemporary datasets, including beard length (mm), epaulet width (mm), tail length (mm), collar color (nm), and belly color (nm). Table A7 contains a summary of male plumage measurements across the hybrid zone. Table C1 contains a list of annotated protein coding genes in candidate regions of interest in Manacus genomes using outlier regions of genomic divergence, linkage disequilibrium, and enrichment of parental private alleles.
keywords: csv; manacus; manakin; genomics; dissertation
published: 2020-08-22
 
We are releasing the tracing dataset of four microservice benchmarks deployed on our dedicated Kubernetes cluster consisting of 15 heterogeneous nodes. The dataset is not sampled and is from selected types of requests in each benchmark, i.e., compose-posts in the social network application, compose-reviews in the media service application, book-rooms in the hotel reservation application, and reserve-tickets in the train ticket booking application. The four microservice applications come from [DeathStarBench](https://github.com/delimitrou/DeathStarBench) and [Train-Ticket](https://github.com/FudanSELab/train-ticket). The performance anomaly injector is from [FIRM](https://gitlab.engr.illinois.edu/DEPEND/firm.git). The dataset was preprocessed from the raw data generated in FIRM's tracing system. The dataset is split by the microservice component on which the performance anomaly is located (as the file name suggests). Each dataset is in CSV format and fields are separated by commas. Each line consists of the tracing ID and the duration (in 10^(-3) ms) of each component. Execution paths are specified in `execution_paths.txt` in each directory.
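A minimal sketch of reading one of these CSV files, assuming (this is not confirmed by the description) that there is no header row, that the tracing ID is the first field, and that every remaining field is a component duration. The function name and the sample rows are hypothetical.

```python
import csv
import io


def read_trace_durations(lines):
    """Map each tracing ID to its per-component durations, converted to milliseconds."""
    traces = {}
    for row in csv.reader(lines):
        if not row:
            continue  # skip blank lines
        trace_id, raw_durations = row[0], row[1:]
        # Durations are stored in units of 10^(-3) ms, so divide by 1000 to get ms.
        traces[trace_id] = [float(value) / 1000.0 for value in raw_durations]
    return traces


# Hypothetical two-line sample standing in for a real file from the dataset.
sample = io.StringIO("trace-a,1500,250\ntrace-b,900,100\n")
durations_ms = read_trace_durations(sample)
```

With a real file, pass an open file handle instead of the `io.StringIO` sample, and consult `execution_paths.txt` in the same directory to learn which component each column refers to.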
keywords: Microservices; Tracing; Performance
published: 2020-10-16
 
Video footage of an Eastern Box Turtle (Terrapene carolina carolina) partially predating a Field Sparrow (Spizella pusilla) nest at 0845 h on 31 May 2020. Please note that the date on the video footage is incorrect due to user error, but the time is correct.
keywords: nest predation; turtle; songbird; nest camera; Terrapene carolina carolina; Spizella pusilla
published: 2020-12-30
 
High-speed X-ray videos of four E. abruptus specimens recorded at the Advanced Photon Source (Argonne National Laboratory) in the summer of 2018 and corresponding position data of landmarks tracked during the motion. See readme file for more details.
published: 2020-12-31
 
This dataset contains the amino acid and nucleotide alignments corresponding to the phylogenetic analyses of South et al. 2020 in Systematic Entomology. This dataset also includes the gene trees that were used as input for coalescent analysis in ASTRAL.
keywords: Plecoptera; stoneflies; phylogeny; insects