Displaying 376 - 400 of 637 in total

Subject Area

Life Sciences (333)
Social Sciences (135)
Physical Sciences (92)
Technology and Engineering (62)
Uncategorized (14)
Arts and Humanities (1)

Funder

Other (193)
U.S. National Science Foundation (NSF) (189)
U.S. Department of Energy (DOE) (64)
U.S. National Institutes of Health (NIH) (60)
U.S. Department of Agriculture (USDA) (41)
Illinois Department of Natural Resources (IDNR) (17)
U.S. Geological Survey (USGS) (6)
U.S. National Aeronautics and Space Administration (NASA) (5)
Illinois Department of Transportation (IDOT) (4)
U.S. Army (2)

Publication Year

2021 (108)
2022 (108)
2020 (96)
2023 (78)
2019 (72)
2018 (62)
2024 (40)
2017 (36)
2016 (30)
2025 (2)
2009 (1)
2011 (1)
2012 (1)
2014 (1)
2015 (1)

License

CC0 (355)
CC BY (262)
custom (20)

Datasets

published: 2021-04-29
 
Global assessments of climate extremes typically do not account for the unique characteristics of individual crops. A consistent definition of the exposure of specific crops to extreme weather would enable agriculturally-relevant hazard quantification. We introduce the Agriculturally-Relevant Exposure to Shocks (ARES) model, a novel database of both the temperature and moisture extremes facing individual crops by explicitly accounting for crop characteristics. Specifically, we estimate crop-specific temperature and moisture shocks during the growing season for a 0.25-degree spatial grid and daily time scale from 1961-2014 globally for 17 crops. The resulting database presented here provides annual crop- and event-specific exposure rates. Both gridded and country-level exposure rates are provided for each of the 17 crops. Our results provide new insights into the changes in the magnitude as well as spatial and temporal distribution of extreme events that impact crops over the past half-century. For additional information, please see the related paper by Jackson et al. (2021) in Environmental Research Letters.
keywords: Crop-specific; weather extremes; temperature; moisture; global; gridded; time series
published: 2021-04-28
 
An Atlas.ti dataset and accompanying documentation of a thematic analysis of problems and opportunities associated with retracted research and its continued citation.
keywords: Retraction; Citation; Problems and Opportunities
published: 2021-04-22
 
All code in Matlab .m scripts or functions (version R2019b) Affiliated with article “Temperate and chronic virus competition leads to low lysogen frequency” published in the Journal of Theoretical Biology (2021) Codes simulate and plot the solutions of an Ordinary Differential Equations model and generate bifurcation diagrams.
published: 2021-04-22
 
Author-ity 2018 dataset Prepared by Vetle Torvik Apr. 22, 2021 The dataset is based on a snapshot of PubMed taken in December 2018 (NLMs baseline 2018 plus updates throughout 2018). A total of 29.1 million Article records and 114.2 million author name instances. Each instance of an author name is uniquely represented by the PMID and the position on the paper (e.g., 10786286_3 is the third author name on PMID 10786286). Thus, each cluster is represented by a collection of author name instances. The instances were first grouped into "blocks" by last name and first name initial (including some close variants), and then each block was separately subjected to clustering. The resulting clusters are provided in two different formats, the first in a file with only IDs and PMIDs, and the second in a file with cluster summaries: #################### File 1: au2id2018.tsv #################### Each line corresponds to an author name instance (PMID and Author name position) with an Author ID. It has the following tab-delimited fields: 1. Author ID 2. PMID 3. Author name position ######################## File 2: authority2018.tsv ######################### Each line corresponds to a predicted author-individual represented by cluster of author name instances and a summary of all the corresponding papers and author name variants. Each cluster has a unique Author ID (the PMID of the earliest paper in the cluster and the author name position). The summary has the following tab-delimited fields: 1. Author ID (or cluster ID) e.g., 3797874_1 represents a cluster where 3797874_1 is the earliest author name instance. 2. cluster size (number of author name instances on papers) 3. name variants separated by '|' with counts in parenthesis. Each variant of the format lastname_firstname middleinitial, suffix 4. last name variants separated by '|' 5. first name variants separated by '|' 6. middle initial variants separated by '|' ('-' if none) 7. suffix variants separated by '|' ('-' if none) 8. email addresses separated by '|' ('-' if none) 9. ORCIDs separated by '|' ('-' if none). From 2019 ORCID Public Data File https://orcid.org/ and from PubMed XML 10. range of years (e.g., 1997-2009) 11. Top 20 most frequent affiliation words (after stoplisting and tokenizing; some phrases are also made) with counts in parenthesis; separated by '|'; ('-' if none) 12. Top 20 most frequent MeSH (after stoplisting) with counts in parenthesis; separated by '|'; ('-' if none) 13. Journal names with counts in parenthesis (separated by '|'), 14. Top 20 most frequent title words (after stoplisting and tokenizing) with counts in parenthesis; separated by '|'; ('-' if none) 15. Co-author names (lowercased lastname and first/middle initials) with counts in parenthesis; separated by '|'; ('-' if none) 16. Author name instances (PMID_auno separated by '|') 17. Grant IDs (after normalization; '-' if none given; separated by '|'), 18. Total number of times cited. (Citations are based on references harvested from open sources such as PMC). 19. h-index 20. Citation counts (e.g., for h-index): PMIDs by the author that have been cited (with total citation counts in parenthesis); separated by '|'
keywords: author name disambiguation; PubMed
published: 2021-04-19
 
Dataset compiled by Yushu Xia and Michelle Wander for the Soil Health Institute. Data were recovered from peer reviewed literature reporting results for three soil quality indicators (SQIs) (β-glucosidase (BG), fluorescein diacetate (FDA) hydrolysis, and permanganate oxidizable carbon (POXC)) in terms of their relative response to management where soils under grassland cover, no-tillage, cover crops, residue return and organic amendments were compared to conventionally managed controls. Peer-reviewed articles published between January of 1990 and May 2018 were searched using the Thomas Reuters Web of Science database (Thomas Reuters, Philadelphia, Pennsylvania) and Google Scholar to identify studies reporting results for: “β-glucosidase”, “permanganate oxidizable carbon”, “active carbon”, “readily oxidizable carbon”, and “fluorescein diacetate hydrolysis”, together with one or more of the following: “management practice”, “tillage”, “cover crop”, “residue”, “organic fertilizer”, or “manure”. Records were tabulated to compare SQI abundance in soil maintained under a control and soil aggrading practice with the intent to contribute to SQI databases that will support development of interpretive frameworks and/or algorithms including pedo-transfer functions relating indicator abundance to management practices and site specific factors. Meta-data include the following key descriptor variables and covariates useful for development of scoring functions: 1) identifying factors for the study site (location, year of initiation of study and year in which data was reported), 2) soil textural class, pH, and SOC, 3) depth and timing of soil sampling, 4) analytical methods for SQI quantification, 5) units used in published works (i.e. equivalent mass, concentration), 6) SQI abundances, and 7) statistical significance of difference comparisons. *Note: Blank values in tables are considered unreported data.
keywords: Soil health promoting practices; Soil quality indicators; β-glucosidase; fluorescein diacetate hydrolysis; Permanganate oxidizable carbon; Greenhouse gas emissions; Scoring curves; Soil Management Assessment Framework
published: 2021-04-18
 
This dataset contains all the code, notebooks, datasets used in the study conducted for the research publication titled "Multi-scale CyberGIS Analytics for Detecting Spatiotemporal Patterns of COVID-19 Data". Specifically, this package include the artifacts used to conduct spatial-temporal analysis with space time kernel density estimation (STKDE) using COVID-19 data, which should help readers to reproduce some of the analysis and learn about the methods that were conducted in the associated book chapter. ## What’s inside - A quick explanation of the components of the zip file * Multi-scale CyberGIS Analytics for Detecting Spatiotemporal Patterns of COVID-19.ipynb is a jupyter notebook for this project. It contains codes for preprocessing, space time kernel density estimation, postprocessing, and visualization. * data is a folder containing all data needed for the notebook * data/county.txt: US counties information and fip code from Natural Resources Conservation Service. * data/us-counties.txt: County-level COVID-19 data collected from New York Times COVID-19 github repository on August 9th, 2020. * data/covid_death.txt: COVID-19 death information derived after preprocessing step, preparing the input data for STKDE. Each record is if the following format (fips, spatial_x, spatial_y, date, number of death ). * data/stkdefinal.txt: result obtained by conducting STKDE. * wolfram_mathmatica is a folder for 3D visulization code. * wolfram_mathmatica/Visualization.nb: code for visulization of STKDE result via weolfram mathmatica. * img is a folder for figures. * img/above.png: result of 3-D visulization result, above view. * img/side.png: result of 3-D visulization, side view.
keywords: CyberGIS; COVID-19; Space-time kernel density estimation; Spatiotemporal patterns
published: 2021-04-15
 
To generate the bibliographic and survey data to support a data reuse study conducted by several Library faculty and accepted for publication in the Journal of Academic Librarianship, the project team utilized a series of web-based online scripts that employed several different endpoints from the Scopus API. The related dataset: "Data for: An Examination of Data Reuse Practices within Highly Cited Articles of Faculty at a Research University" contains survey design and results. <br /> 1) <b>getScopus_API_process_dmp_IDB.asp</b>: used the search API query the Scopus database API for papers by UIUC authors published in 2015 -- limited to one of 9 pre-defined Scopus subject areas -- and retrieve metadata results sorted highest to lowest by the number of times the retrieved articles were cited. The URL for the basic searches took the following form: https://api.elsevier.com/content/search/scopus?query=(AFFIL%28(urbana%20OR%20champaign) AND univ*%29) OR (AF-ID(60000745) OR AF-ID(60005290))&apikey=xxxxxx&start=" & nstart & "&count=25&date=2015&view=COMPLETE&sort=citedby-count&subj=PHYS<br /> Here, the variable nstart was incremented by 25 each iteration and 25 records were retrieved in each pass. The subject area was renamed (e.g. from PHYS to COMP for computer science) in each of the 9 runs. This script does not use the Scopus API cursor but downloads 25 records at a time for up to 28 times -- or 675 maximum bibliographic records. The project team felt that looking at the most 675 cited articles from UIUC faculty in each of the 9 subject areas was sufficient to gather a robust, representative sample of articles from 2015. These downloaded records were stored in a temporary table that was renamed for each of the 9 subject areas. <br /> 2) <b>get_citing_from_surveys_IDB.asp</b>: takes a Scopus article ID (eid) from the 49 UIUC author returned surveys and retrieves short citing article references, 200 at a time, into a temporary composite table. These citing records contain only one author, no author affiliations, and no author email addresses. This script uses the Scopus API cursor=* feature and is able to download all the citing references of an article 200 records at a time. <br /> 3) <b>put_in_all_authors_affil_IDB.asp</b>: adds important data to the short citing records. The script adds all co-authors and their affiliations, the corresponding author, and author email addresses. <br /> 4) <b>process_for_final_IDB.asp</b>: creates a relational database table with author, title, and source journal information for each of the citing articles that can be copied as an Excel file for processing by the Qualtrics survey software. This was initially 4,626 citing articles over the 49 UIUC authored articles, but was reduced to 2,041 entries after checking for available email addresses and eliminating duplicates.
keywords: Scopus API; Citing Records; Most Cited Articles
published: 2021-04-12
 
Conjugate photoelectron energy spectra derived from coincident FUV and radio measurements. These are outputs of simulations from the semi-empirical SAMI2-PE (Varney et al. 2012) for the night of January 4, 2020.
keywords: Conjugate photoelectrons, SAMI2-PE, ICON
published: 2021-04-11
 
This dataset contains RNASim1000, Cox1-Het datasets as well as analyses of RNASim1000, Cox1-Het, and 1000M1(HF).
keywords: phylogeny estimation; maximum likelihood; RAxML; IQ-TREE; FastTree; cox1; heterotachy; disjoint tree mergers; Tree of Life
published: 2021-04-06
 
These datasets contain modeling files and GIS data associated with a risk assessment study for the Cambrian-Ordovician sandstone aquifer system in Illinois from predevelopment (1863) to the year 2070. Modeling work was completed using the Illinois Groundwater Flow Model, a regional MODFLOW model developed for water supply planning in Illinois, as a base model. The model is run using the graphical user interface Groundwater Vistas 7.0. The development and technical details of the base Illinois Groundwater Flow Model, including hydraulic property zonation, boundary conditions, hydrostratigraphy, solver settings, and discretization, are described in Abrams et al. (2018). Modifications to this base model (the version presented here) are described in Mannix et al. (2018), Hadley et al. (2020) and Abrams and Cullen (2020). Modifications include removal of particular multi-aquifer wells to improve calibration, changing Sandwich Fault Zone properties to achieve calibration at production wells within and near the fault zone, and the incorporation of demand scenarios based on a participatory modeling project with the Southwest Water Planning Group. The zipped folder of model files contains MODFLOW input (package) files, Groundwater Vistas files, and a head file for the entire model run. The zipped folder of GIS data contains rasters of: simulated drawdown in the St. Peter sandstone from predevelopment to 2018, simulated drawdown in the Ironton-Galesville sandstone from predevelopment to 2018, simulated head difference between the St. Peter and Ironton-Galesville sandstone units in 2018, simulated head above the top of the St. Peter sandstone for the years 2029, 2050, and 2070, and simulated head above the top of the Ironton-Galesville sandstone for the years 2029, 2050, and 2070. Raster outputs were derived directly from the simulated heads in the Illinois Groundwater Flow Model. Rasters are clipped to the 8 county northeastern Illinois region (Cook, DuPage, Grundy, Kane, Kendall, Lake, McHenry, and Will counties). Well names, historic and current head targets, and spatial offsets for the Illinois Groundwater Flow Model are available upon request via a data license agreement. Please contact authors to set this up if needed.
keywords: groundwater; aquifer; sandstone aquifer; risk assessment; depletion; Illinois; MODFLOW; modeling
published: 2021-04-05
 
West Nile virus data, aggregated by 55 1-km hexagons, within the NWMAD jurisdiction Cook County, IL. The data incorporates deidentified human illness, mosquito infection and abundance, socio-economic data, and other abiotic and biotic predictors by epi-weeks 18-38 for the years 2005-2016.
keywords: WNV; modeling
published: 2021-03-31
 
This archive contains the datasets used in the paper "Recursive MAGUS: scalable and accurate multiple sequence alignment". - 16S.3, 16S.T, 16S.B.ALL - HomFam - RNASim These can also be found at https://sites.google.com/eng.ucsd.edu/datasets/alignment/pastaupp
published: 2020-12-14
 
Femoral skeletal traits (cross-sectional properties, maximum distal metaphyseal breadth of the femur, and maximum superior/inferior femoral head diameter) of 219 Taiwanese subadult individuals (aged 0 to 17) as used in the manuscript "Allometric scaling and growth: evaluation and applications in subadult body mass estimation."
keywords: femur; cross-sectional geometry; osteometrics; subadult
published: 2020-12-16
 
Terrorism is among the most pressing challenges to democratic governance around the world. The Responsible Terrorism Coverage (or ResTeCo) project aims to address a fundamental dilemma facing 21st century societies: how to give citizens the information they need without giving terrorists the kind of attention they want. The ResTeCo hopes to inform best practices by using extreme-scale text analytic methods to extract information from more than 70 years of terrorism-related media coverage from around the world and across 5 languages. Our goal is to expand the available data on media responses to terrorism and enable the development of empirically-validated models for socially responsible, effective news organizations. This particular dataset contains information extracted from terrorism-related stories in the Summary of World Broadcasts published between 1979 and 2019. It includes variables that measure the relative share of terrorism-related topics, the valence and intensity of emotional language, as well as the people, places, and organizations mentioned. This dataset contains 3 files: 1. "ResTeCo Project SWB Dataset Variable Descriptions.pdf" A detailed codebook containing a summary of the Responsible Terrorism Coverage (ResTeCo) Project BBC Summary of World Broadcasts (SWB) Dataset and descriptions of all variables. 2. "resteco-swb.csv" This file contains the data extracted from terrorism-related media coverage in the BBC Summary of World Broadcasts (SWB) between 1979 and 2019. It includes variables that measure the relative share of topics, sentiment, and emotion present in this coverage. There are also variables that contain metadata and list the people, places, and organizations mentioned in these articles. There are 53 variables and 438,373 observations. The variable "id" uniquely identifies each observation. Each observation represents a single news article. Please note that care should be taken when using "resteco-swb.csv". The file may not be suitable to use in a spreadsheet program like Excel as some of the values get to be quite large. Excel cannot handle some of these large values, which may cause the data to appear corrupted within the software. It is encouraged that a user of this data use a statistical package such as Stata, R, or Python to ensure the structure and quality of the data remains preserved. 3. "README.md" This file contains useful information for the user about the dataset. It is a text file written in markdown language Citation Guidelines 1) To cite this codebook please use the following citation: Althaus, Scott, Joseph Bajjalieh, Marc Jungblut, Dan Shalmon, Subhankar Ghosh, and Pradnyesh Joshi. 2020. Responsible Terrorism Coverage (ResTeCo) Project BBC Summary of World Broadcasts (SWB) Dataset Variable Descriptions. Responsible Terrorism Coverage (ResTeCo) Project BBC Summary of World Broadcasts (SWB) Dataset. Cline Center for Advanced Social Research. December 16. University of Illinois Urbana-Champaign. doi: https://doi.org/10.13012/B2IDB-2128492_V1 2) To cite the data please use the following citation: Althaus, Scott, Joseph Bajjalieh, Marc Jungblut, Dan Shalmon, Subhankar Ghosh, and Pradnyesh Joshi. 2020. Responsible Terrorism Coverage (ResTeCo) Project Summary of World Broadcasts (SWB) Dataset. Cline Center for Advanced Social Research. December 16. University of Illinois Urbana-Champaign. doi: https://doi.org/10.13012/B2IDB-2128492_V1
keywords: Terrorism, Text Analytics, News Coverage, Topic Modeling, Sentiment Analysis
published: 2020-12-29
 
Three datasets: species_abundance_data, species_traits, and environmental_data. The three datasets were collected in the Fortuna Forest Reserve (8°45′ N, 82°15′ W) and Palo Seco Protected Forest (8°45′ N, 82°13′ W) located in western Panama. The two reserves support humid to super-humid rainforests, according to Holdridge (1947). The species_abundance_data and species_traits datasets were collected across 15 subplots of 25 m2 in 12 one-hectare permanent plots distributed across the two reserves. The subplots were spaced 20 m apart along three 5 m wide transects, each 30 m apart. Please read Prada et al. (2017) for details on the environmental characteristics of the study area. Prada CM, Morris A, Andersen KM, et al (2017) Soils and rainfall drive landscape-scale changes in the diversity and functional composition of tree communities in a premontane tropical forest. J Veg Sci 28:859–870. https://doi.org/10.1111/jvs.12540
keywords: functional traits; plants; ferns; environmental data; Fortuna; species data; community ecology
published: 2021-02-24
 
This dataset contains model output from the Community Earth System Model, Version 2 (CESM2; Danabasoglu et al. 2020). These data were used for analysis in Impacts of Large-Scale Soil Moisture Anomalies in Southeastern South America, published in the Journal of Hydrometeorology (DOI: 10.1175/JHM-D-20-0116.1). See this publication for details of the model simulations that created these data. Four NetCDF (.nc) files are included in this dataset. Two files correspond to the control simulation (FHIST_SP_control) and two files correspond to a simulation with a dry soil moisture anomaly imposed in southeastern South America (FHIST_SP_dry; see the publication mentioned in the preceding paragraph for details on the spatial extent of the imposed anomaly). For each simulation, one file corresponds to output from the atmospheric model (file names with "cam") of CESM2 and the other to the land model (file names with "clm2"). These files are raw CESM output concatenated into a single file for each simulation. All files include data from 1979-01-02 to 2003-12-31 at a daily resolution. The spatial resolution of all files is about 1 degree longitude x 1 degree latitude. Variables included in these files are listed or linked below. Variables in atmosphere model output: Vertical velocity (omega) Convective precipitation Large-scale precipitation Surface pressure Specific humidity Temperature (atmospheric profile) Reference temperature (temp. at reference height, 2 meters in this case) Zonal wind Meridional wind Geopotential height Variables in land model output: See https://www.cesm.ucar.edu/models/cesm1.2/clm/models/lnd/clm/doc/UsersGuide/history_fields_table_40.xhtml Note that not all of the variables listed at the above link are included in the land model output files in this dataset. This material is based upon work supported by the National Science Foundation under Grant No. 1454089. We acknowledge high-performance computing support from Cheyenne (doi:10.5065/D6RX99HX) provided by NCAR's Computational and Information Systems Laboratory, sponsored by the National Science Foundation. The CESM project is supported primarily by the National Science Foundation. We thank all the scientists, software engineers, and administrators who contributed to the development of CESM2. References Danabasoglu, G., and Coauthors, 2020: The Community Earth System Model Version 2 (CESM2). Journal of Advances in Modeling Earth Systems, 12, e2019MS001916, https://doi.org/10.1029/2019MS001916.
keywords: Climate modeling; atmospheric science; hydrometeorology; hydroclimatology; soil moisture; land-atmosphere interactions
published: 2020-10-20
 
This dataset includes a total of 501 images of 42 fossil specimens of Striatopollis and 459 specimens of 45 extant species of the tribe Amherstieae-Fabaceae. These images were taken using Airyscan confocal superresolution microscopy at 630X magnification (63x/NA 1.4 oil DIC). The images are in the CZI file format. They can be opened using Zeiss propriety software (Zen, Zen lite) or in ImageJ. More information on how to open CZI files can be found here: [https://www.zeiss.com/microscopy/us/products/microscope-software/zen/czi.html#microscope---image-data].
keywords: Striatopollis catatumbus; superresolution microscopy; Cenozoic; tropics; Zeiss; CZI; striate pollen.
published: 2021-03-23
 
DNN weights used in the evaluation of the ApproxTuner system. Link to paper: https://dl.acm.org/doi/10.1145/3437801.3446108
published: 2021-03-17
 
This dataset was developed as part of a study that assessed data reuse. Through bibliometric analysis, corresponding authors of highly cited papers published in 2015 at the University of Illinois at Urbana-Champaign in nine STEM disciplines were identified and then surveyed to determine if data were generated for their article and their knowledge of reuse by other researchers. Second, the corresponding authors who cited those 2015 articles were identified and surveyed to ascertain whether they reused data from the original article and how that data was obtained. The project goal was to better understand data reuse in practice and to explore if research data from an initial publication was reused in subsequent publications.
keywords: data reuse; data sharing; data management; data services; Scopus API
published: 2021-03-15
 
Dataset associated with "Hiding in plain sight: genetic confirmation of putative Louisiana Fatmucket Lampsilis hydiana in Illinois" as submitted to Freshwater Mollusk Biology and Conservation by Stodola et al. Images are from cataloged specimens from the Illinois Natural History Survey (INHS) Mollusk Collection in Champaign, Illinois that were used for genetic research. File names indicate the species as confirmed in Stodola et al. (i.e., Lampsilis siliquoidea or Lampsilis hydiana) followed by the INHS Mollusk Collection catalog number, followed by the individual specimen number, followed by shell view (interior or exterior). If no specimen number is noted in the file name, there is only one specimen for that catalog number. For example: Lsiliquoidea_46515_1_2_3_exterior. Images were created by photographing specimens on a metric grid in an OrTech Photo-e-Box Plus with a Nikon D610 single lens reflex camera using a 60mm lens. Post-processing of images (cropping, image rotation, and auto contrast) occurred in Adobe Photoshop and saved as TIFF files using no image compression, interleaved pixel order, and IBM PC Byte Order. One additional partial lot, INHS Mollusk Catalog No. 37059 (shown with both interior and exterior view in one image), is included for reference but was not genetically sequenced. A .csv file contains an index of all specimens photographed. SPECIES: species confirmed using genetic analyses GENE: cox1 or nad1 mitochondrial gene ACCESSION: GenBank accession number INHS CATALOG NO: Illinois Natural History Survey Mollusk Collection Catalog number WATERBODY: waterbody where specimen was collected PUTATIVE SPECIES: species determination based on morphological characters prior to genetic analysis Phylogenetic sequence data (.nex files) were aligned using BioEdit (Hall, T.A. 1999. BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symposium Series 41:95-98.). Pertinent methodology for the analysis are contained within the manuscript submittal for Stodola et al. to Freshwater Mollusk Biology and Conservation. In these files, "N" is a standard symbol for an unknown base.
keywords: Lampsilis hydiana; Lampsilis siliquoidea; unionid; Louisiana Fatmucket; Fatmucket; genetic confirmation
published: 2021-03-14
 
This dataset contains all the code, notebooks, datasets used in the study conducted to measure the spatial accessibility of COVID-19 healthcare resources with a particular focus on Illinois, USA. Specifically, the dataset measures spatial access for people to hospitals and ICU beds in Illinois. The spatial accessibility is measured by the use of an enhanced two-step floating catchment area (E2FCA) method (Luo & Qi, 2009), which is an outcome of interactions between demands (i.e, # of potential patients; people) and supply (i.e., # of beds or physicians). The result is a map of spatial accessibility to hospital beds. It identifies which regions need more healthcare resources, such as the number of ICU beds and ventilators. This notebook serves as a guideline of which areas need more beds in the fight against COVID-19. ## What's Inside A quick explanation of the components of the zip file * `COVID-19Acc.ipynb` is a notebook for calculating spatial accessibility and `COVID-19Acc.html` is an export of the notebook as HTML. * `Data` contains all of the data necessary for calculations: &nbsp;&nbsp;&nbsp;&nbsp;&nbsp; * `Chicago_Network.graphml`/`Illinois_Network.graphml` are GraphML files of the OSMNX street networks for Chicago and Illinois respectively. &nbsp;&nbsp;&nbsp;&nbsp;&nbsp; * `GridFile/` has hexagonal gridfiles for Chicago and Illinois &nbsp;&nbsp;&nbsp;&nbsp;&nbsp; * `HospitalData/` has shapefiles for the hospitals in Chicago and Illinois &nbsp;&nbsp;&nbsp;&nbsp;&nbsp; * `IL_zip_covid19/COVIDZip.json` has JSON file which contains COVID cases by zip code from IDPH &nbsp;&nbsp;&nbsp;&nbsp;&nbsp; * `PopData/` contains population data for Chicago and Illinois by census tract and zip code. &nbsp;&nbsp;&nbsp;&nbsp;&nbsp; * `Result/` is where we write out the results of the spatial accessibility measures &nbsp;&nbsp;&nbsp;&nbsp;&nbsp; * `SVI/`contains data about the Social Vulnerability Index (SVI) * `img/` contains some images and HTML maps of the hospitals (the notebook generates the maps) * `README.md` is the document you're currently reading! * `requirements.txt` is a list of Python packages necessary to use the notebook (besides Jupyter/IPython). You can install the packages with `python3 -m pip install -r requirements.txt`
keywords: COVID-19; spatial accessibility; CyberGISX
published: 2021-03-10
 
The PhytoplasmasRef_Trivellone_etal.fas fasta file contains the original final sequence alignment used in the phylogenetic analyses of Trivellone et al. (Ecology and Evolution, in review). The 27 sequences (21 phytoplasma reference strains and 6 phytoplasmas strains from the present study) were aligned using the Muscle algorithm as implemented in MEGA 7.0 with default settings. The final dataset contains 952 positions of the F2n/R2 fragment of the 16S rRNA gene. The data analyses are further described in the cited original paper.
keywords: Hemiptera; Cicadellidae; Mollicutes; Phytoplasma; biorepository
published: 2021-01-27
 
*This is the third version of the dataset*. New changes in this 3rd version: <i>1.replaces simulations where the initial condition consists of a sinusoidal channel with topographic perturbations with simulations where the initial condition consists of a sinusoidal channel without topographic perturbations. These simulations better illustrate the transformation of a nondendritic network into a dendritic one. 2. contains two additional simulations showing how total domain size affects the landscape's dynamism. 3. changes dataset title to reflect the publication's title</i> This dataset contains data from 18 simulations using a landscape evolution model. A landscape evolution model simulates how uplift and rock incision shape the Earth's (or other planets) surface. To date, most landscape evolution models exhibit "extreme memory" (paper: https://doi.org/10.1029/2019GL083305 and dataset: https://doi.org/10.13012/B2IDB-4484338_V1). Extreme memory in landscape evolution models causes initial conditions to be unrealistically preserved. This dataset contains simulations from a new landscape evolution model that incorporates a sub-model that allows bedrock channels to erode laterally. With this addition, the landscapes no longer exhibit extreme memory. Initial conditions are erased over time, and the landscapes tend towards a dynamic steady state instead of a static one. The model with lateral erosion is named LEM-wLE (Landscape Evolution Model with Lateral Erosion) and the model without lateral erosion is named LEM-woLE (Landscape Evolution Model without Lateral Erosion). There are 16 folders in total. Here are the descriptions: <i>>LEM-woLE_simulations:</i> This folder contains simulations using LEM-woLE. Inside the folder are 5 subfolders containing 100 elevation rasters, 100 drainage area rasters, and 100 plots showing the slope-area relationship. Elevation depicts the height of the landscape, and drainage area represents a contributing area that is upslope. Each folder corresponds to a different initial condition. Driver files and code for these simulations can be found at https://github.com/jeffskwang/LEM-wLE. <i>>MOVIE_S#_data:</i> There are 13 data folders that contain raster data for 13 simulations using LEM-wLE. Inside each folder are 1000 elevation rasters, 1000 drainage area rasters, and 1000 plots showing the slope-area relationship. Driver files and code for these simulations can be found at https://github.com/jeffskwang/LEM-wLE. <i>>movies_mp4_format:</i> For each data folder there are 3 movies generated that show elevation (a), drainage area (b), and erosion rates (c). These files are formatted in the mp4 format and are best viewed using VLC media player (https://www.videolan.org/vlc/index.html). <i>>movies_wmv_format:</i> This folder contains the same movies as the "movies_mp4_format" folder, but they are in a wmv format. These movies can be viewed using Windows media player or other Windows platform movie software. Here are the captions for the 13 movies: Movie S1. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Sinusoidal channel without randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 1. Movie S2. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Inclined with small, randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 1. Movie S3. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Inclined with large, randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 1. Movie S4. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: V-shaped valley with randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 1. Movie S5. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Sinusoidal channel with randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 1. Movie S6. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Sinusoidal channel without randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 0.25. Movie S7. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Sinusoidal channel without randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 0.5. Movie S8. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Sinusoidal channel without randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 0.75. Movie S9. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Flat with randomized perturbations. Boundary Condition: 1 open boundary at the bottom of the domain, and 3 closed boundaries elsewhere. KL/KV = 1. Movie S10. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Flat with randomized perturbations. Boundary Condition: 2 open boundaries at the top and bottom of the domain, and 2 closed boundaries on the left and right sides. KL/KV = 1. Movie S11. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Flat with randomized perturbations. Boundary Condition: 4 open boundaries. KL/KV = 1. Movie S12. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Flat with randomized perturbations. Boundary Condition: 4 open boundaries. KL/KV = 1. Compared to Movie S11, the length of the domain is 50% shorter, decreasing the total domain area. Movie S13. 200 MYR (1,000 RUs eroded) simulation showing elevation (a), logarithm of drainage area (b), and change in elevation (c). Initial Condition: Flat with randomized perturbations. Boundary Condition: 4 open boundaries. KL/KV = 1. Compared to Movie S11, the length of the domain is 50% longer, increasing the total domain area. The associated publication for this dataset has not yet been published, and we will update this description with a link when it is.
keywords: landscape evolution; drainage networks; lateral migration; geomorphology
published: 2021-03-08
 
These are abundance dynamics data and simulations for the paper "Higher-order interaction between species inhibits bacterial invasion of a phototroph-predator microbial community". In this V2, data were converted in Python, in addition to MATLAB and more information on how to work with the data was included in the Readme.
keywords: Microbial community; Higher order interaction; Invasion; Algae; Bacteria; Ciliate