Displaying datasets 1 - 25 of 364 in total

Subject Area

Life Sciences (190)
Social Sciences (82)
Physical Sciences (46)
Technology and Engineering (30)
Uncategorized (15)
Arts and Humanities (1)

Funder

Other (95)
U.S. National Science Foundation (NSF) (94)
U.S. National Institutes of Health (NIH) (39)
U.S. Department of Energy (DOE) (33)
U.S. Department of Agriculture (USDA) (19)
Illinois Department of Natural Resources (IDNR) (9)
U.S. National Aeronautics and Space Administration (NASA) (4)
U.S. Geological Survey (USGS) (3)
U.S. Army (1)

Publication Year

2020 (101)
2019 (73)
2021 (65)
2018 (59)
2017 (35)
2016 (30)
2022 (1)

License

CC0 (208)
CC BY (150)
custom (6)
published: 2020-12-16
 
The Cline Center Global News Index is a searchable database of textual features extracted from millions of news stories, specifically designed to provide comprehensive coverage of events around the world. In addition to searching documents for keywords, users can query metadata and features such as named entities extracted using Natural Language Processing (NLP) methods and variables that measure sentiment and emotional valence. Archer is a web application purpose-built by the Cline Center to enable researchers to access data from the Global News Index. Archer provides a user-friendly interface for querying the Global News Index (with the back-end indexing still handled by Solr). By default, queries are built using icons and drop-down menus. More technically-savvy users can use Lucene/Solr query syntax via a ‘raw query’ option. Archer allows users to save and iterate on their queries, and to visualize faceted query results, which can be helpful for users as they refine their queries. <b>Additional Resources:</b> - Access to Archer and the Global News Index is limited to account-holders. If you are interested in signing up for an account, you can fill out the <a href="https://docs.google.com/forms/d/1Vx_PpkIV4U1mt2FrPlfmdTC19VSidWQ8OC3D0lLNnvs/edit"><b>Archer User Information Form</b></a>. - Current users who would like to provide feedback, such as reporting a bug or requesting a feature, can fill out the <a href="https://forms.gle/6eA2yJUGFMtj5swY7"><b>Archer User Feedback Form</b></a>. - The Cline Center sends out periodic email newsletters to the Archer Users Group. Please fill out this form to <a href="https://groups.webservices.illinois.edu/subscribe/123172"><b>subscribe to Archer Users Group</b></a>. <b>Citation Guidelines:</b> 1) To cite the GNI codebook (or any other documentation associated with the Global News Index and Archer) please use the following citation: Cline Center for Advanced Social Research. 2020. Global News Index and Extracted Features Repository [codebook], v1.0.1. Champaign, IL: University of Illinois. Dec. 16. doi:10.13012/B2IDB-5649852_V2 2) To cite data from the Global News Index (accessed via Archer or otherwise) please use the following citation (filling in the correct date of access): Cline Center for Advanced Social Research. 2020. Global News Index and Extracted Features Repository [database], v1.0.1. Champaign, IL: University of Illinois. Dec. 16. Accessed Month, DD, YYYY. doi:10.13012/B2IDB-5649852_V2
keywords: Cline Center; Cline Center for Advanced Social Research; political; social; political science; Global News Index; Archer; news; mass communication; journalism;
published: 2021-06-14
 
Chronic contact exposure to realistic soil concentrations (0, 7.5, 15, and 100 ppb) of the neonicotinoid pesticide imidacloprid had species- and sex-specific effects on adult bee movement characteristics, but not on adult female bee brain development. This dataset contains two data files. The first contains information about adult bee movement characteristics for female Osmia lignaria and female and male Megachile rotundata over a 10-minute trial (total distance traveled and average movement speed). The second contains information about female Osmia lignaria and Megachile rotundata adult brain morphology. Detected effects included: female Osmia lignaria adults moved faster as they aged in the 0 and 7.5 ppb, but not in the 15 or 100 ppb, groups; young male Megachile rotundata adults moved more quickly (7.5 and 100 ppb) and farther (100 ppb) when treated with imidacloprid compared to the control group (0 ppb); and, while there was no impact of imidacloprid on adult female neuropil:Kenyon cell volume (N:K), N:K decreased with Osmia ligaria adult age and increased with Megachile rotundata adult age.
keywords: neonicotinoid; imidacloprid; bee; movement
published: 2021-06-14
 
This repository contains the weights for two StyleGAN2 networks trained on two composite T1 and T2 weighted open-source brain MR image datasets, and one StyleGAN2 network trained on the Flickr Face HQ image dataset. Example images sampled from the respective StyleGANs are also included. The datasets themselves are not included in this repository. Additional details and citations can be found in the file "README.md".
keywords: StyleGAN2; Generative adversarial network (GAN); MRI; Medical imaging
published: 2021-02-28
 
This dataset contains the RegCM4 simulations used in the article " Implementation of dynamic ageing of carbonaceous aerosols in regional climate model RegCM". This dataset was used to investigate the impact of a new aging parameterisation scheme implemented in a regional climate model RegCM4. The dataset contains two sets of simulations: Expt_fix and Expt_dyn. It consists of the seasonal mean and daily mean values of the variables that were used to create the visualizations of this study. The Expt_fix and Expt_dyn dataset contain 34 and 38 NetCDF files, respectively. The CERES_vs_2expts_new.mat file is the comparison between CERES shortwave downward flux at the surface and same model outputs from two experiments for clear sky and all sky conditions. -------------------------------------------------- The following information about the dataset was generated on 2021-01-08 by SUDIPTA GHOSH <b>GENERAL INFORMATION</b> <i>1. Date of data collection (single date, range, approximate date):</i> 2019-01-01 to 2019-12-31 <i>2. Geographic location of data collection:</i> Urbana-Champaign,Illinois, USA <i>3. Information about funding sources that supported the collection of the data:</i> This work is supported by the MoEFCC under the NCAP-COALESCE project [Grant No. 14/10/2014-CC]. The first author acknowledges DST-INSPIRE fellowship [IF150055] and Fulbright-Kalam Climate Doctoral fellowship. N. R. acknowledges funding from NSF AGS-1254428 and DOE grant DE-SC0019192. Department of Science and Technology, Funds for Improvement of Science and Technology infrastructure in universities and higher educational institutions (DST-FIST) grant (SR/FST/ESII-016/2014) are acknowledged for the computing support. <b>DATA & FILE OVERVIEW</b> <i>1. File List:</i> Expt_fix and Expt_dyn datasets contain the analysed seasonal means and daily means of the variables that have been used to create the visualizations of this study. Each of the Expt_fix and Expt_dyn datasets contains 34 and 38 NetCDF files, respectively. <i>2. Relationship between files, if important:</i> NA <i>3. Additional related data collected that was not included in the current data package:</i> No <b>METHODOLOGICAL INFORMATION</b> <i>1. Description of methods used for collection/generation of data: </i> The model RegCM4 code is freely available online from <a href="http://gforge.ictp.it/gf/project/regcm/">http://gforge.ictp.it/gf/project/regcm/</a>. The anthropogenic aerosol emissions considered for the simulations are taken from IIASA inventory. The data used can be easily accessed online <a href="http://clima-dods.ictp.it/regcm4/">http://clima-dods.ictp.it/regcm4/</a> website. TRMM observed precipitation data can be assessed from <a href="https://giovanni.gsfc.nasa.gov/giovanni/">https://giovanni.gsfc.nasa.gov/giovanni/</a> website. CRU temperature data is available at <a href="https://crudata.uea.ac.uk/cru/data/hrg/">https://crudata.uea.ac.uk/cru/data/hrg/</a>. CERES satellite surface shortwave downward fluxes are available at <a href="https://ceres.larc.nasa.gov/data/">https://ceres.larc.nasa.gov/data/</a> website. Input files for the RegCM4 model are archived in <a href="http://clima-dods.ictp.it/regcm4/">http://clima-dods.ictp.it/regcm4/</a> website. This dataset contains the RegCM4 simulations used in the article " Implementation of dynamic ageing of carbonaceous aerosols in regional climate model RegCM ". Two sets of simulations: Expt_fix and Expt_dyn consists of the output data . This dataset only contains the analysed seasonal mean and daily mean of the variables that have been used to create the visualizations of this study. Each of Expt_fix and Expt_dyn contains 34 and 38 NetCDF files respectively. This dataset was used to investigate the impact of a new aging parameterisation scheme implemented in a regional climate model RegCM4. <i>2. Methods for processing the data:</i> Seasonal Mean and daily average values were extracted from 6-hourly model output. <i>3. Instrument- or software-specific information needed to interpret the data:</i> CDO-1.7.1, Grads-2.0.a9, Matlab2016b <i>4. Standards and calibration information, if appropriate:</i> NA <i>5. Environmental/experimental conditions:</i> NA <i>6. Describe any quality-assurance procedures performed on the data:</i> NA <i>7. People involved with sample collection, processing, analysis and/or submission:</i> Sudipta Ghosh, Nicole Riemer, Graziano Giuliani, Filippo Giorgi, Dilip Ganguly, Sagnik Dey <b>DATA-SPECIFIC INFORMATION FOR: Expt_fix_data.tar.gz</b> <i>1. Number of variables:</i> 29 <i>2. Number of cases/rows:</i> NA <i>3. Variable List:</i> Mass concentration (Kg m-3) of BC, BC_HB, BC_HL, OC, OC_HB, OC_HL; Columnar burden (mg m-2)] of BC, BC_HL, BC_HB, OC; Dry deposition flux (mg m-2 day-1) of BC_HB, BC_HL, OC_HB, OC_HL; Wet deposition flux due washout (mg m-2 day-1) of BC_HB, BC_HL, OC_HB, OC_HL; Wet deposition flux due to rainout (mg m-2 day-1) of BC_HB, BC_HL OC_HB, OC_HL; AOD (unit less), precipitation (Kg m-2 s-1), temperature (K) , v-wind (m s-1), u-wind (m s-1), Surface shortwave downward flux (W m-2), Shortwave radiative forcing at the surface and top of atmosphere (W m-2) <b>DATA-SPECIFIC INFORMATION FOR: Expt_dyn_data.tar.gz</b> <i>1. Number of variables:</i> 30 <i>2. Number of cases/rows:</i> NA <i>3. Variable List:</i> Mass concentration (Kg m-3) of BC, BC_HB, BC_HL, OC, OC_HB, OC_HL; Columnar burden (mg m-2)] of BC, BC_HL, BC_HB, OC; Dry deposition flux (mg m-2 day-1) of BC_HB, BC_HL OC_HB, OC_HL; Wet deposition flux due washout (mg m-2 day-1) of BC_HB, BC_HL OC_HB, OC_HL; Wet deposition flux due to rainout (mg m-2 day-1) of BC_HB, BC_HL OC_HB, OC_HL; AOD (unit less); precipitation (Kg m-2 s-1); temperature (K); v-wind (m s-1); u-wind (m s-1); Surface shortwave downward flux (W m-2); Shortwave radiative forcing at the surface and top of atmosphere (W m-2); ageingscale (s-1) <b>DATA-SPECIFIC INFORMATION FOR: CERES_vs_2expts_new.mat</b> <i>1. Number of variables:</i> 12 <i>2. Number of cases/rows:</i> NA <i>3. Variable List:</i> Surface shortwave downward flux for clear sky (W/m-2) for CERES, Expt_fix, Expt_dyn (for winter JF and monsoon JJAS seasons); Surface shortwave downward flux for all sky conditions (W/m-2) for CERES, Expt_fix, Expt_dyn (for winter JF and monsoon JJAS seasons). <b>NOTE:</b> The following information applies for all three (3) files: <i> Missing data codes:</i> NA <i>Specialized formats or other abbreviations used:</i> NA
keywords: Carbonaceous aerosols; ageing parameterisation scheme; regional climate model; NetCDF
published: 2021-06-08
 
Dataset associated with Jones and Ward JAE-2020-0031.R1 submission: Pre-to post-fledging carryover effects and the adaptive significance of variation in wing development for juvenile songbirds. Excel CSV files with data used in analyses and file with descriptions of each column. The flight ability variable in this dataset was derived from fledgling drop tests, examples of which can be found in the related dataset: Jones, Todd M.; Benson, Thomas J.; Ward, Michael P. (2019): Flight Ability of Juvenile Songbirds at Fledgling: Examples of Fledgling Drop Tests. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2044905_V1.
keywords: fledgling; wing development; life history; adaptive significance; post-fledging; songbirds
published: 2021-05-21
 
Data sets from "Inferring Species Trees from Gene-Family with Duplication and Loss using Multi-Copy Gene-Family Tree Decomposition." It contains trees and sequences simulated with gene duplication and loss under a variety of different conditions. <b>Note:</b> - trees.tar.gz contains the simulated gene-family trees used in our experiments (both true trees from SimPhy as well as trees estimated from alignements). - sequences.tar.gz contains simulated sequence data used for estimating the gene-family trees as well as the concatenation analysis. - biological.tar.gz contains the gene trees used as inputs for the experiments we ran on empirical data sets as well as species trees outputted by the methods we tested on those data sets. - stats.txt list statistics (such as AD, MGTE, and average size) for our simulated model conditions.
keywords: gene duplication and loss; species-tree inference; simulated data;
has sharing link
 
planned publication date: 2021-09-01
 
Bigheaded Carp have spread throughout the Mississippi River basin since the 1970s. Little has stopped the spread as carp have the ability to pass through locks and dams, and they are currently approaching the Great Lakes. However, the location of the leading edge in the Illinois River has stalled for over a decade, even though there is no barrier preventing further advancement towards the Great Lakes. Defining why carp are not moving towards the Great Lakes is important for predicting why they might advance in the future. The aim of this study was to test the hypothesis that anthropogenic contaminants in the Illinois River may be playing a role in preventing further upstream movement of Bigheaded Carp. Ninety three livers were collected from carp at several locations between May and October of 2018. Liver samples were analyzed using gas chromatography-mass spectrometry in a targeted metabolite profiling approach. Livers from carp at the leading edge had differences in energy use and metabolism, and suppression of protective mechanisms relative to downstream fish; differences were consistent across time. This body of work provides evidence that water quality is linked to carp movement in the Illinois River. As water quality in this region continues to improve, consideration of this impact on carp spread is essential to protect the Great Lakes.
keywords: water quality; metabolites; range expansion; energy; contaminants
published: 2021-05-26
 
Steady-state and dynamic gas exchange data for maize (B73), sugarcane (CP88-1762) and sorghum (Tx430)
keywords: C4 plants; gas exchange
published: 2021-05-14
 
This is the complete dataset for the "Anomalous density fluctuations in a strange metal" Proceedings of the National Academy of Sciences publication (https://doi.org/10.1073/pnas.1721495115). This is an integration of the Zenodo dataset which includes raw M-EELS data. <b>METHODOLOGICAL INFORMATION</b> 1. Description of methods used for collection/generation of data: Data have been collected with a M-EELS instrument and according to the data acquisition protocol described in the original PNAS publication and in SciPost Phys. 3, 026 (2017) (doi: 10.21468/SciPostPhys.3.4.026) 2. Methods for processing the data: Raw data were collected with a channeltron-based M-EELS apparatus described in the reference PNAS publication and analyzed according to the procedure outlined both in the PNAS paper and in SciPost Phys. 3, 026 (2017) (doi: 10.21468/SciPostPhys.3.4.026). The raw M-EELS spectra at each momentum have been subject to minor data processing involving: (a) averaging of different acquisitions at the same conditions, (b) energy binning, (c) division of an effective Coulomb matrix element (which yields a structure factor S(q,\omega)), (d) antisymmetrization (which yields the imaginary chi) All these procedures are described in the PNAS paper. 3. Instrument- or software-specific information needed to interpret the data: These data are simple .txt or .dat files which can be read with any standard data analysis software, notably Python notebooks, MatLab, Origin, IgorPro, and others. We do not include scripts in order to provide maximum flexibility. 4. Relationship between files, if important: We divided in different folders raw data, structure factors and imaginary chi. <b>DATA-SPECIFIC INFORMATION</b> There are 8 folders within the Data_public_deposition_v1.zip. Each folder contain data needed to create the corresponding figure in the publication. <b>1. Fig1:</b> This folder contains 21 DAT files needed to plot the theory data in panels C and D, following this naming conventions: [chiA]or[chiB]or[Pi]_q_number.dat With chiA is the imaginary RPA charge susceptibility with a Coulomb interaction of electronically weakly coupled layers chiB is the imaginary RPA charge susceptibility with the usual 4\pi e^2/q^2 Coulomb interaction. Pi is the imaginary Lindhard polarizability. q is momentum in reciprocal lattice units Number is the numerical momentum value in reciprocal lattice units <b>2. Fig2:</b> Files needed to plot Fig. 2 of the PNAS paper. Contains 3 folders as listed below. The files in this folder are named following this convention: Bi2212_295K_(1,-1)_50eV_161107_q_number_2.16_avg.dat, 295K is the sample temperature (1,-1) is the momentum direction in reciprocal lattice units 50 eV is the incident e beam energy 161107 is the start date of the experiment in yymmdd format Q is the momentum Number is the momentum in reciprocal lattice units 2.16 is the energy range covered by the data in eV Avg identifies averaged data ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor Raw_avg_data: raw averaged M-EELS spectra Sqw: Structure factors derived from the M-EELS spectra <b>3. Fig3:</b> Files needed to plot Fig. 3 of the PNAS paper. OP/ OD prefix identifies optimally doped or overdosed sample data, respectively. ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor Raw_avg_data: raw averaged M-EELS spectra Sqw: Structure factors derived from the M-EELS spectra <b>4. Fig4:</b> Files needed to plot Fig. 4 of the PNAS paper. The _fit_parameters.dat file contains the fit parameters extracted according to the fit procedure described in the manuscript and at all momenta. ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor Raw_avg_data: raw averaged M-EELS spectra Sqw: Structure factors derived from the M-EELS spectra <b>5. FigS1:</b> Files needed to plot Fig. S1 of the PNAS paper. There are 5 files in this folder. DAT files are M-EELS data following the prior naming convention, while the two .txt files are digitized data from N. Nücker, U. Eckern, J. Fink, and P. Müller, Long-Wavelength Collective Excitations of Charge Carriers in High-Tc Superconductors, Phys. Rev. B 44, 7155(R) (1991), and K. H. G. Schulte, The interplay of Spectroscopy and Correlated Materials, Ph.D. thesis, University of Groningen (2002). <b>6. FigS2:</b> Files needed to plot Fig. S2 of the PNAS paper. ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor Raw_avg_data: raw averaged M-EELS spectra Sqw: Structure factors derived from the M-EELS spectra <b>7. FigS3:</b> Files needed to plot Fig. S3 of the PNAS paper. There are 2 files in this folder: 20K_phi_0_q_0.dat: is a M-EELS raw intensity at zero momentum transfer on Bi2212 at 20 K 295K_phi_0_q_0.dat: is a M-EELS raw intensity at zero momentum transfer on Bi2212 at 295 K <b>8. FigS4:</b> Files needed to plot Fig. S4 of the PNAS paper. The _fit_parameters.dat file contains the fit parameters extracted according to the fit procedure described in the manuscript and at all momenta. ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor Raw_avg_data: raw averaged M-EELS spectra Sqw: Structure factors derived from the M-EELS spectra
keywords: Momentum resolved electron energy loss spectroscopy (M-EELS); cuprates; plasmons; strange metal
published: 2021-05-14
 
Please cite as: Menglin Liu and Benjamin M. Gramig. "Survey of Cover Crop, Conservation Tillage and Nutrient Management Practice Usage in Illinois and 2020 Fall Covers for Spring Savings Crop Insurance Discount Program Participation." Report to the Illinois Department of Agriculture and Fall Covers for Spring Savings working group. Center for the Economics of Sustainability and Department of Agricultural and Consumer Economics, University of Illinois at Urbana-Champaign. 2021. https://doi.org/10.13012/B2IDB-5222984_V1
keywords: cover crops; Illinois; 2020; conservation tillage; nutrient management practices; farmer survey; NLRS
published: 2021-05-17
 
Please cite as: Wuebbles, D., J. Angel, K. Petersen, and A.M. Lemke, (Eds.), 2021: An Assessment of the Impacts of Climate Change in Illinois. The Nature Conservancy, Illinois, USA. https://doi.org/10.13012/B2IDB-1260194_V1 Climate change is a major environmental challenge that is likely to affect many aspects of life in Illinois, ranging from human and environmental health to the economy. Illinois is already experiencing impacts from the changing climate and, as climate change progresses and temperatures continue to rise, these impacts are expected to increase over time. This assessment takes an in-depth look at how the climate is changing now in Illinois, and how it is projected to change in the future, to provide greater clarity on how climate change could affect urban and rural communities in the state. Beyond providing an overview of anticipated climate changes, the report explores predicted effects on hydrology, agriculture, human health, and native ecosystems.
keywords: Climate change; Illinois; Public health; Agriculture; Environment; Water; Hydrology; Ecosystems
published: 2021-05-14
 
Supplemental Forest Data for Chapter 6: Climate Change Impacts on Ecosystems in "An Assessment of the Impacts of Climate Change in Illinois"
published: 2021-05-14
 
Please cite as: Jim Miller, Sergiusz Czesny, Qihong Dai, James Ellis, Louis Iverson, Jeff Matthews, Charles Roswell, Cory Suski, John Taft, and Mike Ward. 2021. “Climate Change Impacts on Ecosystems: Scientific and Common Species Names”.
keywords: Scientific names; Common names; Illinois species
published: 2021-05-14
 
This document contains the Supplemental Materials for Chapter 4: Climate Change Impacts on Agriculture from the report "An Assessment of the Impacts of Climate Change in Illinois" published in 2021.
keywords: Illinois; climate change; agriculture; impacts; adaptation; crop yield; ISAM; econometrics; days suitable for fieldwork
published: 2021-05-14
 
- The aim of this research was to evaluate the novel dietary fiber source, miscanthus grass, in comparison to traditional fiber sources, and their effects on the microbiota of healthy adult cats. Four dietary treatments, cellulose (CO), miscanthus grass fiber (MF), a blend of miscanthus fiber and tomato pomace (MF+TP), or beet pulp (BP) were evaluated.<br /><br />- The study was conducted using a completely randomized design with twenty-eight neutered adult, domesticated shorthair cats (19 females and 9 males, mean age 2.2 ± 0.03 yr; mean body weight 4.6 ± 0.7 kg, mean body condition score 5.6 ± 0.6). Total DNA from fresh fecal samples was extracted using Mo-Bio PowerSoil kits (MO BIO Laboratories, Inc., Carlsbad, CA). Amplification of the 292 bp-fragment of V4 region from the 16S rRNA gene was completed using a Fluidigm Access Array (Fluidigm Corporation, South San Francisco, CA). Paired-end Illumina sequencing was performed on a MiSeq using v3 reagents (Illumina Inc., San Diego, CA) at the Roy J. Carver Biotechnology Center at the University of Illinois. <br />- Filenames are composed of animal name identifier, diet (BP= beet pulp; CO= cellulose; MF= miscanthus grass fiber; TP= blend of miscanthus fiber and tomato pomace).
keywords: cats; dietary fiber; fecal microbiota; miscanthus grass; nutrient digestibility; postbiotics
published: 2021-05-13
 
Data files and R code to replicate the econometric analysis in the journal article: B Chen, BM Gramig and SD Yun. “Conservation Tillage Mitigates Drought Induced Soybean Yield Losses in the US Corn Belt.” Q Open. https://doi.org/10.1093/qopen/qoab007
keywords: R, Conservation Tillage, Drought, Yield, Corn, Soybeans, Resilience, Climate Change
published: 2020-10-28
 
We studied we examined the role of stream flow on environmental DNA (eDNA) concentrations and detectability of an invasive clam (Corbicula fluminea), while also accounting for other abiotic and biotic variables. This data includes the eDNA concentrations, quadrat estimates of clam density, and abiotic variables.
keywords: Corbicula; detection probability; eDNA; invasive species; lotic; occupancy modeling
published: 2021-05-07
 
Prepared by Vetle Torvik 2021-05-07 The dataset comes as a single tab-delimited Latin-1 encoded file (only the City column uses non-ASCII characters). • How was the dataset created? The dataset is based on a snapshot of PubMed (which includes Medline and PubMed-not-Medline records) taken in December, 2018. (NLMs baseline 2018 plus updates throughout 2018). Affiliations are linked to a particular author on a particular article. Prior to 2014, NLM recorded the affiliation of the first author only. However, MapAffil 2018 covers some PubMed records lacking affiliations that were harvested elsewhere, from PMC (e.g., PMID 22427989), NIH grants (e.g., 1838378), and Microsoft Academic Graph and ADS (e.g. 5833220). Affiliations are pre-processed (e.g., transliterated into ASCII from UTF-8 and html) so they may differ (sometimes a lot; see PMID 27487542) from PubMed records. All affiliation strings where processed using the MapAffil procedure, to identify and disambiguate the most specific place-name, as described in: Torvik VI. MapAffil: A bibliographic tool for mapping author affiliation strings to cities and their geocodes worldwide. D-Lib Magazine 2015; 21 (11/12). 10p • Look for Fig. 4 in the following article for coverage statistics over time: Palmblad, M., Torvik, V.I. Spatiotemporal analysis of tropical disease research combining Europe PMC and affiliation mapping web services. Trop Med Health 45, 33 (2017). <a href="https://doi.org/10.1186/s41182-017-0073-6">https://doi.org/10.1186/s41182-017-0073-6</a> Expect to see big upticks in coverage of PMIDs around 1988 and for non-first authors in 2014. • The code and back-end data is periodically updated and made available for query by PMID at http://abel.ischool.illinois.edu/cgi-bin/mapaffil/search.py • What is the format of the dataset? The dataset contains 52,931,957 rows (plus a header row). Each row (line) in the file has a unique PMID and author order, and contains the following eighteen columns, tab-delimited. All columns are ASCII, except city which contains Latin-1. 1. PMID: positive non-zero integer; int(10) unsigned 2. au_order: positive non-zero integer; smallint(4) 3. lastname: varchar(80) 4. firstname: varchar(80); NLM started including these in 2002 but many have been harvested from outside PubMed 5. initial_2: middle name initial 6. orcid: From 2019 ORCID Public Data File https://orcid.org/ and from PubMed XML 7. year: year of the publication 8. journal: name of journal that the publication is published 9. affiliation: author's affiliation?? 10. disciplines: extracted from departments, divisions, schools, laboratories, centers, etc. that occur on at least unique 100 affiliations across the dataset, some with standardization (e.g., 1770799), English translations (e.g., 2314876), or spelling corrections (e.g., 1291843) 11. grid: inferred using a high-recall technique focused on educational institutions (but, for experimental purposes, includes a few select hospitals, national institutes/centers, international companies, governmental agencies, and 200+ other IDs [RINGGOLD, Wikidata, ISNI, VIAF, http] for institutions not in GRID). Based on 2019 GRID version https://www.grid.ac/ 12. type: EDU, HOS, EDU-HOS, ORG, COM, GOV, MIL, UNK 13. city: varchar(200); typically 'city, state, country' but could include further subdivisions; unresolved ambiguities are concatenated by '|' 14. state: Australia, Canada and USA (which includes territories like PR, GU, AS, and post-codes like AE and AA) 15. country 16. lat: at most 3 decimals (only available when city is not a country or state) 17. lon: at most 3 decimals (only available when city is not a country or state) 18. fips: varchar(5); for USA only retrieved by lat-lon query to https://geo.fcc.gov/api/census/block/find
keywords: PubMed, MEDLINE, Digital Libraries, Bibliographic Databases; Author Affiliations; Geographic Indexing; Place Name Ambiguity; Geoparsing; Geocoding; Toponym Extraction; Toponym Resolution; institution name disambiguation
published: 2021-05-12
 
These are the data sets associated with our publication "Field borders provide winter refuge for beneficial predators and parasitoids: a case study on organic farms." For this project, we compared the communities of overwintering arthropod natural enemies in organic cultivated fields and wildflower-strip field borders at five different sites in central Illinois. Abstract: Semi-natural field borders are frequently used in midwestern U.S. sustainable agriculture. These habitats are meant to help diversify otherwise monocultural landscapes and provision them with ecosystem services, including biological control. Predatory and parasitic arthropods (i.e., potential natural enemies) often flourish in these habitats and may move into crops to help control pests. However, detailed information on the capacity of semi-natural field borders for providing overwintering refuge for these arthropods is poorly understood. In this study, we used soil emergence tents to characterize potential natural enemy communities (i.e., predacious beetles, wasps, spiders, and other arthropods) overwintering in cultivated organic crop fields and adjacent field borders. We found a greater abundance, species richness, and unique community composition of predatory and parasitic arthropods in field borders compared to arable crop fields, which were generally poorly suited as overwintering habitat. Furthermore, potential natural enemies tended to be positively associated with forb cover and negatively associated with grass cover, suggesting that grassy field borders with less forb cover are less well-suited as winter refugia. These results demonstrate that semi-natural habitats like field borders may act as a source for many natural enemies on a year-to-year basis and are important for conserving arthropod diversity in agricultural landscapes.
keywords: Natural enemy; wildflower strips; conservation biological control; semi-natural habitat; field border; organic farming
published: 2021-05-10
 
This dataset contains the emulated global multi-model urban daily temperature projections under RCP 8.5 scenario. The dataset is derived from the study "Large model structural uncertainty in global projections of urban heat waves" (XXXX). Details about this dataset and the local urban climate emulator are described in the article. This dataset documents the global urban daily temperatures of 17 CMIP5 Earth system models for 2006-2015 and 2061-2070. This dataset may be useful for multiple communities regarding urban climate change, heat waves, impacts, vulnerability, risks, and adaptation applications.
keywords: Urban heat waves; CMIP; urban warming; heat stress; urban climate change
published: 2021-05-10
 
UAV-based high-resolution multispectral time-series orthophotos utilized to understand the relation between growth dynamics, imagery temporal resolution, and end-of-season biomass productivity of biomass sorghum as bioenergy crop. Sensor utilized is a RedEdge Micasense flown at 40 meters above ground level at the Energy Farm- UIUC in 2019.
keywords: Unmanned aerial vehicles; High throughput phenotyping; Machine learning; Bioenergy crops
published: 2021-05-10
 
This dataset contains data used in publication "Institutional Data Repository Development, a Moving Target" submitted to Code4Lib Journal. It is a tabular data file describing attributes of data files in datasets published in Illinois Data Bank 2016-04-01 to 2021-04-01.
published: 2021-05-09
 
Raw data and its analysis collected from a trial designed to test the impact of providing a Bacillus-based direct-fed microbial (DFM) on the syndrome resulting from orally infecting pigs with either Salmonella enterica serotype Choleraesuis (S. Choleraesuis) alone, or in combination with an intranasal challenge, three days later, with porcine reproductive and respiratory syndrome virus (PRRSV).
keywords: excel file
published: 2021-05-07
 
The dataset is based on a snapshot of PubMed taken in December 2018 (NLMs baseline 2018 plus updates throughout 2018), and for ORCIDs, primarily, the 2019 ORCID Public Data File https://orcid.org/. Matching an ORCID to an individual author name on a PMID is a non-trivial process. Anyone can create an ORCID and claim to have contributed to any published work. Many records claim too many articles and most claim too few. Even though ORCID records are (most?) often populated by author name searches in popular bibliographic databases, there is no confirmation that the person's name is listed on the article. This dataset is the product of mapping ORCIDs to individual author names on PMIDs, even when the ORCID name does not match any author name on the PMID, and when there are multiple (good) candidate author names. The algorithm avoids assigning the ORCID to an article when there are no good candidates and when there are multiple equally good matches. For some ORCIDs that clearly claim too much, it triggers a very strict matching procedure (for ORCIDs that claim too much but the majority appear correct, e.g., 0000-0002-2788-5457), and sometimes deletes ORCIDs altogether when all (or nearly all) of its claimed PMIDs appear incorrect. When an individual clearly has multiple ORCIDs it deletes the least complete of them (e.g., 0000-0002-1651-2428 vs 0000-0001-6258-4628). It should be noted that the ORCIDs that claim to much are not necessarily due nefarious or trolling intentions, even though a few appear so. Certainly many are are due to laziness, such as claiming everything with a particular last name. Some cases appear to be due to test engineers (e.g., 0000-0001-7243-8157; 0000-0002-1595-6203), or librarians assisting faculty (e.g., ; 0000-0003-3289-5681), or group/laboratory IDs (0000-0003-4234-1746), or having contributed to an article in capacities other than authorship such as an Investigator, an Editor, or part of a Collective (e.g., 0000-0003-2125-4256 as part of the FlyBase Consortium on PMID 22127867), or as a "Reply To" in which case the identity of the article and authors might be conflated. The NLM has, in the past, limited the total number of authors indexed too. The dataset certainly has errors but I have taken great care to fix some glaring ones (individuals who claim to much), while still capturing authors who have published under multiple names and not explicitly listed them in their ORCID profile. The final dataset provides a "matchscore" that could be used for further clean-up. Four files: person.tsv: 7,194,692 rows, including header 1. orcid 2. lastname 3. firstname 4. creditname 5. othernames 6. otherids 7. emails employment.tsv: 2,884,981 rows, including header 1. orcid 2. putcode 3. role 4. start-date 5. end-date 6. id 7. source 8. dept 9. name 10. city 11. region 12 country 13. affiliation education.tsv: 3,202,253 rows, including header 1. orcid 2. putcode 3. role 4. start-date 5. end-date 6. id 7. source 8. dept 9. name 10. city 11. region 12 country 13. affiliation pubmed2orcid.tsv: 13,133,065 rows, including header 1. PMID 2. au_order (author name position on the article) 3. orcid 4. matchscore (see below) 5. source: orcid (2019 ORCID Public Data File https://orcid.org/), pubmed (NLMs distributed XML files), or patci (an earlier version of ORCID with citations processed through the Patci tool) 12,037,375 from orcid; 1,06,5892 from PubMed XML; 29,797 from Patci matchscore: 000: lastname, firstname and middle init match (e.g., Eric T MacKenzie vs 00: lastname, firstname match (e.g., Keith Ward) 0: lastname, firstname reversed match (e.g., Conde Santiago vs Santiago Conde) 1: lastname, first and middle init match (e.g., L. F. Panchenko) 11: lastname and partial firstname match (e.g., Mike Boland vs Michael Boland or Mel Ziman vs Melanie Ziman) 12: lastname and first init match 15: 3 part lastname and firstname match (David Grahame Hardie vs D Grahame Hardie) 2: lastname match and multipart firstname initial match Maria Dolores Suarez Ortega vs M. D. Suarez 22: partial lastname match and firstname match (e.g., Erika Friedmann vs Erika Friedman) 23: e.g., Antonio Garcia Garcia vs A G Garcia 25: Allan Downie vs J A Downie 26: Oliver Racz vs Oliver Bacz 27: Rita Ostrovskaya vs R U Ostrovskaia 29: Andrew Staehelin vs L A Staehlin 3: M Tronko vs N D Tron'ko 4: Sharon Dent (Also known as Sharon Y.R. Dent; Sharon Y Roth; Sharon Yoder) vs Sharon Yoder 45: Okulov Aleksei vs A B Okulov 48: Maria Del Rosario Garcia De Vicuna Pinedo vs R Garcia-Vicuna 49: Anatoliy Ivashchenko vs A Ivashenko 5 = lastname match only (weak match but sometimes captures alternative first name for better subsequent matches); e.g., Bill Hieb vs W F Hieb 6 = first name match only (weak match but sometimes captures alternative first name for better subsequent matches); e.g., Maria Borawska vs Maria Koscielak 7 = last or first name match on "other names"; e.g., Hromokovska Tetiana (Also known as Gromokovskaia, T. S., Громоковська Тетяна) vs T Gromokovskaia 77: Siva Subramanian vs Kolinjavadi N. Sivasubramanian 88 = no name in orcid but match caught by uniqueness of name across paper (at least 90% and 2 more than next most common name) prefix: C = ambiguity reduced (possibly eliminated) using city match (e.g., H Yang on PMID 24972200) I = ambiguity eliminated by excluding investigators (ie.., one author and one or more investigators with that name) T = ambiguity eliminated using PubMed pos (T for tie-breaker) W = ambiguity resolved by authority2018
published: 2021-05-07
 
- The objective of this study was to evaluate macronutrient apparent total tract digestibility (ATTD), gastrointestinal tolerance, and fermentative end-products in extruded, canine diets. <br />- Five diets were formulated to be isocaloric and isonitrogenous with either garbanzo beans (GBD), green lentils (GLD), peanut flour (PFD), dried yeast (DYD), or poultry by-product meal (CON) as the primary protein sources. Ten adult, intact, female beagles (mean age: 4.2 ± 1.1 yr, mean 28 weight: 11.9 ± 1.3 kg) were used in a replicated, 5x5 Latin square design with 14 d periods. Total DNA from fresh fecal samples was extracted using Mo-Bio PowerSoil kits (MO BIO Laboratories, Inc., Carlsbad, CA). Amplification of the 292 bp-fragment of V4 region from the 16S rRNA gene was completed using a Fluidigm Access Array (Fluidigm Corporation, South San Francisco, CA). Paired-end Illumina sequencing was performed on a MiSeq using v3 reagents (Illumina Inc., San Diego, CA) at the Roy J. Carver Biotechnology Center at the University of Illinois. <br />- Filenames are composed of animal name identifier, diet (CON=control; DY= dried yeast; GB= garbanzo beans; GL= green lentils; PF= peanut flour) and period replicate number (P1, P2, P3, P4, and P5).
keywords: Dog; Digestibility; Legume; Microbiota; Pulse; Yeast