Datasets

published: 2020-03-08
 
This dataset inventories the availability of entrepreneurship and small business education, including co-curricular opportunities, in two-year colleges in the United States. The inventory provides a snapshot of activities at more than 1,650 public, not-for-profit, and private for-profit institutions, in 2014.
keywords: Small business education; entrepreneurship education; Kauffman Entrepreneurship Education Inventory; Ewing Marion Kauffman Foundation; Paul J. Magelli
published: 2020-06-01
 
Dataset associated with Hoover et al AUK-19-093 submission: Local conspecific density does not influence reproductive output in a secondary cavity-nesting songbird. Excel CSV with all of the data used in analyses.
Description of variables
YEARS: year
ORDINAL_DATE: number for what day of the year it is with 1 January = 1, …, 30 December = 365
SITE: acronym for each study site
BOX: unique nest box identifier on each study site
TREAT: designates whether nest box was in a high- or low- nest box density area within each study site
ACTUAL_NO_NEIGHBORS: number of pairs of warblers using a nest box within 200 m of a given pair’s nest box
CLUTCH_SIZE: number of warbler eggs in nest at the onset of incubation
PROWN: number of warbler nestlings once eggs have hatched
PROWF: number of warbler nestlings that fledged out of the nest box
HATCH_SUCCESS: proportion of eggs in the nest that hatched
FLEDG_SUCCESS: proportion of the nestlings that fledged from the nest box
HATCH_SUCCESS2: binary category where “0” indicates there was some, and “1” indicates there was no hatching failure
FLEDG_SUCCESS2: binary category where “0” indicates there was some, and “1” indicates there was no nestling failure (i.e. nestling death)
BHCO_PARASIT2: binary category where “0” indicates no cowbird parasitism, and “1” indicates there was cowbird parasitism
BHCOE: number of cowbird eggs in clutch
BHCOF: number of cowbird nestlings that fledged from the nest
PAIRID: unique number that identifies a male and female warbler that are together at a nest box and this number is the same in a subsequent nesting attempt or year if the same male and female are together again
FEMALE_ID: unique identifier for each female which represents her leg band combination. Each letter represents a band with letters preceding the hyphen being on the right leg and after the hyphen the left leg
FEM_AGE: binary category where “0” indicates a 1-year-old bird and “1” indicates a >1-year-old bird
FEMALE_BREEDING_ATTEMPT: “1” indicates first, “2” indicates second, … breeding attempt within a given year
SECOND_ATTEMPT: for any female that fledged a brood in a given year, binary category where “0” represents that they did not, and “1” indicates that they did attempt a second brood that year
F_TOT_PROWF: total reproductive output (number of warbler fledglings produced) for a given female in a given year
MALE_ID: unique identifier for each male which represents his leg band combination. Each letter represents a band with letters preceding the hyphen being on the right leg and after the hyphen the left leg
MALE_AGE2: binary category where “0” indicates a 1-year-old bird and “1” indicates a >1-year-old bird
Provisioning_rate: total number of food provisions per nestling per hour by male and female warbler combined
BROOD_MASS: average nestling mass (g) for the brood
BROOD_TARSUS: average nestling tarsus length (mm) for the brood
Brood_condition: unit-less index of nestling condition that uses the residuals of the BROOD_MASS/BROOD_TARSUS relationship
A period (“.”) represents where data were not collected, not available, or because individual nest or female did not qualify for consideration of a category assignment. An empty cell represents no data available for this particular cell.
keywords: conspecific density; density dependence; food limitation; hatching success; nestling body condition; nestling provisioning; Prothonotary Warbler; reproductive output
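The missing-value convention described above (a period or an empty cell means no data) can be handled when the CSV is read. The following is a minimal sketch, not the authors' code: the sample rows are made up for illustration, and recomputing HATCH_SUCCESS as PROWN divided by CLUTCH_SIZE is an assumption based on the variable descriptions.

```python
import csv
import io

# Per the description, "." and an empty cell both mean "no data".
MISSING = {".", ""}


def clean_row(row):
    """Replace missing-value markers with None; keep other values as strings."""
    cleaned = {}
    for name, value in row.items():
        text = (value or "").strip()
        cleaned[name] = None if text in MISSING else text
    return cleaned


def hatch_success(row):
    """Proportion of eggs that hatched (assumed to be PROWN / CLUTCH_SIZE)."""
    if row["PROWN"] is None or row["CLUTCH_SIZE"] is None:
        return None
    clutch = float(row["CLUTCH_SIZE"])
    return float(row["PROWN"]) / clutch if clutch else None


# Made-up sample rows for illustration only; not taken from the dataset.
SAMPLE = """YEARS,BOX,CLUTCH_SIZE,PROWN,PROWF
2015,A1,5,4,4
2015,A2,4,.,
"""

rows = [clean_row(r) for r in csv.DictReader(io.StringIO(SAMPLE))]
for r in rows:
    print(r["BOX"], hatch_success(r))
```

Keeping None for both markers means later calculations can skip a nest explicitly instead of treating "." as text or zero.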
published: 2020-02-27
 
These data were collected for an experiment examining the effects of neonicotinoid (clothianidin) presence on hover fly (Diptera: Syrphidae) behavior. Hover flies of two species (Eristalis arbustorum and Toxomerus marginatus) were offered a choice to feed on artificial flowers laced with sucrose solution that was either contaminated (CLO) or not contaminated (CON) with clothianidin. Two different concentrations of clothianidin in 0.5 M sucrose solution were tested: 2.5 ppb and 150 ppb. We conducted four sets of 10 trials, each trial set examining a different combination of species and clothianidin dose. Across 6 hours of video for each trial we recorded 1) the number of visits to each flower that resulted in feeding, and 2) the amount of time spent feeding during each visit. We found that while neither species fed significantly longer on either of the solutions, E. arbustorum appeared to avoid flowers with clothianidin, particularly at high rates. In the paper, we attribute this avoidance response, in part, to hover fly-visible spectral differences between the two flower choices and discuss potential implications for field and lab-based studies. In the enclosed zip file we have included all data for this project and the R scripts. Note: The data folder contains 4 files (instead of the 6 mentioned in the Readme): e.tenax_photoreceptors.csv; hoverfly_data_UPDATE.csv; number_visits_UPDATE.csv; and Original 2018 hover fly choice test data_Clem2020.xlsx
keywords: Syrphidae; hoverfly; Eristalis; Toxomerus; Choice Experiment; Neonicotinoid; Clothianidin
published: 2020-02-23
 
Citation context annotation for papers citing retracted paper Matsuyama 2005 (RETRACTED: Matsuyama W, Mitsuyama H, Watanabe M, Oonakahara KI, Higashimoto I, Osame M, Arimura K. Effects of omega-3 polyunsaturated fatty acids on inflammatory markers in COPD. Chest. 2005 Dec 1;128(6):3817-27.), retracted in 2008 (Retraction in: Chest (2008) 134:4 (893) <a href="https://doi.org/10.1016/S0012-3692(08)60339-6">https://doi.org/10.1016/S0012-3692(08)60339-6</a> ). This is part of the supplemental data for Jodi Schneider, Di Ye, Alison Hill, and Ashley Whitehorn. "Continued Citation of a Fraudulent Clinical Trial Report, Eleven Years after it was retracted for Falsifying Data" [R&R under review with Scientometrics]. Overall we found 148 citations to the retracted paper from 2006 to 2019. However, this dataset does not include the annotations described in 2015 in Ashley Fulton, Alison Coates, Marie Williams, Peter Howe, and Alison Hill. "Persistent citation of the only published randomised controlled trial of omega-3 supplementation in chronic obstructive pulmonary disease six years after its retraction." Publications 3, no. 1 (2015): 17-26. In this dataset 70 new and newly found citations are listed: 66 annotated citations and 4 pending citations (non-annotated since we don't have full-text). "New citations" refer to articles published from March 25, 2014 to 2019, found in Google Scholar and Web of Science. "Newly found citations" refer to articles published 2006-2013, found in Google Scholar and Web of Science, but not previously covered in Ashley Fulton, Alison Coates, Marie Williams, Peter Howe, and Alison Hill. "Persistent citation of the only published randomised controlled trial of omega-3 supplementation in chronic obstructive pulmonary disease six years after its retraction." Publications 3, no. 1 (2015): 17-26. NOTES: This is Unicode data. Some publication titles & quotes are in non-Latin characters and they may contain commas, quotation marks, etc. 
FILES/FILE FORMATS
Same data in two formats:
2006-2019-new-citation-contexts-to-Matsuyama.csv - Unicode CSV (preservation format only)
2006-2019-new-citation-contexts-to-Matsuyama.xlsx - Excel workbook (preferred format)
ROW EXPLANATIONS
70 rows of data - one citing publication per row
COLUMN HEADER EXPLANATIONS
Note - processing notes
Annotation pending - Y or blank
Year Published - publication year
ID - ID corresponding to the network analysis. See Ye, Di; Schneider, Jodi (2019): Network of First and Second-generation citations to Matsuyama 2005 from Google Scholar and Web of Science. University of Illinois at Urbana-Champaign. <a href="https://doi.org/10.13012/B2IDB-1403534_V2">https://doi.org/10.13012/B2IDB-1403534_V2</a>
Title - item title (some have non-Latin characters, commas, etc.)
Official Translated Title - item title in English, as listed in the publication
Machine Translated Title - item title in English, translated by Google Scholar
Language - publication language
Type - publication type (e.g., bachelor's thesis, blog post, book chapter, clinical guidelines, Cochrane Review, consumer-oriented evidence summary, continuing education journal article, journal article, letter to the editor, magazine article, Master's thesis, patent, Ph.D. thesis, textbook chapter, training module)
Book title for book chapters - Only for a book chapter - the book title
University for theses - for bachelor's thesis, Master's thesis, Ph.D. thesis - the associated university
Pre/Post Retraction - "Pre" for 2006-2008 (means published before the October 2008 retraction notice or in the 2 months afterwards); "Post" for 2009-2019 (considered post-retraction for our analysis)
Identifier where relevant - ISBN, Patent ID, PMID (only for items we considered hard to find/identify, e.g. those without a DOI-based URL)
URL where available - URL, ideally a DOI-based URL
Reference number/style - reference
Only in bibliography - Y or blank
Acknowledged - If annotated, Y, Not relevant as retraction not published yet, or N (blank otherwise)
Positive / "Poor Research" (Negative) - P for positive, N for negative if annotated; blank otherwise
Human translated quotations - Y or blank; blank means Google scholar was used to translate quotations for Translated Quotation X
Specific/in passing (overall) - Specific if any of the 5 quotations are specific [aggregates Specific / In Passing (Quotation X)]
Quotation 1 - First quotation (or blank) (includes non-Latin characters in some cases)
Translated Quotation 1 - English translation of "Quotation 1" (or blank)
Specific / In Passing (Quotation 1) - Specific if "Quotation 1" refers to methods or results of the Matsuyama paper (or blank)
What is referenced from Matsuyama (Quotation 1) - Methods; Results; or Methods and Results - blank if "Quotation 1" not specific, no associated quotation, or not yet annotated
Quotation 2 - Second quotation (includes non-Latin characters in some cases)
Translated Quotation 2 - English translation of "Quotation 2"
Specific / In Passing (Quotation 2) - Specific if "Quotation 2" refers to methods or results of the Matsuyama paper (or blank)
What is referenced from Matsuyama (Quotation 2) - Methods; Results; or Methods and Results - blank if "Quotation 2" not specific, no associated quotation, or not yet annotated
Quotation 3 - Third quotation (includes non-Latin characters in some cases)
Translated Quotation 3 - English translation of "Quotation 3"
Specific / In Passing (Quotation 3) - Specific if "Quotation 3" refers to methods or results of the Matsuyama paper (or blank)
What is referenced from Matsuyama (Quotation 3) - Methods; Results; or Methods and Results - blank if "Quotation 3" not specific, no associated quotation, or not yet annotated
Quotation 4 - Fourth quotation (includes non-Latin characters in some cases)
Translated Quotation 4 - English translation of "Quotation 4"
Specific / In Passing (Quotation 4) - Specific if "Quotation 4" refers to methods or results of the Matsuyama paper (or blank)
What is referenced from Matsuyama (Quotation 4) - Methods; Results; or Methods and Results - blank if "Quotation 4" not specific, no associated quotation, or not yet annotated
Quotation 5 - Fifth quotation (includes non-Latin characters in some cases)
Translated Quotation 5 - English translation of "Quotation 5"
Specific / In Passing (Quotation 5) - Specific if "Quotation 5" refers to methods or results of the Matsuyama paper (or blank)
What is referenced from Matsuyama (Quotation 5) - Methods; Results; or Methods and Results - blank if "Quotation 5" not specific, no associated quotation, or not yet annotated
Further Notes - additional notes
keywords: citation context annotation, retraction, diffusion of retraction
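Two of the column rules described for this dataset are simple enough to express directly: the Pre/Post Retraction label derived from the publication year, and the overall Specific/in passing value aggregated from the per-quotation columns. The sketch below is illustrative only. The function names are hypothetical, and the exact cell strings ("Specific", "In Passing", blank) are assumptions about how the values are written in the file.

```python
def pre_post_retraction(year_published):
    """Label a citing publication by year, following the stated rule:
    "Pre" for 2006-2008, "Post" for 2009-2019."""
    if 2006 <= year_published <= 2008:
        return "Pre"
    if 2009 <= year_published <= 2019:
        return "Post"
    raise ValueError(f"year outside the dataset's 2006-2019 range: {year_published}")


def overall_specificity(per_quotation):
    """Aggregate up to five per-quotation labels into one overall label.

    Returns "Specific" if any quotation is specific. Returning "In Passing"
    when labels exist but none is specific, and "" when nothing is annotated,
    are assumptions; the description only states the "Specific" case.
    """
    labels = [v.strip().lower() for v in per_quotation if v and v.strip()]
    if not labels:
        return ""
    return "Specific" if "specific" in labels else "In Passing"


print(pre_post_retraction(2008))
print(overall_specificity(["In Passing", "Specific", "", "", ""]))
```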
published: 2020-02-12
 
This dataset contains the results of a three-month audit of housing advertisements. It accompanies the 2020 ICWSM paper "Auditing Race and Gender Discrimination in Online Housing Markets". It covers data collected between Dec 7, 2018 and March 19, 2019. There are two JSON files in the dataset. The first contains a list of JSON objects representing advertisements, separated by newlines. Each object includes the date and time it was collected, the image and title (if collected) of the ad, the page on which it was displayed, and the training treatment it received. The second file is a list of JSON objects, each representing a visit to a housing lister, separated by newlines. Each object contains the URL, the training treatment applied, the location searched, and the metadata of the top sites scraped. This metadata includes location, price, and number of rooms. The dataset also includes the raw images of ads collected in order to code them by interest and targeting. These were captured by Selenium and named using a perceptual hash to de-duplicate images.
keywords: algorithmic audit; advertisement audit;
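Files that hold one JSON object per line, as described for this dataset, are read line by line and not with a single json.load call. A minimal sketch follows; the key names ("url", "treatment", "location") and the sample values are hypothetical stand-ins, since the description lists the kinds of fields but not their exact names.

```python
import io
import json


def read_json_lines(handle):
    """Parse a text stream that holds one JSON object per line."""
    records = []
    for line in handle:
        line = line.strip()
        if line:  # tolerate blank lines between records
            records.append(json.loads(line))
    return records


# Made-up records for illustration; key names are assumptions.
SAMPLE = (
    '{"url": "https://example.com/a", "treatment": "control", "location": "Chicago"}\n'
    '{"url": "https://example.com/b", "treatment": "treated", "location": "Boston"}\n'
)

visits = read_json_lines(io.StringIO(SAMPLE))
print(len(visits), visits[0]["treatment"])
```

For a file on disk, the same function can be called on an open text handle, for example `with open(path, encoding="utf-8") as f: read_json_lines(f)`, where the path and the encoding are again assumptions.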
published: 2020-02-05
 
The Delt_Comb.NEX text file contains the original data used in the phylogenetic analyses of Zahniser & Dietrich, 2013 (European Journal of Taxonomy, 45: 1-211). The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The first nine lines of the file indicate the file type (Nexus), that 152 taxa were analyzed, that a total of 3971 characters were analyzed, the format of the data, and specification for two symbols used in the dataset. There are four datasets separated into blocks, one each for: 28S rDNA gene, Histone H3 gene, morphology, and insertion/deletion characters scored based on the alignment of the 28S rDNA dataset. Descriptions of the morphological characters and more details on the species and specimens included in the dataset are provided in the publication using this dataset. A text file, Delt_morph_char.txt, is available here that states the morphological characters and character states that were scored in the Delt_Comb.NEX dataset. The original DNA sequence data are available from NCBI GenBank under the accession numbers indicated in the publication. Chromatogram files for each sequencing read are available from the first author upon request.
keywords: phylogeny; DNA sequence; morphology; parsimony analysis; Insecta; Hemiptera; Cicadellidae; leafhopper; evolution; 28S rDNA; histone H3; bayesian analysis
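The taxon and character counts mentioned in the description are normally declared in a NEXUS DIMENSIONS command, so they can be checked without a full parser. The sketch below is one way to do that with a regular expression. The sample header is an illustration built from the counts stated above, not text copied from Delt_Comb.NEX, and the function name is hypothetical.

```python
import re

# Matches e.g. "DIMENSIONS NTAX=152 NCHAR=3971;" in any letter case.
DIMENSIONS = re.compile(r"dimensions\s+([^;]*);", re.IGNORECASE)


def read_dimensions(text):
    """Return the key=value integers of the first DIMENSIONS command."""
    match = DIMENSIONS.search(text)
    if match is None:
        return {}
    pairs = re.findall(r"(\w+)\s*=\s*(\d+)", match.group(1))
    return {key.lower(): int(value) for key, value in pairs}


# Illustrative header only; the real file's first lines may differ.
SAMPLE_HEADER = """#NEXUS
BEGIN DATA;
  DIMENSIONS NTAX=152 NCHAR=3971;
END;
"""

print(read_dimensions(SAMPLE_HEADER))
```

For real analyses, a dedicated NEXUS reader from a phylogenetics package is the safer choice, since the format allows comments and multiple blocks that this sketch ignores.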
published: 2020-02-01
 
This data describes habitat use, availability, landscape level influences, and daily movement of dabbling ducks in the Wabash River Valley of southeastern Illinois and southwestern Indiana. It contains triangulated locations of individual ducks, associated habitat assignments of those locations, flood survey data to determine water availability, and randomly generated points to assess landscape level questions.
keywords: waterfowl; ducks; dabbling; mallard; teal; habitat
published: 2020-01-28
 
This dataset includes two data files that provide the time series (Jul. - Sep. 2017) data of sun-induced chlorophyll fluorescence (SIF_760) collected under sunny conditions at two maize sites (one rainfed and the other irrigated) in Nebraska in 2017. Data contain 392 SIF_760 records at the rainfed site and 707 records at the irrigated site. The timestamp uses local standard time. Data are available for the sunny conditions from 8 am to 5 pm (corresponding to 9 am to 6 pm local time) throughout the study period.
keywords: sun-induced chlorophyll fluorescence (SIF); maize; gross primary production(GPP); light use efficiency(LUE); SIF yield
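Because the timestamps use local standard time while the study period falls in summer, a one-hour shift is needed to compare records with clock time. The sketch below assumes that shift from the "8 am to 5 pm (corresponding to 9 am to 6 pm local time)" statement; the function names are hypothetical and the example timestamp is made up.

```python
from datetime import datetime, time, timedelta

# Assumption: local clock time runs one hour ahead of local standard time
# during the study period, as implied by the description.
CLOCK_OFFSET = timedelta(hours=1)


def standard_to_local(timestamp):
    """Convert a local-standard-time timestamp to local clock time."""
    return timestamp + CLOCK_OFFSET


def in_data_window(timestamp):
    """True if a standard-time timestamp lies within 8 am to 5 pm."""
    return time(8, 0) <= timestamp.time() <= time(17, 0)


record_time = datetime(2017, 7, 15, 8, 0)  # made-up example timestamp
print(standard_to_local(record_time), in_data_window(record_time))
```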
published: 2020-01-27
 
Morphologic data of dunes in the world's big rivers. Morphologic descriptors for large dunes include: dune height, dune mean leeside angle, dune maximum leeside angle, dune wavelength, dune flow depth (at the crest), and the fractional height of the maximum slope on the leeside for each dune. Morphologic descriptors for small dunes include: dune height, dune mean leeside angle, dune maximum leeside angle, dune wavelength, and dune flow depth (at the crest).
keywords: dune; bedform; rivers; morphology;
published: 2019-12-22
 
Dataset providing calculation of a Competition Index (CI) for Late Pleistocene carnivore guilds in Laos and Vietnam and their relationship to humans. Prey mass spectra, prey focus masses, and prey class raw data can be used to calculate the CI following Hemmer (2004). Mass estimates were calculated for each species following Van Valkenburgh (1990). Full citations to methodological papers are included as relationships with other resources.
keywords: competition; Southeast Asia; carnivores; humans
published: 2019-12-17
 
This dataset provides the raw data, code, and related figures for the paper "Channel Activation of CHSH Nonlocality".
keywords: Super-activation; Non-locality breaking channel
published: 2019-12-10
 
The dataset consists of two types of data: the estimate of land productivity (the maximum productivity, MP) and the estimate of land that has low productivity for any major crop planted in the contiguous United States (CONUS) and may therefore be available for growing bioenergy crops (the marginal land, ML). All data items are in GeoTIFF format, in the World Geodetic System (WGS) 84 projection, with a resolution of 0.0020810045 degree (~250 m). The MP values are calculated from yields of major crops in the CONUS estimated by a machine learning model, and are provided as an expected value (MP_mean.tif) and an associated uncertainty (MP_IDP.tif). The ML availability data have two versions: a deterministic version and a version with uncertainty. The deterministic MLs are determined as the land pixels with expected MP values falling in the range defined in the following criteria, and the MLs with uncertainty are determined as the probability that the MP value of a land pixel falls in the range defined in the following criteria:
Criteria    Description
S1          Current crop and pasture land with MP <= P50
S2          Current crop and pasture land with MP <= P25
S3          S1 + current grass and shrub land with P25 < MP < P50
S4          S2 + current grass and shrub land with P10 < MP < P25
Economic    Current crop and pasture land with potential profitability < 0
Here P10, P25 and P50 are the 10th, 25th and 50th percentiles of crop MP values.
keywords: Land productivity;marginal land;land use
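The deterministic S1 to S4 criteria amount to threshold tests on a pixel's MP value and land cover. The sketch below shows that logic for a single pixel. It is not the authors' implementation: the land-cover labels, the function name, and the threshold values in the example are hypothetical, and the Economic criterion is left out because it depends on profitability data not described here.

```python
CROP_PASTURE = "crop_pasture"  # hypothetical land-cover label
GRASS_SHRUB = "grass_shrub"    # hypothetical land-cover label


def marginal_scenarios(mp, land_cover, p10, p25, p50):
    """Return the set of criteria (S1-S4) under which a pixel is marginal land.

    mp is the pixel's expected MP value; p10, p25 and p50 are the 10th, 25th
    and 50th percentiles of crop MP values, supplied by the caller.
    """
    s1 = land_cover == CROP_PASTURE and mp <= p50
    s2 = land_cover == CROP_PASTURE and mp <= p25
    s3 = s1 or (land_cover == GRASS_SHRUB and p25 < mp < p50)
    s4 = s2 or (land_cover == GRASS_SHRUB and p10 < mp < p25)
    flags = (("S1", s1), ("S2", s2), ("S3", s3), ("S4", s4))
    return {name for name, flag in flags if flag}


# Made-up thresholds and MP values, for illustration only.
print(sorted(marginal_scenarios(3.0, CROP_PASTURE, p10=2.0, p25=4.0, p50=6.0)))
print(sorted(marginal_scenarios(5.0, GRASS_SHRUB, p10=2.0, p25=4.0, p50=6.0)))
```

The version with uncertainty would replace each boolean test with the probability that MP falls in the stated range, which requires the MP uncertainty layer and is outside this sketch.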
published: 2019-12-12
 
This dataset contains gamma-ray spectra templates for a source interdiction and uranium enrichment measurement task. This dataset also contains Keras machine learning models trained using datasets created using these templates.
keywords: gamma-ray spectroscopy; neural networks; machine learning; isotope identification; uranium enrichment; sodium iodide; NaI(Tl)
published: 2019-12-03
 
This is the data set associated with the manuscript titled "Extensive host-switching of avian feather lice following the Cretaceous-Paleogene mass extinction event." Included are the gene alignments used for phylogenetic analyses and the cophylogenetic input files.
keywords: phylogenomics, cophylogenetics, feather lice, birds
published: 2019-12-03
 
These are the alignments of transcriptome data used for the analysis of members of Heteroptera. This dataset is analyzed in "Deep instability in the phylogenetic backbone of Heteroptera is only partly overcome by transcriptome-based phylogenomics" published in Insect Systematics and Diversity.
keywords: Heteroptera; Hemiptera; Phylogenomics; transcriptome
published: 2019-11-18
 
VCF files used to analyze a novel filtering tool, VEF, presented in the article "VEF: a Variant Filtering tool based on Ensemble methods".
keywords: VCF files; filtering; VEF
published: 2019-10-16
 
Human annotations of randomly selected judged documents from the AP 88-89, Robust 2004, WT10g, and GOV2 TREC collections. Seven annotators were asked to read documents in their entirety and then select up to ten terms they felt best represented the main topic(s) of the document. Terms were chosen from among a set sampled from the document in question and from related documents.
keywords: TREC; information retrieval; document topicality; document description
published: 2019-11-12
 
We are sharing the tweet IDs of four social movements: #BlackLivesMatter, #WhiteLivesMatter, #AllLivesMatter, and #BlueLivesMatter. The tweets were collected between May 1, 2015 and May 30, 2017. We limited the location to the United States and extracted only original tweets, excluding retweets. Recommended citations for the data: Rezapour, R. (2019). Data for: How do Moral Values Differ in Tweets on Social Movements?. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9614170_V1 and Rezapour, R., Ferronato, P., and Diesner, J. (2019). How do moral values differ in tweets on social movements?. In 2019 Computer Supported Cooperative Work and Social Computing Companion Publication (CSCW’19 Companion), Austin, TX.
keywords: Twitter; social movements; black lives matter; blue lives matter; all lives matter; white lives matter
published: 2019-10-18
 
Supporting secondary data used in a manuscript currently in submission regarding the invasion dynamics of the Asian tiger mosquito, Aedes albopictus, in the state of Illinois.
keywords: albopictus;mosquito
published: 2019-07-04
 
Software (Matlab .m files) for the article: Lying in Wait: Modeling the Control of Bacterial Infections via Antibiotic-Induced Proviruses. The files can be used to reproduce the analysis and figures in the article.
keywords: Matlab codes; antibiotic-induced dynamics
published: 2019-09-25
 
¹²CO and ¹³CO maps for six molecular clouds in the Large Magellanic Cloud, obtained with the Atacama Large Millimeter/submillimeter Array (ALMA). See the associated article in the Astrophysical Journal, and README files within each ZIP archive. Please cite the article if you use these data.
keywords: Radio astronomy
published: 2019-09-17
 
Trained models for multi-task multi-dataset learning for text classification as well as sequence tagging in tweets. Classification tasks include sentiment prediction, abusive content, sarcasm, and veridictality. Sequence tagging tasks include POS, NER, Chunking, and SuperSenseTagging. Models were trained using: <a href="https://github.com/socialmediaie/SocialMediaIE/blob/master/SocialMediaIE/scripts/multitask_multidataset_classification_tagging.py">https://github.com/socialmediaie/SocialMediaIE/blob/master/SocialMediaIE/scripts/multitask_multidataset_classification_tagging.py</a> See <a href="https://github.com/socialmediaie/SocialMediaIE">https://github.com/socialmediaie/SocialMediaIE</a> and <a href="https://socialmediaie.github.io">https://socialmediaie.github.io</a> for details. If you are using this data, please also cite the related article: Shubhanshu Mishra. 2019. Multi-dataset-multi-task Neural Sequence Tagging for Information Extraction from Tweets. In Proceedings of the 30th ACM Conference on Hypertext and Social Media (HT '19). ACM, New York, NY, USA, 283-284. DOI: https://doi.org/10.1145/3342220.3344929
keywords: twitter; deep learning; machine learning; trained models; multi-task learning; multi-dataset learning; classification; sequence tagging
published: 2019-09-05
 
This data set includes data from NMR, LC-MS/MS, MALDI-MS, and H/D exchange MS experiments used in the paper "A novel rotifer derived alkaloid paralyzes schistosome larvae and prevents infection".
published: 2019-09-06
 
This is a dataset of 1101 comments from The New York Times (May 1, 2015-August 31, 2015) that contain a mention of the stemmed words vaccine or vaxx.
keywords: vaccine;online comments