Displaying 401 - 425 of 781 in total

Illinois Data Bank Dataset Search Results

Dataset Search Results

published: 2019-09-17

Mishra, Shubhanshu (2019): Trained models for multi-task multi-dataset learning for sequence prediction in tweets. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0934773_V1

Trained models for multi-task multi-dataset learning for sequence tagging in tweets. Sequence tagging tasks include POS, NER, Chunking, and SuperSenseTagging. Models were trained using: <a href="https://github.com/socialmediaie/SocialMediaIE/blob/master/SocialMediaIE/scripts/multitask_multidataset_experiment.py">https://github.com/socialmediaie/SocialMediaIE/blob/master/SocialMediaIE/scripts/multitask_multidataset_experiment.py</a> See <a href="https://github.com/socialmediaie/SocialMediaIE">https://github.com/socialmediaie/SocialMediaIE</a> and <a href="https://socialmediaie.github.io">https://socialmediaie.github.io</a> for details. If you are using this data, please also cite the related article: Shubhanshu Mishra. 2019. Multi-dataset-multi-task Neural Sequence Tagging for Information Extraction from Tweets. In Proceedings of the 30th ACM Conference on Hypertext and Social Media (HT '19). ACM, New York, NY, USA, 283-284. DOI: https://doi.org/10.1145/3342220.3344929

keywords: twitter; deep learning; machine learning; trained models; multi-task learning; multi-dataset learning;

published: 2024-02-26

Harsh, Vipul; Zhou, Wenxuan; Ashok, Sachin; Mysore, Radhika Niranjan; Godfrey, Brighten; Banerjee, Sujata (2024): Murphy traces. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6641912_V1

Traces created using DeathStarBench (https://github.com/delimitrou/DeathStarBench) benchmark of microservice applications with injected failures on containers. Failures consist of disk/CPU/memory failures.

keywords: Murphy;Performance Diagnosis;Microservice;Failures

published: 2020-12-02

Yang, Pan; Cai, Ximing; Khanna, Madhu (2020): Farmers’ Perceptions of Marginal Land for Biofuel Crops . University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3074705_V1

The dataset includes the survey results about farmers’ perceptions of marginal land availability and the likelihood of a land pixel being marginal based on a machine learning model trained from the survey. Two spreadsheet files are the farmer and farm characteristics (marginal_land_survey_data_shared.xlsx), and the existing land use of marginal lands (land_use_info_sharing.xlsx). Note: the blank cells in these two spreadsheets mean missing values in the survey response. The GeoTiff file includes two bands, one the marginal land likelihood in the Midwestern states (0-1), the other the dominant reason of land marginality (0-5; 0 for farm size, 1 for growing season precipitation, 2 for root zone soil water capacity, 3 for average slope, 4 for growing season mean temperature, and 5 for growing season diurnal range of temperature). To read the data, please use a GIS software such as ArcGIS or QGIS.

keywords: marginal land; survey

published: 2021-05-14

Abbamonte, Peter (2021): Data for Anomalous density fluctuations in a strange metal. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2888536_V1

This is the complete dataset for the "Anomalous density fluctuations in a strange metal" Proceedings of the National Academy of Sciences publication (https://doi.org/10.1073/pnas.1721495115). This is an integration of the Zenodo dataset which includes raw M-EELS data. METHODOLOGICAL INFORMATION 1. Description of methods used for collection/generation of data: Data have been collected with a M-EELS instrument and according to the data acquisition protocol described in the original PNAS publication and in SciPost Phys. 3, 026 (2017) (doi: 10.21468/SciPostPhys.3.4.026) 2. Methods for processing the data: Raw data were collected with a channeltron-based M-EELS apparatus described in the reference PNAS publication and analyzed according to the procedure outlined both in the PNAS paper and in SciPost Phys. 3, 026 (2017) (doi: 10.21468/SciPostPhys.3.4.026). The raw M-EELS spectra at each momentum have been subject to minor data processing involving: (a) averaging of different acquisitions at the same conditions, (b) energy binning, (c) division of an effective Coulomb matrix element (which yields a structure factor S(q,\omega)), (d) antisymmetrization (which yields the imaginary chi) All these procedures are described in the PNAS paper. 3. Instrument- or software-specific information needed to interpret the data: These data are simple .txt or .dat files which can be read with any standard data analysis software, notably Python notebooks, MatLab, Origin, IgorPro, and others. We do not include scripts in order to provide maximum flexibility. 4. Relationship between files, if important: We divided in different folders raw data, structure factors and imaginary chi. DATA-SPECIFIC INFORMATION There are 8 folders within the Data_public_deposition_v1.zip. Each folder contain data needed to create the corresponding figure in the publication. 1. Fig1: This folder contains 21 DAT files needed to plot the theory data in panels C and D, following this naming conventions: [chiA]or[chiB]or[Pi]_q_number.dat With chiA is the imaginary RPA charge susceptibility with a Coulomb interaction of electronically weakly coupled layers chiB is the imaginary RPA charge susceptibility with the usual 4\pi e^2/q^2 Coulomb interaction. Pi is the imaginary Lindhard polarizability. q is momentum in reciprocal lattice units Number is the numerical momentum value in reciprocal lattice units 2. Fig2: Files needed to plot Fig. 2 of the PNAS paper. Contains 3 folders as listed below. The files in this folder are named following this convention: Bi2212_295K_(1,-1)_50eV_161107_q_number_2.16_avg.dat, 295K is the sample temperature (1,-1) is the momentum direction in reciprocal lattice units 50 eV is the incident e beam energy 161107 is the start date of the experiment in yymmdd format Q is the momentum Number is the momentum in reciprocal lattice units 2.16 is the energy range covered by the data in eV Avg identifies averaged data ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor Raw_avg_data: raw averaged M-EELS spectra Sqw: Structure factors derived from the M-EELS spectra 3. Fig3: Files needed to plot Fig. 3 of the PNAS paper. OP/ OD prefix identifies optimally doped or overdosed sample data, respectively. ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor Raw_avg_data: raw averaged M-EELS spectra Sqw: Structure factors derived from the M-EELS spectra 4. Fig4: Files needed to plot Fig. 4 of the PNAS paper. The _fit_parameters.dat file contains the fit parameters extracted according to the fit procedure described in the manuscript and at all momenta. ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor Raw_avg_data: raw averaged M-EELS spectra Sqw: Structure factors derived from the M-EELS spectra 5. FigS1: Files needed to plot Fig. S1 of the PNAS paper. There are 5 files in this folder. DAT files are M-EELS data following the prior naming convention, while the two .txt files are digitized data from N. Nücker, U. Eckern, J. Fink, and P. Müller, Long-Wavelength Collective Excitations of Charge Carriers in High-Tc Superconductors, Phys. Rev. B 44, 7155(R) (1991), and K. H. G. Schulte, The interplay of Spectroscopy and Correlated Materials, Ph.D. thesis, University of Groningen (2002). 6. FigS2: Files needed to plot Fig. S2 of the PNAS paper. ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor Raw_avg_data: raw averaged M-EELS spectra Sqw: Structure factors derived from the M-EELS spectra 7. FigS3: Files needed to plot Fig. S3 of the PNAS paper. There are 2 files in this folder: 20K_phi_0_q_0.dat: is a M-EELS raw intensity at zero momentum transfer on Bi2212 at 20 K 295K_phi_0_q_0.dat: is a M-EELS raw intensity at zero momentum transfer on Bi2212 at 295 K 8. FigS4: Files needed to plot Fig. S4 of the PNAS paper. The _fit_parameters.dat file contains the fit parameters extracted according to the fit procedure described in the manuscript and at all momenta. ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor Raw_avg_data: raw averaged M-EELS spectra Sqw: Structure factors derived from the M-EELS spectra

keywords: Momentum resolved electron energy loss spectroscopy (M-EELS); cuprates; plasmons; strange metal

published: 2022-09-29

Merrill, Loren; Jones, Todd; Brawn, Jeffrey; Ward, Michael (2022): Merrill et al. ECE-2021-05-00793.R1. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8719858_V1

Dataset associated with Merrill et al. ECE-2021-05-00793.R1 submission: Early life patterns of growth are linked to levels of phenotypic trait covariance and post-fledging mortality across avian species. Excel CSV files with all of the data used in analyses and file with descriptions of each column.

keywords: canalization; developmental flexibility; early-life stress; nest predation; phenotypic correlation; trait covariance

published: 2022-10-14

Zhou, Shan; Li, Jiahui; Lu, Jun; Liu, Haihua; Kim, Ji-Young; Kim, Ahyoung; Yao, Lehan; Liu, Chang; Qian, Chang; Hood, Zachary D. ; Lin, Xiaoying; Chen, Wenxiang; Gage, Thomas E. ; Arslan, Ilke; Travesset, Alex; Sun, Kai; Kotov, Nicholas A.; Chen, Qian (2022): Chiral Assemblies of Pinwheel Superlattices on Substrates. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0873473_V1

This dataset is the raw data including SEM, TEM, PINEM images and FDTD simulation as well as pairwise interaction calculation results.

published: 2022-11-07

Jones, Todd; Di Giovanni, Alexander; Hauber, Mark; Ward, Michael (2022): Data for Jones et al. ECY22-0118.R3. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7460768_V1

Dataset associated with Jones et al. ECY22-0118.R3 submission: Ontogenetic effects of brood parasitism by the Brown-headed Cowbird on host offspring. Excel CSV files with all of the data used in analyses and file with descriptions of each column.

keywords: brood parasitism; cowbirds; host-parasite systems; ontogeny; post-fledging; songbirds

published: 2022-12-31

Maffeo, Christopher; Wilson, Jim; Quednau, Lauren; Aksimentiev, Aleksei (2022): Simulation Trajectories for "DNA double helix, a tiny electromotor". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6770800_V1

Trajectory data for Nature Nanotechnology manuscript "DNA double helix, a tiny electromotor" that demonstrates how an electric field applied along the helical axis of a DNA or RNA molecule will generate an electroosmotic flow that causes the duplex to spin about that axis, much like a turbine.

keywords: All-atom MD simulation; DNA; nanotechnology; motors and rotors

published: 2022-08-25

Souza-Cole, Ian; Ward, Michael; Rebecca, Mau; Jeffrey, Foster; Benson, Thomas (2022): Eastern Whip-poor-will abundance declines with urban land cover and increases with moth abundance in the American Midwest. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7745473_V1

Data in this publication were used to analyze the factors that influence the abundance of eastern whip-poor-wills in the Midwest and to describe the diet of this species. These data were collected in Illinois in 2019 and 2020. Procedures were approved by the Illinois Institutional Animal Care and Use Committee (IACUC), protocol no. 19006

keywords: eastern whip-poor-will; Antrostomus vociferus; abundance; moths; nightjars; Lepidoptera; metabarcoding

published: 2020-11-06

Sashittal, Palash; Zhang, Chuanyi; El-Kebir, Mohammed (2020): Simulation Data for JUMPER: Discontinuous Transcript Assembly in SARS-CoV-2. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6667667_V1

This data contains bam files and transcripts in the simulated instances generated for the paper 'JUMPER: Discontinuous Transcript Assembly in SARS-CoV-2' submitted for RECOMB 2021. The folder 'bam' contained the simulated bam files aligned using STAR wile the reads were generated using the method polyester Note: in the readme file, close to the end of the document, please ignore this sentence: 'Those files can be opened by using [name of software].'

keywords: transcript assembly; SARS-CoV-2; discontinuous transcription; coronaviruses

published: 2022-08-23

Seyfried, Georgia; Corrales, Adriana; Kent, Angela; Dalling, James; Yang, Wendy (2022): Soil data for Watershed-scale variation in potential fungal community contributions to ectomycorrhizal biogeochemical syndromes. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3532603_V1

This dataset contains soil chemical properties used to variation in soil fungal communities beneath Oreomunnea mexicana trees in the manuscript "Watershed-scale variation in potential fungal community contributions to ectomycorrhizal biogeochemical syndromes"

keywords: Acid-base chemistry; Ectomycorrhizal fungi; Exploration type; Nitrogen cycling; Nitrogen isotopes; Plant-soil (below-ground) interactions; Saprotrophic fungi; Tropical forest

published: 2022-11-07

Sweedler, Jonathan; Castro, Daniel (2022): Single-cell and Subcellular Analysis using Ultrahigh Resolution 21 T MALDI FTICR Mass Spectrometry. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4873339_V1

The dataset contains the data and code for Single-cell and Subcellular Analysis of freshly isolated cultured, uncultured P1 cells and uncultured Old cells. The .csv file named 'MagLab20220721' contains the sample and intensity information with the columns referring to the m/z values and the rows being the samples. The 'MagLabNameINdex.csv' file contains all the index information. The file named '20220721_MagLab.spydata' contains the loaded data of both the two previous files in Spyder. The .mat file contains the aligned data for the three groups.

keywords: Single-cell; Subcellular; Mass Spectrometry; MALDI; Lipidomics; FTICR; 21 T

published: 2023-03-13

Yang, Joyce; Zhao , Lei; Oleson, Keith (2023): Historical and future CESM climate simulations from: Large humidity effects on urban heat exposure and cooling challenges under climate change. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9627482_V1

This dataset contains the historical and future (SSP3 and RCP7.0) CESM climate simulations used in the article "Large humidity effects on urban heat exposure and cooling challenges under climate change" (upcoming). Further details about these simulations can be found in the article. This dataset documents the monthly mean projections of air temperature, wet-bulb temperature, precipitation, relative humidity, and numerous other climatic variables for 2000-2009 (for the historical run) and for 2015-2100 (for the future projection under SSP3-RCP7). This dataset may be useful for urban planners, climate scientists, and decision-makers interested in changes in urban and rural climate under climate change.

keywords: urban climate; climate change; heat stress; urban heat

published: 2023-12-23

Rodriguez-Zas, Sandra (2023): Supplemental File companion to a manuscript to be submitted. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8635710_V1

Supplemental document corresponding to a submission to Physiological Genomics (Data supplements and source materials must now be deposited in a community-recognized data repository or to a generalist public access repository if no community resource is available. See "Author/Production Requirements" for more information.) https://pg.msubmit.net/

keywords: Supplemental, Physiological Genomics

published: 2021-01-04

Zhao, Lei; Oleson, Keith; Bou-Zeid, Elie; Krayenhoff, Eric Scott; Bray, Andrew; Zhu, Qing; Zheng, Zhonghua; Chen, Chen; Oppenheimer, Michael (2021): Multi-model urban climate projections data from: Global multi-model projections of local urban climates. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4585244_V1

This dataset contains the emulated global multi-model urban climate projections under RCP 8.5 and RCP 4.5 used in the article "Global multi-model projections of local urban climates" (https://www.nature.com/articles/s41558-020-00958-8). Details about this dataset and the local urban climate emulator are described in the article. This dataset documents the monthly mean projections of urban temperatures and urban relative humidity of 26 CMIP5 Earth system models (ESMs) from 2006 to 2100 across the globe. This dataset may be useful for multiple communities regarding urban climate change, impacts, vulnerability, risks, and adaptation applications.

keywords: Urban climate; multi-model climate projections; CMIP; urban warming; heat stress

published: 2022-09-07

Long, Stephen P.; Wang, Yu; Stutz, Samantha S. (2022): Data for Increased bundle sheath leakiness of CO2 during photosynthetic induction shows a lack of coordination between the C4 and C3 cycles. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1181155_V1

We developed a new application of isotopic gas exchange which couples a tunable diode laser absorption spectroscope (TDL) with a leaf gas exchange system, analyzing leakiness through induction of C4 photosynthesis on dark to high-light transitions. The youngest fully expanded leaf was measured on 40-45 day-old maize(B73) and sorghum (Tx430). Detail definition of each variable in raw Li-6400XT and Li-6800 (in "Original_data_AND_Data_processing_code.zip") is summarized in: <a href="https://www.licor.com/env/support/LI-6800/topics/symbols.html#const">https://www.licor.com/env/support/LI-6800/topics/symbols.html#const</a>

keywords: leakiness; bundle sheath leakage; C4 photosynthesis; photosynthetic induction; non-steady-state photosynthesis; carbon isotope discrimination; photosynthetic efficiency; corn

published: 2022-11-28

Zhang, Na; Sharma, Bijay P.; Khanna, Madhu (2022): Data for Determining Spatially Varying Profit-Maximizing Management Practices for Miscanthus and Switchgrass Production in the Rainfed United States. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9793873_V1

The compiled datasets include county-level variables used for simulating miscanthus and switchgrass production in 2287 counties across the rainfed US including 5-year (2012-2016) averaged growing season degree days (GDD), 5-year (2012-2016) averaged growing season cumulative precipitation, National Commodity Crop Productivity Index (NCCPI) values, regional dummies (only for miscanthus), the regional-level random effect of the yield response function, N price, land cash rent, the first year fixed cost (only for switchgrass), and separate datasets for simulating an alternative model assuming a constant N rate. The GAMS codes are used to run the simulation to obtain the main results including the age-varying profit-maximizing N rate, biomass yields, and annual profits for miscanthus and switchgrass production across counties in the rainfed US. The STATA codes are used to merge and analyze simulation results and create summary statistics tables and key figures.

keywords: Age; Miscanthus; Net present value; Nitrogen; Optimal lifespan; Profit maximization; Switchgrass; Yield; Center for Advanced Bioenergy and Bioproducts Innovation

published: 2022-08-06

Carson, Dawn; Kopsco, Heather; Gronemeyer, Peg; Mateus-Pinilla, Nohra; Smith, Genee; Sandstrom, Emma; Smith, Rebecca (2022): Knowledge, attitudes, and practices of Illinois medical professionals related to ticks and tick-borne disease. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0685545_V1

An online knowledge, attitudes, and practices survey on ticks and tick-borne diseases was distributed to medical professionals in Illinois during summer 2020 to fall 2021. These are the raw data associated with that survey and the survey questions used. Age, gender, and county of practice have been removed for identifiability. We have added calculated values (columns 165 to end), including: the tick knowledge score, TBD knowledge score, and total knowledge score, which are the sum of the total number of correct answers in each category, and score percent, which are the proportion of correct answers in each category; region, which is determined from the county of practice; TBD relevant practice, which separates the practice variable into TBD primary, secondary, and non-responders; and several variables which group categories.

keywords: ticks; medicine; tick-borne disease; survey

published: 2022-08-06

Madhavan, Vidya; Aishwarya, Anuva (2022): Data for Spin-selective tunneling from nanowires of the candidate topological Kondo insulator SmB6. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9971603_V1

This dataset consists of all the files and codes that are part of the manuscript (main text and supplement) titled "Spin-selective tunneling from nanowires of the candidate topological Kondo insulator SmB6". For detailed information on the individual files refer to the specific readme files.

keywords: Topology; Kondo Inuslator; Spin; Scanning tunneling microscopy; antiferromagnetism

published: 2018-08-29

Finlon, Joseph (2018): Matched Radar and Microphysical Properties During MC3E. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6396968_V1

This dataset contains best estimates of the particle size distribution and measurements of the radar reflectivity factor and total water content for instances where ground-based radar and airborne microphysical observations were considered collocated with each other.

keywords: MC3E; MCS; GPM; microphysics; radar; aircraft; ice

published: 2018-10-03

Das, Anupam; Acar, Gunes; Borisov, Nikita; Pradeep, Amogh (2018): A Crawl of the Mobile Web Measuring Sensor Accesses. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9213932_V1

This dataset is the result of three crawls of the web performed in May 2018. The data contains raw crawl data and instrumentation captured by OpenWPM-Mobile, as well as analysis that identifies which scripts access mobile sensors, which ones perform some of browser fingerprinting, as well as clustering of scripts based on their intended use. The dataset is described in the included README.md file; more details about the methodology can be found in our ACM CCS'18 paper: Anupam Das, Gunes Acar, Nikita Borisov, Amogh Pradeep. The Web's Sixth Sense: A Study of Scripts Accessing Smartphone Sensors. In Proceedings of the 25th ACM Conference on Computer and Communications Security (CCS), Toronto, Canada, October 15–19, 2018. (Forthcoming)

keywords: mobile sensors; web crawls; browser fingerprinting; javascript

published: 2020-06-03

Zhang, Jun; Wuebbles, Donald; Kinnison, Douglas; Baughcum, Steven (2020): Potential Impacts of Supersonic Aircraft on Stratospheric Ozone and Climate. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9081595_V1

This datasets provide basis of our analysis in the paper - Potential Impacts of Supersonic Aircraft on Stratospheric Ozone and Climate. All datasets here can be categorized into emission data and model output data (WACCM). All the model simulations (background and perturbation) were run to steady-state and only the datasets used in analysis are archived here.

keywords: NetCDF; Supersonic aircraft; Stratospheric ozone; Climate

published: 2021-04-29

Jackson, Nicole ; Konar, Megan ; Debaere, Peter; Sheffield, Justin (2021): Data for "Crop-specific exposure to extreme temperature and moisture for the globe for the last half century". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5457902_V1

Global assessments of climate extremes typically do not account for the unique characteristics of individual crops. A consistent definition of the exposure of specific crops to extreme weather would enable agriculturally-relevant hazard quantification. We introduce the Agriculturally-Relevant Exposure to Shocks (ARES) model, a novel database of both the temperature and moisture extremes facing individual crops by explicitly accounting for crop characteristics. Specifically, we estimate crop-specific temperature and moisture shocks during the growing season for a 0.25-degree spatial grid and daily time scale from 1961-2014 globally for 17 crops. The resulting database presented here provides annual crop- and event-specific exposure rates. Both gridded and country-level exposure rates are provided for each of the 17 crops. Our results provide new insights into the changes in the magnitude as well as spatial and temporal distribution of extreme events that impact crops over the past half-century. For additional information, please see the related paper by Jackson et al. (2021) in Environmental Research Letters.

keywords: Crop-specific; weather extremes; temperature; moisture; global; gridded; time series

published: 2022-06-07

Chu, Gillian; Warnow, Tandy (2022): RNASim-VS2. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8812049_V1

Provides RNASim-VS2 datasets used in Gillian's Master's thesis.

published: 2022-07-25

Jett, Jacob (2022): SBKS - Species Raw Entity Mentions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4950847_V1

A set of species entity mentions derived from an NERC dataset analyzing 900 synthetic biology articles published by the ACS. This data is associated with the Synthetic Biology Knowledge System repository (https://web.synbioks.org/). The data in this dataset are raw mentions from the NERC data.

keywords: synthetic biology; NERC data; species mentions