Displaying 351 - 375 of 668 in total

Datasets

published: 2020-11-25

Barker, Louise; Gaulke, Sarah M.; Chace, Jordyn Z.; Davis, Mark A.; Niemiller, Matthew L.; Taylor, Steven J.; Schuett, Gordon W. (2020): Video: Agkistrodon contortrix combat behavior. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9209722_V1

Video recorded by Louise Barker using a Cannon Powershot camera documents late-season combat behavior in Agkistrodon contortrix. Recorded in Beaufort County, North Carolina, 11.1 km SE of downtown Washington on 21 October 2020.

keywords: Agkistrodon contortrix; combat; mating; reproduction; copperhead; pit viper; Viperidae;

published: 2020-12-15

Khanna, Madhu; Chen, Xiaoguang; Wang, Weiwei; Oliver, Anthony (2020): BEPAM-E Model Code and CABBI Simulation Results for "Repeal of the Clean Power Plan: Social Cost and Distributional Implications". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7562109_V1

The dataset consists of results and various input data that are used in the GAMS model for the publication "Repeal of the Clean Power Plan: Social Cost and Distributional Implications". All the data are either excel files or in the .inc format which can be read within GAMS or Notepad. Main data sources include: agriculture, transportation and electricity data. Model details can be found in the paper and the GAMS model package.

keywords: carbon abatement; welfare cost; electricity sector; partial equilibrium model

published: 2021-01-23

Willson, James; Roddur, Mrinmoy; Warnow, Tandy (2021): Data From: "Comparing Methods for Species Tree Estimation With Gene Duplication and Loss". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2418574_V1

Data sets from "Comparing Methods for Species Tree Estimation With Gene Duplication and Loss." It contains data simulated with gene duplication and loss under a variety of different conditions.

keywords: gene duplication and loss; species-tree inference;

published: 2021-06-16

Warnow , Tandy; Wedell, Eleanor (2021): Fragmentary Sequences for Variable-Sized RNAsim Datasets. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8788479_V1

Thank you for using these datasets. These RNAsim aligned fragmentary sequences were generated from the query sequences selected by Balaban et al. (2019) in their variable-size datasets (https://doi.org/10.5061/dryad.78nf7dq). They were created for use for phylogenetic placement with the multiple sequence alignments and backbone trees provided by Balaban et al. (2019). The file structures included here also correspond with the data Balaban et al. (2020) provided. This includes: Directories for five varying backbone tree sizes, shown as 5000, 10000, 50000, 100000, and 200000. These directory names are also used by Balaban et al. (2019), and indicate the size of the backbone tree included in their data. Subdirectories for each replicate from the backbone tree size labelled 0 through 4. For the smaller four backbone tree sizes there are five replicates, and for the largest there is one replicate. Each replicate contains 200 text files with one aligned query sequence fragment in fasta format.

keywords: Fragmentary Sequences; RNAsim

published: 2019-10-23

Ouldali, Hadjer; Sarthak, Kumar; Ensslen, Tobias; Piguet, Fabien; Manivet, Philippe; Pelta, Juan; Behrends, Jan C.; Aksimentiev, Aleksei; Oukhaled, Abdelghani (2019): Experiment and simulation raw data for Electrical recognition of the twenty proteinogenic amino acids using an aerolysin nanopore. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4905767_V1

Raw MD simulation trajectory, input and configuration files, SEM current data, and experimental raw data accompanying the publication, "Electrical recognition of the twenty proteinogenic amino acids using an aerolysin nanopore". README.md contains a description of all associated files.

keywords: molecular dynamics; protein sequencing; aerolysin; nanopore sequencing

published: 2019-10-05

Saurabh, Jha; Archit, Patke; Mike, Showerman; Jeremy, Enos; Greg, Bauer; Zbigniew, Kalbarczyk; Ravishankar, Iyer; William , Kramer (2019): Monet - Blue Waters Network Dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2921318_V1

This dataset contains collected and aggregated network information from NCSA’s Blue Waters system, which is comprised of 27,648 nodes connected via Cray Gemini* 3D torus (dimension 24x24x24) interconnect, from Jan/01/2017 to May/31/2017. Network performance counters for links are exposed via Cray's gpcdr (<a href="https://github.com/ovis-hpc/ovis/wiki/gpcdr-kernel-module">https://github.com/ovis-hpc/ovis/wiki/gpcdr-kernel-module</a>) kernel module. Lightweight Distributed Metric Service ([LDMS](<a href="https://github.com/ovis-hpc/ovis">https://github.com/ovis-hpc/ovis</a>)) is used to sampled the performance counters at 60 second intervals. Please read "README.md" file. Acknowledgement: This dataset is collected as a part of the Blue Waters sustained-petascale computing project, which is supported by the National Science Foundation and the state of Illinois. Blue Waters is a joint effort of the University of Illinois at Urbana-Champaign and its National Center for Supercomputing Applications.

keywords: HPC; Interconnect; Network; Congestion; Blue Waters; Dataset

published: 2021-11-19

Shen, Chengze; Park, Minhyuk; Warnow, Tandy (2021): Seven ROSE datasets in high and low fragmentation conditions. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6128941_V1

This is a general description of the datasets included in this upload; details of each dataset can be found in the individual README.txt in each compressed folder. We have: 1. ROSE-HF.tar.gz 2. ROSE-LF.tar.gz HF (high fragmentary): 50% of the sequences are made fragmentary, which have average lengths of 25% of the original lengths with a standard deviation of 60 bp. LF (low fragmentary): 25% of the sequences are made fragmentary, which have average lengths of 50% of the original lengths with a standard deviation of 60 bp. The seven ROSE datasets made fragmentary are: 1000L1, 1000L3, 1000L4, 1000M3, 1000S1, 1000S2 and 1000S4. "ROSE-HF.tar.gz" contains HF versions of the seven ROSE datasets. "ROSE-LF.tar.gz" contains LF versions of the seven ROSE datasets.

keywords: ROSE; simulation; fragmentary

published: 2022-03-20

Lee, Sangjun; Huang, Edwin W.; Johnson, Thomas A.; Guo, Xuefei; Husain, Ali A.; Mitrano, Matteo; Lu, Kannan; Zakrzewski, Alexander V.; de la Pena, Gilberto A.; Peng, Yingying; Huang, Hai; Lee, Sang-Jun; Jang, Hoyoung; Lee, Jun-Sik; Joe, Young Il; Doriese, William B.; Szypryt, Paul; Swetz, Daniel S.; Chi, Songxue; Aczel, Adam A.; MacDougall, Gregory J.; Kivelson, Steven A. ; Fradkin, Eduardo; Abbamonte, Peter (2022): Data for "Generic character of charge and spin density waves in superconducting cuprates". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1757317_V1

Data for "Generic character of charge and spin density waves in superconducting cuprates". - Neutron scattering data for SDW - RSXS scans of CDW of LESCO x=0.10, 0.125, 0.15, 0.17, 0.20 at various temperatures. - Temperature dependence of CDW peak intensity, correlation length, Qcdw (Lorentzian fit, S(q,T) fit, Landau-Ginzburg fit) - XAS data of LESCO x=0.10, 0.125, 0.15, 0.17, 0.20

published: 2020-09-18

Clark, Lindsay; Njuguna, Joyce; Jin, Xiaoli; Petersen, Karen; Anzoua, Kossanou G.; Bagmet, Larissa; Chebukin, Pavel; Deuter, Martin; Dzyubenko, Elena; Dzyubenko, Nicolay; Heo, Kweon; Johnson, Douglas A.; Jørgensen, Uffe; Kjeldsen, Jens B.; Nagano, Hironori; Peng, Junhua; Sabitov, Andrey; Yamada, Toshihiko; Yoo, Ji Hye; Yu, Chang Yeon; Long, Stephen P.; Sacks, Erik (2020): RAD-seq genotypes for a Miscanthus sacchariflorus diversity panel. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8170405_V1

Restriction site-associated DNA sequencing (RAD-seq) data from 643 Miscanthus accessions from a diversity panel, including 613 Miscanthus sacchariflorus, three M. sinensis, and 27 M. xgiganteus. DNA was digested with PstI and MspI, and single-end Illumina sequencing was performed adjacent to the PstI site. Variant and genotype calling was performed with TASSEL-GBSv2, using the Miscanthus sinensis v7.1 reference genome from Phytozome 12 (https://phytozome.jgi.doe.gov). Additional ploidy-aware genotype calling was performed by polyRAD v1.1.

keywords: variant call format (VCF); genotyping-by-sequencing (GBS); single nucleotide polymorphism (SNP); grass; genetic diversity; biomass

published: 2020-08-01

Xu, Ye; Dietrich, Christopher H.; Zhang, Yalin; Dmitriev, Dmitry; Zhang, Li; Wang, Yi-Mei; Lu, Si-Han; Qin, Dao-Zheng (2020): NEXUS morphological data file for phylogenetic analysis of Empoascini. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4470290_V1

The Empoascini_morph_data.nex text file contains the original data used in the phylogenetic analyses of Xu et al. (Systematic Entomology, in review). The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The first nine lines of the file indicate the file type (Nexus), that 110 taxa were analyzed, that a total of 99 characters were analyzed, the format of the data, and specification for symbols used in the dataset to indicate different character states. For species that have more than one state for a particular character, the states are enclosed in square brackets. Question marks represent missing data.The pdf file, Appendix1.pdf, is available here and describes the morphological characters and character states that were scored in the dataset. The data analyses are described in the cited original paper.

keywords: Hemiptera; Cicadellidae; morphology; biogeography; evolution

published: 2021-02-28

Ghosh, Sudipta; Riemer, Nicole; Giuliani, Graziano; Giorgi , Filippo; Ganguly, Dilip; Dey, Sagnik (2021): Implementation of dynamic ageing of carbonaceous aerosols in regional climate model RegCM. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0533274_V1

This dataset contains the RegCM4 simulations used in the article " Implementation of dynamic ageing of carbonaceous aerosols in regional climate model RegCM". This dataset was used to investigate the impact of a new aging parameterisation scheme implemented in a regional climate model RegCM4. The dataset contains two sets of simulations: Expt_fix and Expt_dyn. It consists of the seasonal mean and daily mean values of the variables that were used to create the visualizations of this study. The Expt_fix and Expt_dyn dataset contain 34 and 38 NetCDF files, respectively. The CERES_vs_2expts_new.mat file is the comparison between CERES shortwave downward flux at the surface and same model outputs from two experiments for clear sky and all sky conditions. -------------------------------------------------- The following information about the dataset was generated on 2021-01-08 by SUDIPTA GHOSH GENERAL INFORMATION 1. Date of data collection (single date, range, approximate date): 2019-01-01 to 2019-12-31 2. Geographic location of data collection: Urbana-Champaign,Illinois, USA 3. Information about funding sources that supported the collection of the data: This work is supported by the MoEFCC under the NCAP-COALESCE project [Grant No. 14/10/2014-CC]. The first author acknowledges DST-INSPIRE fellowship [IF150055] and Fulbright-Kalam Climate Doctoral fellowship. N. R. acknowledges funding from NSF AGS-1254428 and DOE grant DE-SC0019192. Department of Science and Technology, Funds for Improvement of Science and Technology infrastructure in universities and higher educational institutions (DST-FIST) grant (SR/FST/ESII-016/2014) are acknowledged for the computing support. DATA & FILE OVERVIEW 1. File List: Expt_fix and Expt_dyn datasets contain the analysed seasonal means and daily means of the variables that have been used to create the visualizations of this study. Each of the Expt_fix and Expt_dyn datasets contains 34 and 38 NetCDF files, respectively. 2. Relationship between files, if important: NA 3. Additional related data collected that was not included in the current data package: No METHODOLOGICAL INFORMATION 1. Description of methods used for collection/generation of data: The model RegCM4 code is freely available online from <a href="http://gforge.ictp.it/gf/project/regcm/">http://gforge.ictp.it/gf/project/regcm/</a>. The anthropogenic aerosol emissions considered for the simulations are taken from IIASA inventory. The data used can be easily accessed online <a href="http://clima-dods.ictp.it/regcm4/">http://clima-dods.ictp.it/regcm4/</a> website. TRMM observed precipitation data can be assessed from <a href="https://giovanni.gsfc.nasa.gov/giovanni/">https://giovanni.gsfc.nasa.gov/giovanni/</a> website. CRU temperature data is available at <a href="https://crudata.uea.ac.uk/cru/data/hrg/">https://crudata.uea.ac.uk/cru/data/hrg/</a>. CERES satellite surface shortwave downward fluxes are available at <a href="https://ceres.larc.nasa.gov/data/">https://ceres.larc.nasa.gov/data/</a> website. Input files for the RegCM4 model are archived in <a href="http://clima-dods.ictp.it/regcm4/">http://clima-dods.ictp.it/regcm4/</a> website. This dataset contains the RegCM4 simulations used in the article " Implementation of dynamic ageing of carbonaceous aerosols in regional climate model RegCM ". Two sets of simulations: Expt_fix and Expt_dyn consists of the output data . This dataset only contains the analysed seasonal mean and daily mean of the variables that have been used to create the visualizations of this study. Each of Expt_fix and Expt_dyn contains 34 and 38 NetCDF files respectively. This dataset was used to investigate the impact of a new aging parameterisation scheme implemented in a regional climate model RegCM4. 2. Methods for processing the data: Seasonal Mean and daily average values were extracted from 6-hourly model output. 3. Instrument- or software-specific information needed to interpret the data: CDO-1.7.1, Grads-2.0.a9, Matlab2016b 4. Standards and calibration information, if appropriate: NA 5. Environmental/experimental conditions: NA 6. Describe any quality-assurance procedures performed on the data: NA 7. People involved with sample collection, processing, analysis and/or submission: Sudipta Ghosh, Nicole Riemer, Graziano Giuliani, Filippo Giorgi, Dilip Ganguly, Sagnik Dey DATA-SPECIFIC INFORMATION FOR: Expt_fix_data.tar.gz 1. Number of variables: 29 2. Number of cases/rows: NA 3. Variable List: Mass concentration (Kg m-3) of BC, BC_HB, BC_HL, OC, OC_HB, OC_HL; Columnar burden (mg m-2)] of BC, BC_HL, BC_HB, OC; Dry deposition flux (mg m-2 day-1) of BC_HB, BC_HL, OC_HB, OC_HL; Wet deposition flux due washout (mg m-2 day-1) of BC_HB, BC_HL, OC_HB, OC_HL; Wet deposition flux due to rainout (mg m-2 day-1) of BC_HB, BC_HL OC_HB, OC_HL; AOD (unit less), precipitation (Kg m-2 s-1), temperature (K) , v-wind (m s-1), u-wind (m s-1), Surface shortwave downward flux (W m-2), Shortwave radiative forcing at the surface and top of atmosphere (W m-2) DATA-SPECIFIC INFORMATION FOR: Expt_dyn_data.tar.gz 1. Number of variables: 30 2. Number of cases/rows: NA 3. Variable List: Mass concentration (Kg m-3) of BC, BC_HB, BC_HL, OC, OC_HB, OC_HL; Columnar burden (mg m-2)] of BC, BC_HL, BC_HB, OC; Dry deposition flux (mg m-2 day-1) of BC_HB, BC_HL OC_HB, OC_HL; Wet deposition flux due washout (mg m-2 day-1) of BC_HB, BC_HL OC_HB, OC_HL; Wet deposition flux due to rainout (mg m-2 day-1) of BC_HB, BC_HL OC_HB, OC_HL; AOD (unit less); precipitation (Kg m-2 s-1); temperature (K); v-wind (m s-1); u-wind (m s-1); Surface shortwave downward flux (W m-2); Shortwave radiative forcing at the surface and top of atmosphere (W m-2); ageingscale (s-1) DATA-SPECIFIC INFORMATION FOR: CERES_vs_2expts_new.mat 1. Number of variables: 12 2. Number of cases/rows: NA 3. Variable List: Surface shortwave downward flux for clear sky (W/m-2) for CERES, Expt_fix, Expt_dyn (for winter JF and monsoon JJAS seasons); Surface shortwave downward flux for all sky conditions (W/m-2) for CERES, Expt_fix, Expt_dyn (for winter JF and monsoon JJAS seasons). NOTE: The following information applies for all three (3) files: Missing data codes: NA Specialized formats or other abbreviations used: NA

keywords: Carbonaceous aerosols; ageing parameterisation scheme; regional climate model; NetCDF

published: 2021-08-05

Lotspeich-Yadao, Michael (2021): State of Illinois - Common Spatial Geodatabase for the Social Sciences. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4857915_V1

This geodatabase serves two purposes: 1) to provide State of Illinois agencies with a fast resource for the preparation of maps and figures that require the use of shape or line files from federal agencies, the State of Illinois, or the City of Chicago, and 2) as a start for social scientists interested in exploring how geographic information systems (whether this is data visualization or geographically weighted regression) can bring new meaning to the interpretation of their data. All layer files included are relevant to the State of Illinois. Sources for this geodatabase include the U.S. Census Bureau, U.S. Geological Survey, City of Chicago, Chicago Public Schools, Chicago Transit Authority, Regional Transportation Authority, and Bureau of Transportation Statistics.

keywords: State of Illinois; City of Chicago; Chicago Public Schools; GIS; Statistical tabulation areas; hydrography

published: 2020-09-25

Androwski, Rebecca; Asad, Nadeem; Wood, Janet; Hofer, Allison; Locke, Steven; Smith, Cassandra; Rose, Becky; Schroeder, Nathan (2020): Data From: Mutually exclusive dendritic arbors in C. elegans neurons share a common architecture and convergent molecular cues. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0818023_V1

This dataset includes neuronal development of the Caenorhabditis elegans dauer and adult.

keywords: Nematode; Dendrite; Stress

published: 2021-03-08

Jaikumar, Nikhil S.; Fernandes, Samuel B.; Leakey, Andrew D.B.; Brown, Patrick J.; Stutz, Samantha S.; Bernacchi, Carl; Long, Stephen P. (2021): Photosynethic Performance Measurements in Biomass Sorghum Varietals in Central Illinois during Four Growing Seasons.. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4580996_V2

In a set of field studies across four years, the effect of self-shading on photosynthetic performance in lower canopy sorghum leaves was studied at sites in Champaign County, IL. Photosynthetic parameters in upper and lower canopy leaves, carbon assimilation, electron transport, stomatal conductance, and activity of three C4-specific photosynthetic enzymes, were compared within a genetically diverse range of accessions varying widely in canopy architecture and thereby in the degree of self-shading. Accessions with erect leaves and high light transmission through the canopy are henceforth referred to as ‘erectophile’ and those with low leaf erectness, ‘planophile’. In the final year of the study, bundle sheath leakiness in erectophile and planophile accessions was also compared.

keywords: Sorghum; Photosynethic Performance; Leaf Inclination

published: 2019-09-17

Mishra, Shubhanshu (2019): Trained models for multi-task multi-dataset learning for text classification as well as sequence tagging in tweets. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1094364_V1

Trained models for multi-task multi-dataset learning for text classification as well as sequence tagging in tweets. Classification tasks include sentiment prediction, abusive content, sarcasm, and veridictality. Sequence tagging tasks include POS, NER, Chunking, and SuperSenseTagging. Models were trained using: <a href="https://github.com/socialmediaie/SocialMediaIE/blob/master/SocialMediaIE/scripts/multitask_multidataset_classification_tagging.py">https://github.com/socialmediaie/SocialMediaIE/blob/master/SocialMediaIE/scripts/multitask_multidataset_classification_tagging.py</a> See <a href="https://github.com/socialmediaie/SocialMediaIE">https://github.com/socialmediaie/SocialMediaIE</a> and <a href="https://socialmediaie.github.io">https://socialmediaie.github.io</a> for details. If you are using this data, please also cite the related article: Shubhanshu Mishra. 2019. Multi-dataset-multi-task Neural Sequence Tagging for Information Extraction from Tweets. In Proceedings of the 30th ACM Conference on Hypertext and Social Media (HT '19). ACM, New York, NY, USA, 283-284. DOI: https://doi.org/10.1145/3342220.3344929

keywords: twitter; deep learning; machine learning; trained models; multi-task learning; multi-dataset learning; classification; sequence tagging

published: 2020-08-19

Jetti, Yaswanth Sai; Dunn, Alison C. (2020): The matrix of influence coefficients due to pyramidal distribution on an overlapping hexagonal grid. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0925335_V1

This data set is a matrix of values. The element in the row "i" and the column "j" denotes the influence of hexagonal pyramidal distribution at node "i" on the node "j". The size of the matrix is 16641x16641. This matrix corresponds to a 129x129 grid. Influence coefficient matrix on a smaller grid can be obtained by appropriately choosing the elements from the bigger matrix.

keywords: Influence coefficients

published: 2021-03-14

Kang, Jeon-Young; Michels, Alexander; Lyu, Fangzheng; Wang, Shaohua; Agbodo, Nelson; Freeman, Vincent L; Wang, Shaowen; Anand, Padmanabhan (2021): Spatial accessibility of COVID-19 healthcare resources in Illinois, USA. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6582453_V1

This dataset contains all the code, notebooks, datasets used in the study conducted to measure the spatial accessibility of COVID-19 healthcare resources with a particular focus on Illinois, USA. Specifically, the dataset measures spatial access for people to hospitals and ICU beds in Illinois. The spatial accessibility is measured by the use of an enhanced two-step floating catchment area (E2FCA) method (Luo & Qi, 2009), which is an outcome of interactions between demands (i.e, # of potential patients; people) and supply (i.e., # of beds or physicians). The result is a map of spatial accessibility to hospital beds. It identifies which regions need more healthcare resources, such as the number of ICU beds and ventilators. This notebook serves as a guideline of which areas need more beds in the fight against COVID-19. ## What's Inside A quick explanation of the components of the zip file * `COVID-19Acc.ipynb` is a notebook for calculating spatial accessibility and `COVID-19Acc.html` is an export of the notebook as HTML. * `Data` contains all of the data necessary for calculations:       * `Chicago_Network.graphml`/`Illinois_Network.graphml` are GraphML files of the OSMNX street networks for Chicago and Illinois respectively.       * `GridFile/` has hexagonal gridfiles for Chicago and Illinois       * `HospitalData/` has shapefiles for the hospitals in Chicago and Illinois       * `IL_zip_covid19/COVIDZip.json` has JSON file which contains COVID cases by zip code from IDPH       * `PopData/` contains population data for Chicago and Illinois by census tract and zip code.       * `Result/` is where we write out the results of the spatial accessibility measures       * `SVI/`contains data about the Social Vulnerability Index (SVI) * `img/` contains some images and HTML maps of the hospitals (the notebook generates the maps) * `README.md` is the document you're currently reading! * `requirements.txt` is a list of Python packages necessary to use the notebook (besides Jupyter/IPython). You can install the packages with `python3 -m pip install -r requirements.txt`

keywords: COVID-19; spatial accessibility; CyberGISX

published: 2024-03-01

Chen, Chu-Chun; Dominguez, Francina (2024): Data for The location of large-scale soil moisture anomalies affects moisture transport and precipitation over southeastern South America. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0536017_V1

This dataset contains model output from the Community Earth System Model, Version 1 (CESM1; Hurrell et al., 2013) and variables from the European Centre for Medium-Range Weather Forecast (ECMWF) Reanalysis v5 (ERA5; Hersbach et al., 2020). These data were used for analysis in “The location of large-scale soil moisture anomalies affects moisture transport and precipitation over southeastern South America”, published in Geophysical Research Letters. Acknowledgments: This work was supported by NSF Award AGS-1852709. We acknowledge high-performance computing support from Cheyenne (doi:10.5065/D6RX99HX) provided by NCAR's Computational and Information Systems Laboratory, sponsored by the NSF. We thank Dr. Haiyan Teng for providing guidance on setting up the CESM experiments and offering valuable advice. References: Hersbach H, Bell B, Berrisford P, et al. The ERA5 global reanalysis. Q J R Meteorol Soc. 2020; 146: 1999–2049. https://doi.org/10.1002/qj.3803 Hurrell, J. W., and Coauthors, 2013: The Community Earth System Model: A Framework for Collaborative Research. Bull. Amer. Meteor. Soc., 94, 1339–1360, https://doi.org/10.1175/BAMS-D-12-00121.1

keywords: atmospheric sciences; climate modeling; land-atmosphere interactions; soil moisture; regional atmospheric circulation; southeastern South America

published: 2020-07-15

Legried, Brandon; Molloy, Erin K.; Warnow, Tandy; Roch, Sebastien (2020): Data from: Polynomial-Time Statistical Estimation of Species Trees under Gene Duplication and Loss. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2626814_V3

This repository includes scripts and datasets for the paper, "Polynomial-Time Statistical Estimation of Species Trees under Gene Duplication and Loss."

keywords: Species tree estimation; gene duplication and loss; identifiability; statistical consistency; quartets; ASTRAL

published: 2023-03-27

Littlefield, Alexander; Xie, Dajie; Richards, Corey; Ocier, Christian; Gao, Haibo; Messinger, Jonah; Ju, Lawrence; Gao, Jingxing; Edwards, Lonna; Braun, Paul; Goddard, Lynford (2023): Data for Enabling High Precision Gradient Index Control in Subsurface Multiphoton Lithography. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3190140_V1

This dataset contains the full data used in the paper titled "Enabling High Precision Gradient Index Control in Subsurface Multiphoton Lithography," available at https://doi.org/10.1021/acsphotonics.2c01950 . The data used for Table 1 can be found in the dataset for the related Figure 8. Some supplemental figures' data can be found in the main figures data: Figure S2's data is contained in Figure 6. Figure S4 and Table S1 data is derived from Figure 6. Figure S9 is derived from Figure 7. Figure S10 is contained in Figure 7. Figure S12 is derived from Figure 6 and the Python code prism-fringe-analysis. Figures without a data file named after them do not have any data affiliated with them and are purely graphical representations.

published: 2020-05-31

Zhang, Chuanyi; El-Kebir, Mohammed; Ochoa, Idoia (2020): Simulated multi-sample tumor bulk sequencing data. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9059263_V1

This repository includes a simulated dataset and related scripts used for the paper "Moss: Accurate Single-Nucleotide Variant Calling from Multiple Bulk DNA Tumor Samples".

keywords: Somatic Mutations; Bulk DNA Sequencing; Cancer Genomics

published: 2020-04-20

Ferrer, Astrid (2020): Data for: Contribution of fungal and invertebrate communities to mass loss and wood depolymerization in tropical terrestrial and aquatic habitats. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1530066_V1

Supplemental data sets for the Manuscript entitled "Contribution of fungal and invertebrate communities to mass loss and wood depolymerization in tropical terrestrial and aquatic habitats"

keywords: Coiba Island; wood decomposition; cellulose; hemicellulose; lignin breakdown; aquatic fungi

published: 2020-01-31

Bradshaw, Therin M.; Blake-Bradshaw, Abigail G.; Fournier, Auriel M.V.; Lancaster, Joseph D. ; O'Connell, John; Jacques, Christopher N.; Eicholtz, Michael W.; Hagy, Heath M (2020): Marsh bird occupancy of wetlands managed for waterfowl in the Midwestern USA - Analysis Inputs. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5152821_V1

Data inputs, and scripts for the analysis detailed in Bradshaw et al, published in PlosONE 2020.

keywords: Marsh birds; wetlands

published: 2020-06-19

Copas, Katherine (2020): World Values Survey and World Bank Data for measuring perceptions of expertise in developing nations . University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2476863_V1

This dataset include data pulled from the World Bank 2009, the World Values Survey wave 6, Transparency International from 2009. The data were used to measure perceptions of expertise from individuals in nations that are recipients of development aid as measured by the World Bank.

keywords: World Values Survey; World Bank; expertise; development

published: 2024-03-09

Mishra, Apratim; Diesner, Jana; Torvik, Vetle I. (2024): Hype - PubMed dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0651259_V1

Hype - PubMed dataset Prepared by Apratim Mishra This dataset captures ‘Hype’ within biomedical abstracts sourced from PubMed. The selection chosen is ‘journal articles’ written in English, published between 1975 and 2019, totaling ~5.2 million. The classification relies on the presence of specific candidate ‘hype words’ and their abstract location. Therefore, each article might have multiple instances in the dataset due to the presence of multiple hype words in different abstract sentences. The candidate hype words are 36 in count: 'major', 'novel', 'central', 'critical', 'essential', 'strongly', 'unique', 'promising', 'markedly', 'excellent', 'crucial', 'robust', 'importantly', 'prominent', 'dramatically', 'favorable', 'vital', 'surprisingly', 'remarkably', 'remarkable', 'definitive', 'pivotal', 'innovative', 'supportive', 'encouraging', 'unprecedented', 'bright', 'enormous', 'exceptional', 'outstanding', 'noteworthy', 'creative', 'assuring', 'reassuring', 'spectacular', and 'hopeful'. File 1: hype_dataset.csv Primary dataset. It has the following columns: 1. PMID: represents unique article ID in PubMed 2. Hype_word: Candidate hype word, such as ‘novel.’ 3. Sentence: Sentence in abstract containing the hype word. 4. Abstract_length: Length of article abstract. 5. Hype_percentile: Abstract relative position of hype word. 6. Hype_value: Propensity of hype based on the hype word, the sentence, and the abstract location. 7. Introduction: The ‘I’ component of the hype word based on IMRaD 8. Methods: The ‘M’ component of the hype word based on IMRaD 9. Results: The ‘R’ component of the hype word based on IMRaD 10. Discussion: The ‘D’ component of the hype word based on IMRaD File 2: hype_removed_phrases.csv Secondary dataset with same columns as File 1. Hype in the primary dataset is based on excluding certain phrases that are rarely hype. The phrases that were removed are included in File 2 and modeled separately. Removed phrases: 1. Major: histocompatibility, component, protein, metabolite, complex, surgery 2. Novel: assay, mutation, antagonist, inhibitor, algorithm, technique, series, method, hybrid 3. Central: catheters, system, design, composite, catheter, pressure, thickness, compartment 4. Critical: compartment, micelle, temperature, incident, solution, ischemia, concentration 5. Essential: medium, features, properties, opportunities 6. Unique: model, amino 7. Robust: regression 8. Vital: capacity, signs, organs, status, structures, staining, rates, cells, information 9. Outstanding: questions, issues, question, challenge, problems, problem, remains 10. Remarkable: properties 11. Definite: radiotherapy, surgery 12. Bright: field

keywords: Hype; PubMed; Abstracts; Biomedicine

Subject Area

Funder

Publication Year

License

Datasets