Illinois Data Bank
Deposit Dataset
Find Data
Policies
Guides
Contact Us
Log in with NetID
Toggle navigation
Illinois Data Bank
Deposit Dataset
Find Data
Policies
Guides
Contact Us
Log in with NetID
Displaying 301 - 325 of 738 in total
<
1
2
…
9
10
11
12
13
14
15
16
17
…
29
30
>
25 per page
50 per page
Show All
Go
Clear Filters
Generate Report from Search Results
Subject Area
Life Sciences (395)
Social Sciences (139)
Physical Sciences (106)
Technology and Engineering (68)
Uncategorized
Arts and Humanities (1)
Funder
Other (227)
U.S. National Science Foundation (NSF) (211)
U.S. Department of Energy (DOE) (76)
U.S. National Institutes of Health (NIH) (73)
U.S. Department of Agriculture (USDA) (50)
Illinois Department of Natural Resources (IDNR) (20)
U.S. Geological Survey (USGS) (7)
U.S. National Aeronautics and Space Administration (NASA) (6)
Illinois Department of Transportation (IDOT) (4)
U.S. Army (2)
Publication Year
2024 (110)
2021 (108)
2022 (106)
2020 (96)
2023 (75)
2019 (72)
2018 (61)
2025 (39)
2017 (36)
2016 (30)
2009 (1)
2011 (1)
2012 (1)
2014 (1)
2015 (1)
License
CC0 (403)
CC BY (312)
custom (23)
Illinois Data Bank Dataset Search Results
Dataset Search Results
published: 2023-03-08
Majeed, Fahd; Khanna, Madhu (2023): Code and Data for "Carbon Mitigation Payments Can Reduce the Riskiness of Bioenergy Crop Production". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6296964_V1
A stochastic domination analysis model was developed to examine the effect that emerging carbon markets can have on the spatially varying returns and risk profiles of bioenergy crops relative to conventional crops. The code is written in MATLAB, and includes the calculated output. See the README file for instructions to run the code.
keywords:
bioenergy crops; economic modeling; stochastic domination analysis model;
published: 2018-12-20
Dong, Xiaoru; Xie, Jingyi; Linh, Hoang (2018): Inclusion_Criteria_Annotation. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5958960_V1
File Name: Inclusion_Criteria_Annotation.csv Data Preparation: Xiaoru Dong Date of Preparation: 2018-12-14 Data Contributions: Jingyi Xie, Xiaoru Dong, Linh Hoang Data Source: Cochrane systematic reviews published up to January 3, 2018 by 52 different Cochrane groups in 8 Cochrane group networks. Associated Manuscript authors: Xiaoru Dong, Jingyi Xie, Linh Hoang, and Jodi Schneider. Associated Manuscript, Working title: Machine classification of inclusion criteria from Cochrane systematic reviews. Description: The file contains lists of inclusion criteria of Cochrane Systematic Reviews and the manual annotation results. 5420 inclusion criteria were annotated, out of 7158 inclusion criteria available. Annotations are either "Only RCTs" or "Others". There are 2 columns in the file: - "Inclusion Criteria": Content of inclusion criteria of Cochrane Systematic Reviews. - "Only RCTs": Manual Annotation results. In which, "x" means the inclusion criteria is classified as "Only RCTs". Blank means that the inclusion criteria is classified as "Others". Notes: 1. "RCT" stands for Randomized Controlled Trial, which, in definition, is "a work that reports on a clinical trial that involves at least one test treatment and one control treatment, concurrent enrollment and follow-up of the test- and control-treated groups, and in which the treatments to be administered are selected by a random process, such as the use of a random-numbers table." [Randomized Controlled Trial publication type definition from https://www.nlm.nih.gov/mesh/pubtypes.html]. 2. In order to reproduce the relevant data to this, please get the code of the project published on GitHub at: https://github.com/XiaoruDong/InclusionCriteria and run the code following the instruction provided.
keywords:
Inclusion criteria, Randomized controlled trials, Machine learning, Systematic reviews
published: 2019-03-25
Clark, Lindsay V.; Dwiyanti, Maria Stefanie; Anzoua, Kossonou G.; Brummer, Joe E.; Ghimire, Bimal Kumar; Głowacka, Katarzyna; Hall, Megan; Heo, Kweon; Jin, Xiaoli; Lipka, Alexander E.; Peng, Junhua; Yamada, Toshihiko; Yoo, Ji Hye; Yu, Chang Yeon; Zhao, Hua; Long, Stephen P.; Sacks, Erik J. (2019): Miscanthus sinensis multi-location trial: phenotypic analysis, genome-wide association, and genomic prediction . University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0790815_V3
This dataset contains genotypic and phenotypic data, R scripts, and the results of analysis pertaining to a multi-location field trial of Miscanthus sinensis. Genome-wide association and genomic prediction were performed for biomass yield and 14 yield-component traits across six field trial locations in Asia and North America, using 46,177 single-nucleotide polymorphism (SNP) markers mined from restriction site-associated DNA sequencing (RAD-seq) and 568 M. sinensis accessions. Genomic regions and candidate genes were identified that can be used for breeding improved varieties of M. sinensis, which in turn will be used to generate new M. xgiganteus clones for biomass.
keywords:
miscanthus; genotyping-by-sequencing (GBS); genome-wide association studies (GWAS); genomic selection
published: 2024-02-15
Hoggatt, Meredith; Starbuck, Clarissa; O'Keefe, Joy (2024): Data for "Acoustic monitoring yields informative bat population density estimates". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7001459_V1
Dataset includes the dataset for estimating bat density from acoustic data and the R code. The data support a publication by Meredith L. Hoggatt, Clarissa A. Starbuck, and Joy M. O'Keefe entitled Acoustic monitoring yields informative bat population density estimates.
keywords:
acoustics; bats; monitoring; population density; random encounter model
published: 2019-02-22
Fernández, Roberto; Parker, Gary; Stark, Colin (2019): Experiments on patterns of alluvial cover and bedrock erosion in a meandering channel. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2-3044828_V1
This dataset includes measurements taken during the experiments on patterns of alluvial cover over bedrock. The dataset includes an hour worth of timelapse images taken every 10s for eight different experimental conditions. It also includes the instantaneous water surface elevations measured with eTapes at a frequency of 10Hz for each experiment. The 'Read me Data.txt' file explains in more detail the contents of the dataset.
keywords:
bedrock; erosion; alluvial; meandering; alluvial cover; sinuosity; flume; experiments; abrasion;
published: 2024-02-16
Zhang, Mingxiao; Sutton, Bradley (2024): Sample Data for “Measuring CSF Shunt Flow with MRI Using Flow Enhancement of Signal Intensity (FENSI)”. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7252521_V1
Sample data from one typical phantom test and one deidentified shunt patient test (shown in Fig. 8 of the MRM paper), with the corresponding analysis code for the Shunt-FENSI technique. For the MRM paper “Measuring CSF Shunt Flow with MRI Using Flow Enhancement of Signal Intensity (FENSI)”
keywords:
Shunt-FENSI; MRM; Hydrocephalus; VP Shunt; Flow Quantification; Pediatric Neurosurgery; Pulse Sequence; Signal Simulation
published: 2016-12-20
Wickes, Elizabeth; Nakamura, Katia (2016): Supporting data processing scripts and example data for Peru AIDData analysis. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7860393_V1
Scripts and example data for AIDData (aiddata.org) processing in support of forthcoming Nakamura dissertation. This dataset includes two sets of scripts and example data files from an aiddata.org data dump. Fuller documentation about the functionality for these scripts is within the readme file. Additional background information and description of usage will be in the forthcoming Nakamura dissertation (link will be added when available). Data originally supplied by Nakamura. Python code and this readme file created by Wickes. Data included within this deposit are examples to demonstrate execution. Roughly, there are two python scripts in here: keyword_search.py, designed to assist in finding records matching specific keywords, and matching_tool.ipynb, designed to assist in detection of which records are and are not contained within a keyword results file and an aiddata project data file.
keywords:
aiddata; natural resources
published: 2021-11-05
Keralis, Spencer D. C.; Yakin, Syamil (2021): Becoming A Trans Inclusive Library - Library Employee Survey. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0888551_V1
This data set contains survey results from a 2021 survey of University of Illinois University Library employees conducted as part of the Becoming A Trans Inclusive Library Project to evaluate the awareness of University of Illinois faculty, staff, and student employees regarding transgender identities, and to assess the professional development needs of library employees to better serve trans and gender non-conforming patrons. The survey instrument is available in the IDEALS repository: http://hdl.handle.net/2142/110080.
keywords:
transgender awareness, academic library, gender identity awareness, professional development opportunities
published: 2022-10-27
Holiman, Haley; Kitaif, J. Carson; Fournier, Auriel M.V.; Iglay, Ray; Woodrey, Mark S. (2022): Estimating ability to detect secretive marsh birds over distance using autonomous recording units. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4450613_V1
keywords:
marsh birds; automated recording units
published: 2021-09-17
Stern, Jessica; Herman, Brook D. ; Matthews, Jeffrey (2021): Data from determining vegetation metric robustness to environmental and methodological variables . University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0938556_V1
We studied vegetation metric robustness to environmental (season, interannual, and regional) and methodological (observer) variables, as well as adequate sample size for vegetation metrics across four regions of the United States.
keywords:
coefficients of conservatism; floristic quality assessment; restoration; vegetation metric;
published: 2022-11-11
Hsiao, Haw-Wen; Zuo, Jian-Min (2022): Data for Chemical Short-Range Ordering in a CrCoNi Medium-Entropy Alloy. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4432073_V1
This dataset is for characterizing chemical short-range-ordering in CrCoNi medium entropy alloys. It has three sub-folders: 1. code, 2. sample WQ, 3. sample HT. The software needed to run the files is Gatan Microscopy Suite® (GMS). Please follow the instruction on this page to install the DM3 GMS: <a href="https://www.gatan.com/installation-instructions#Step1">https://www.gatan.com/installation-instructions#Step1</a> 1. Code folder contains three DM scripts to be installed in Gatan DigitalMicrograph software to analyze scanning electron nanobeam diffraction (SEND) dataset: Cepstrum.s: need [EF-SEND_sampleWQ_cropped_aligned.dm3] in Sample WQ and the average image from [EF-SEND_sampleWQ_cropped_aligned.dm3]. Same for Sample HT folder. log_BraggRemoval.s: same as above. Patterson.s: Need refined diffuse patterns in Sample HT folder. 2. Sample WQ and 3. Sample HT folders both contain the SEND data (.ser) and the binned SEND data (.dm3) as well as our calculated strain maps as the strain measurement reference. The Sample WQ folder additionally has atomic resolution STEM images; the Sample HT folder additionally has three refined diffuse patterns as references for diffraction data processing. * Only .ser file is needed to perform the strain measurement using imToolBox as listed in the manuscript. .emi file contains the meta data of the microscope, which can be opened together with .ser file using FEI TIA software.
keywords:
Medium entropy alloy; CrCoNi; chemical short-range-ordering; CSRO; TEM
published: 2022-11-09
Wang, Junren; Konar, Megan; Dalin, Carole; Liu, Yu; Stillwell, Ashlynn S.; Xu, Ming; Zhu, Tingju (2022): Data for: Economic and Virtual Water Multilayer Networks in China. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5215221_V1
This dataset includes the blue water intensity by sector (41 industries and service sectors) for provinces in China, economic and virtual water network flow for China in 2017, and the corresponding network properties for these two networks.
keywords:
Economic network; Virtual water; Supply chains; Network analysis; Multilayer; MRIO
published: 2023-04-02
Lee, Yuanyao; Khanna, Madhu; Chen, Luoye (2023): Code and Data for "Quantifying Uncertainties in Greenhouse Gas Savings and Mitigation Costs with Cellulosic Biofuels". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4326514_V1
Use of cellulosic biofuels from non-feedstocks are modeled using the BEPAM (Biofuel and Environmental Policy Analysis Model) model to quantifying the uncertainties about induced land use change effects, net greenhouse gas saving potential, and economic costs. The code is in GAMS, general algebraic modeling language. NOTE: Column 3 is titled "BAU" in "merged_BAU.gdx", "merged_RFS.gdx", and "merged_CEM.gdx", but contains "RFS" data in "merged_RFS.gdx" and "CEM" data in "merged_CEM.gdx".
keywords:
cellulosic biomass; BEPAM; economic modeling
published: 2016-12-19
Hahn, James (2016): API analysis of the Minrva mobile app (May 2015 – December 2015). University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5495131_V1
Files in this dataset represent an investigation into use of the Library mobile app Minrva during the months of May 2015 through December 2015. During this time interval 45,975 API hits were recorded by the Minrva web server. The dataset included herein is an analysis of the following: 1) a delineation of API hits to mobile app modules use in the Minrva app by month, 2) a general analysis of Minrva app downloads to module use, and 3) the annotated data file providing associations from API hits to specific modules used, organized by month (May 2015 – December 2015).
keywords:
API analysis; log analysis; Minrva Mobile App
published: 2021-07-15
Castro, Daniel; Sweedler, Jonathan (2021): High-Throughput Single-Organelle Dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5949772_V1
The dataset contains the high-throughput matrix-assisted laser desorption/ionization mass spectrometry XmL files for the atrial gland and red hemiduct of Aplysia californica.
keywords:
Dense-core vesicle; High-throughput; Mass Spectrometry; MALDI; Organelle; Image-Guided; Atrial gland; red hemiduct; Lucent Vesicle
published: 2023-03-28
Hsiao, Tzu-Kun; Torvik, Vetle (2023): OpCitance: Citation contexts identified from the PubMed Central open access articles. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4353270_V2
Sentences and citation contexts identified from the PubMed Central open access articles ---------------------------------------------------------------------- The dataset is delivered as 24 tab-delimited text files. The files contain 720,649,608 sentences, 75,848,689 of which are citation contexts. The dataset is based on a snapshot of articles in the XML version of the PubMed Central open access subset (i.e., the PMCOA subset). The PMCOA subset was collected in May 2019. The dataset is created as described in: Hsiao TK., & Torvik V. I. (manuscript) OpCitance: Citation contexts identified from the PubMed Central open access articles. <b>Files</b>: • A_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with A. • B_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with B. • C_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with C. • D_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with D. • E_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with E. • F_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with F. • G_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with G. • H_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with H. • I_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with I. • J_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with J. • K_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with K. • L_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with L. • M_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with M. • N_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with N. • O_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with O. • P_p1_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with P (part 1). • P_p2_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with P (part 2). • Q_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with Q. • R_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with R. • S_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with S. • T_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with T. • UV_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with U or V. • W_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with W. • XYZ_journal_IntxtCit.tsv – Sentences and citation contexts identified from articles published in journals with journal titles starting with X, Y or Z. Each row in the file is a sentence/citation context and contains the following columns: • pmcid: PMCID of the article • pmid: PMID of the article. If an article does not have a PMID, the value is NONE. • location: The article component (abstract, main text, table, figure, etc.) to which the citation context/sentence belongs. • IMRaD: The type of IMRaD section associated with the citation context/sentence. I, M, R, and D represent introduction/background, method, results, and conclusion/discussion, respectively; NoIMRaD indicates that the section type is not identifiable. • sentence_id: The ID of the citation context/sentence in the article component • total_sentences: The number of sentences in the article component. • intxt_id: The ID of the citation. • intxt_pmid: PMID of the citation (as tagged in the XML file). If a citation does not have a PMID tagged in the XML file, the value is "-". • intxt_pmid_source: The sources where the intxt_pmid can be identified. Xml represents that the PMID is only identified from the XML file; xml,pmc represents that the PMID is not only from the XML file, but also in the citation data collected from the NCBI Entrez Programming Utilities. If a citation does not have an intxt_pmid, the value is "-". • intxt_mark: The citation marker associated with the inline citation. • best_id: The best source link ID (e.g., PMID) of the citation. • best_source: The sources that confirm the best ID. • best_id_diff: The comparison result between the best_id column and the intxt_pmid column. • citation: A citation context. If no citation is found in a sentence, the value is the sentence. • progression: Text progression of the citation context/sentence. <b>Supplementary Files</b> • PMC-OA-patci.tsv.gz – This file contains the best source link IDs for the references (e.g., PMID). Patci [1] was used to identify the best source link IDs. The best source link IDs are mapped to the citation contexts and displayed in the *_journal IntxtCit.tsv files as the best_id column. Each row in the PMC-OA-patci.tsv.gz file is a citation (i.e., a reference extracted from the XML file) and contains the following columns: • pmcid: PMCID of the citing article. • pos: The citation's position in the reference list. • fromPMID: PMID of the citing article. • toPMID: Source link ID (e.g., PMID) of the citation. This ID is identified by Patci. • SRC: The sources that confirm the toPMID. • MatchDB: The origin bibliographic database of the toPMID. • Probability: The match probability of the toPMID. • toPMID2: PMID of the citation (as tagged in the XML file). • SRC2: The sources that confirm the toPMID2. • intxt_id: The ID of the citation. • journal: The first letter of the journal title. This maps to the *_journal_IntxtCit.tsv files. • same_ref_string: Whether the citation string appears in the reference list more than once. • DIFF: The comparison result between the toPMID column and the toPMID2 column. • bestID: The best source link ID (e.g., PMID) of the citation. • bestSRC: The sources that confirm the best ID. • Match: Matching result produced by Patci. [1] Agarwal, S., Lincoln, M., Cai, H., & Torvik, V. (2014). Patci – a tool for identifying scientific articles cited by patents. GSLIS Research Showcase 2014. http://hdl.handle.net/2142/54885 • intxt_cit_license_fromPMC.tsv – This file contains the CC licensing information for each article. The licensing information is from PMC's file lists [2], retrieved on June 19, 2020, and March 9, 2023. It should be noted that the license information for 189,855 PMCIDs is <b>NO-CC CODE</b> in the file lists, and 521 PMCIDs are absent in the file lists. The absence of CC licensing information does not indicate that the article lacks a CC license. For example, PMCID: 6156294 (<b>NO-CC CODE</b>) and PMCID: 6118074 (absent in the PMC's file lists) are under CC-BY licenses according to their PDF versions of articles. The intxt_cit_license_fromPMC.tsv file has two columns: • pmcid: PMCID of the article. • license: The article’s CC license information provided in PMC’s file lists. The value is nan when an article is not present in the PMC’s file lists. [2] https://www.ncbi.nlm.nih.gov/pmc/tools/ftp/ • Supplementary_File_1.zip – This file contains the code for generating the dataset.
keywords:
citation context; in-text citation; inline citation; bibliometrics; science of science
published: 2023-04-12
Han, Edmund; Nahid, Shahriar Muhammad; Rakib, Tawfiqur; Nolan, Gillian; F. Ferrari, Paolo; Hossain, M. Abir ; Schleife, André ; Nam, SungWoo; Ertekin, Elif; van der Zande, Arend; Huang, Pinshane (2023): Data for Bend-induced ferroelectric domain walls in α-In2Se3. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1187822_V1
STEM images of kinks in α-In2Se3, DFT calculation of bending of α-In2Se3, PFM on as exfoliated and controllably bend α-In2Se3
published: 2023-08-11
Li, Shuai; Leakey, Andrew D.B.; Moller, Christopher A.; Montes, Christopher M.; Sacks, Erik J.; DeKyoung, Lee; Ainsworth, Elizabeth A. (2023): Similar photosynthetic, but different yield responses of C3 and C4 crops to elevated O3. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9446886_V1
This dataset contains leaf photosynthetic and biochemical traits, plant biomass, and yield in five C3 crops (chickpea, rice, snap bean, soybean, wheat) and four C4 crops (sorghum, maize, Miscanthus × giganteus, switchgrass) grown under ambient and elevated O3 concentration ([O3]) in the field at free-air O3 concentration enrichment (O3-FACE) facilities over the past 20 years.
keywords:
C3 and C4 crops; elevated O3; FACE; photosynthesis; yield
published: 2024-01-30
Aishwarya, Anuva; Madhavan, Vidya (2024): Data for Melting of the charge density wave by generation of pairs of topological defects in UTe2. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6515700_V1
The data files are for the paper entitled: Melting of the charge density wave by generation of pairs of topological defects in UTe2 to be published in Nature Physics. The data was obtained on a 300 mK custom designed Unisoku scanning tunneling microscope using the Nanonis module. All the data files have been named based on the Figure numbers that they represent.
keywords:
superconductivity; triplet; topology; heavy fermion; Kondo; magnetic field; charge density wave
published: 2023-08-02
Jeng, Amos; Bosch, Nigel; Perry, Michelle (2023): Data for: Phatic Expressions Influence Perceived Helpfulness in Online Peer Help-Giving: A Mixed Methods Study. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6591732_V1
This dataset was developed as part of an online survey study that investigates how phatic expressions—comments that are social rather than informative in nature—influence the perceived helpfulness of online peer help-giving replies in an asynchronous college course discussion forum. During the study, undergraduate students (N = 320) rated and described the helpfulness of examples of replies to online requests for help, both with and without four types of phatic expressions: greeting/parting tokens, other-oriented comments, self-oriented comments, and neutral comments.
keywords:
help-giving; phatic expression; discussion forum; online learning; engagement
published: 2023-09-13
Shen, Chengze; Liu, Baqiao; Williams, Kelly P.; Warnow, Tandy (2023): Additional datasets (RNASim10k) for EMMA: A New Method for Computing Multiple Sequence Alignments given a Constraint Subset Alignment. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4194451_V1
This upload contains one additional set of datasets (RNASim10k, ten replicates) used in Experiment 2 of the EMMA paper (appeared in WABI 2023): Shen, Chengze, Baqiao Liu, Kelly P. Williams, and Tandy Warnow. "EMMA: A New Method for Computing Multiple Sequence Alignments given a Constraint Subset Alignment". The zipped file has the following structure: 10k |__R0 |__unaln.fas |__true.fas |__true.tre |__R1 ... # Alignment files: 1. `unaln.fas`: all unaligned sequences. 2. `true.fas`: the reference alignment of all sequences. 3. `true.tre`: the reference tree on all sequences. For other datasets that uniquely appeared in EMMA, please refer to the related dataset (which is linked below): Shen, Chengze; Liu, Baqiao; Williams, Kelly P.; Warnow, Tandy (2022): Datasets for EMMA: A New Method for Computing Multiple Sequence Alignments given a Constraint Subset Alignment. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2567453_V1
keywords:
SALMA;MAFFT;alignment;eHMM;sequence length heterogeneity
published: 2017-09-16
Mirarab, Siavash; Warnow, Tandy (2017): Data for 16S and 23S rRNA alignments. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1614388_V1
This dataset contains the data for 16S and 23S rRNA alignments including their reference trees. The original alignments are from the Gutell Lab CRW, currently located at https://crw-site.chemistry.gatech.edu/DAT/3C/Alignment/.
published: 2019-07-11
Daniels, Melissa; Larson, Eric (2019): Data for Effects of forest windstorm disturbance on invasive plants in protected areas of southern Illinois, USA. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1401121_V1
We studied the effect of windstorm disturbance on forest invasive plants in southern Illinois. This data includes raw data on plant abundance at survey points, compiled data used in statistical analyses, and spatial data for surveyed plots and units. This file package also includes a readme.doc file that describes the data in detail, including attribute descriptions.
keywords:
tornado, blowdowns, derecho, invasive plants, Shawnee National Forest, southern Illinois
published: 2022-03-30
Tiemann, Jeremy S.; Stodola, Alison P.; Douglass, Sarah A.; Vinsel, Rachel M.; Cummings, Kevin S. (2022): Dataset associated with Nonindigenous Aquatic Mollusks in Illinois manuscript. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8947838_V1
This dataset is associated with a larger manuscript published in 2022 in the Illinois Natural History Survey Bulletin to summarize all known records for nonindigenous aquatic mollusks in Illinois, and full sources are referenced within the manuscript. We examined museum holdings, literature accounts, publicly available databases sponsored by the U.S. Geological Survey (USGS) - Nonindigenous Aquatic Species program (http://nas.er.usgs.gov/.) and InvertEBase (invertebase.org). We also included sporadic field survey data of encounters of nonindigenous aquatic species from colleagues within the Illinois Natural History Survey, Illinois Department of Natural Resources, U.S. Fish and Wildlife Service, county forest preserve districts, and other natural resource agencies about their encounters with nonindigenous aquatic mollusk species. Lastly, we examined the role and utility of citizen-science data to document occurrences of nonindigenous aquatic mollusk species. We queried iNaturalist (www.inaturalist.org) for all available nonindigenous freshwater mollusk data for Illinois. Table heading descriptions (if not intuitive) are: “INHS verified” is whether an INHS staff member verified the record by observing vouchered specimen or photograph; “Source” is where a record was accessed or obtained; “individualCount” is number collected or observed in a record; “MuseumCode” is standard museum abbreviation or acronym; “Institution” is source that housed or reported a record, and this also includes the spelled-out museum code; “Collectors” typically indicates who collected the specimen or voucher; “Lat_Long determined by” denotes whether collection coordinates were stated by the collector or by a curator (using inference from data available); “fieldNumber” typically indicates a unique field number that a collector may have used in the field; “identifiedBy” typically explains who identified a specimen or verified a specimen identification.
keywords:
Illinois; Exotic species; Non-native aquatic species; NAS; Aquatic Invasive Species; AIS; Mollusk
published: 2009-06-19
Liu, Kevin; Raghavan, Sindhu; Nelesen, Serita; Linder, C. Randall; Warnow, Tandy (2009): Data for Rapid and Accurate Large-Scale Coestimation of Sequence Alignments and Phylogenetic Trees. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5139418_V1
This dataset contains the data for SATe-I. SATe-I data was used in the following article: K. Liu, S. Raghavan, S. Nelesen, C. R. Linder, T. Warnow, "Rapid and Accurate Large-Scale Coestimation of Sequence Alignments and Phylogenetic Trees," Science, vol. 324, no. 5934, pp. 1561-1564, 19 June 2009.