Displaying 51 - 75 of 638 in total

Datasets

planned publication date: 2024-10-16

Smith, Rebecca; Huang, Conghui (2024): Data for A modeling study on SARS-CoV-2 transmission in primary and middle schools in Illinois. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3705306_V1

School testing data were provided by Shield Illinois (ShieldIL), which conducted weekly in-school testing on behalf of the Illinois Department of Public Health (IDPH) for all participating schools in the state excluding Chicago Public Schools. The populations and proportions of students and employees in the studied school districts are reported by Elementary/Secondary Information System (ElSi) database.

keywords: COVID-19; school testing

published: 2024-03-28

Mies, Timothy A. (2024): University of Illinois Urbana-Champaign Energy Farm Multiyear Weather Station Raw Data. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6955306_V1

This dataset contains weather data taken at the University of Illinois Urbana-Champaign Energy Farm using automatic sensors and averaged every 15 minutes. Measurements include average air temperature, average relative humidity, average wind speed, maximum wind speed, average wind direction, average photosynthetically active radiation, total precipitation, and average air pressure.

keywords: air temperature; relative humidity; wind speed; wind direction; photosynthetically active radiation; precipitation; air pressure

published: 2024-03-28

Zhang, Yue; Zhao, Helin; Huang, Siyuan; Hossain, Mohhamad Abir; van der Zande, Arend (2024): Enhancing Carrier Mobility In Monolayer MoS2 Transistors With Process induced Strain. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7519929_V1

Read me file for the data repository ******************************************************************************* This repository has raw data for the publication "Enhancing Carrier Mobility In Monolayer MoS2 Transistors With Process Induced Strain". We arrange the data following the figure in which it first appeared. For all electrical transfer measurement, we provide the up-sweep and down-sweep data, with voltage units in V and conductance unit in S. All Raman modes have unit of cm^-1. ******************************************************************************* How to use this dataset All data in this dataset is stored in binary Numpy array format as .npy file. To read a .npy file: use the Numpy module of the python language, and use np.load() command. Example: suppose the filename is example_data.npy. To load it into a python program, open a Jupyter notebook, or in the python program, run: import numpy as np data = np.load("example_data.npy") Then the example file is stored in the data object. *******************************************************************************

published: 2024-03-25

Xia, Yushu; Kwon, Hoyoung; Wander, Michelle (2024): Soil Nitrous Oxide Emissions Data for Estimating soil N2O emissions induced by organic and inorganic fertilizer inputs using a Tier-2, regression-based meta-analytic approach for U.S. agricultural lands". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9808669_V1

This accompanying study is published under the title "Estimating soil N2O emissions induced by organic and inorganic fertilizer inputs using a Tier-2, regression-based meta-analytic approach for U.S. agricultural lands" at Science of the Total Environment. The study is authored by Dr. Yushu Xia, Dr. Hoyoung Kwon, and Dr. Michelle Wander. The DOI for this study is (TBD). Please refer to the study for detailed data extraction and processing methods.

keywords: soil; nitrous oxide; agriculture; fertilizers; meta-analysis

published: 2024-01-30

Aishwarya, Anuva; Madhavan, Vidya (2024): Data for Melting of the charge density wave by generation of pairs of topological defects in UTe2. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6515700_V1

The data files are for the paper entitled: Melting of the charge density wave by generation of pairs of topological defects in UTe2 to be published in Nature Physics. The data was obtained on a 300 mK custom designed Unisoku scanning tunneling microscope using the Nanonis module. All the data files have been named based on the Figure numbers that they represent.

keywords: superconductivity; triplet; topology; heavy fermion; Kondo; magnetic field; charge density wave

published: 2024-03-25

Mishra, Apratim; Lee, Haejin; Jeoung, Sullam; Torvik, Vetle; Diesner, Jana (2024): Diversity - PubMed Dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5259667_V1

Diversity - PubMed dataset Contact: Apratim Mishra (March 22, 2024) This dataset presents article-level (pmid) and author-level (auid) diversity data for PubMed articles. The selection chosen includes articles retrieved from Authority 2018 [1], a total of 228 040 papers and 440 310 authors. The sample of papers is based on the top 40 journals in the dataset, limited to 2-10 authors published between 1990 – 2010, and stratified on paper count per year. Additionally, this dataset is limited to papers where the lead author is affiliated with one of the four countries: the US, the UK, Canada, and Australia. Files are encoded with ‘utf-8’. ################################################ File1: auids_plos.csv (Important columns defined, 7 in total) • AUID: a unique ID for each author • Ethnea: ethnicity prediction • Genni: gender prediction ################################################# File2: pmids_plos.csv (Important columns defined, 33 in total) • pmid: unique paper ID • year: Year of paper publication • no_authors: Author count • journal: Journal name • years: first year of publication for every author • age_bin: Binned age for every author • Country-temporal: Country of affiliation for every author • h_index: Journal h-index • TimeNovelty: Paper Time novelty [2] • nih_funded: Binary variable indicating NIH funding for any author • prior_cit_mean: Mean of all authors’ prior citation rate • Insti_impact_all: All authors’ respective institutions’ citation count • Insti_impact: Maximum of all institutions’ citation count • mesh_vals: Top MeSH values for every author for that paper • outer_mesh_vals: MeSH qualifiers for every author for that paper • relative_citation_ratio: RCR The ‘Readme’ includes a description for all columns. [1] Torvik, Vetle; Smalheiser, Neil (2021): Author-ity 2018 - PubMed author name disambiguated dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2273402_V1 [2] Mishra, Shubhanshu; Torvik, Vetle I. (2018): Conceptual novelty scores for PubMed articles. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5060298_V1

keywords: Diversity; PubMed; Citation

published: 2024-03-25

Suski, Cory; Dai, Qihong (2024): Data for "Differing physiological performance of coexisting cool- and warmwater fish species under heatwaves in the Midwestern United States". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1022017_V1

This is the dataset for the manuscript titled, "Differing physiological performance of coexisting cool- and warmwater fish species under heatwaves in the Midwestern United States"

keywords: climate change; heat wave; metabolic rate; swimming; predator-prey interaction; thermal tolerance; Sander vitreus; walleye; largemouth bass; species distributions

published: 2024-03-21

Becker, Maria; Han, Kanyao; Werthmann, Antonina; Rezapour, Rezvaneh; Lee, Haejin; Diesner, Jana; Witt, Andreas (2024): TextTransfer: Datasets for Impact Detection. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9934303_V1

Impact assessment is an evolving area of research that aims at measuring and predicting the potential effects of projects or programs. Measuring the impact of scientific research is a vibrant subdomain, closely intertwined with impact assessment. A recurring obstacle pertains to the absence of an efficient framework which can facilitate the analysis of lengthy reports and text labeling. To address this issue, we propose a framework for automatically assessing the impact of scientific research projects by identifying pertinent sections in project reports that indicate the potential impacts. We leverage a mixed-method approach, combining manual annotations with supervised machine learning, to extract these passages from project reports. This is a repository to save datasets and codes related to this project. Please read and cite the following paper if you would like to use the data: Becker M., Han K., Werthmann A., Rezapour R., Lee H., Diesner J., and Witt A. (2024). Detecting Impact Relevant Sections in Scientific Research. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING). This folder contains the following files: evaluation_20220927.ods: Annotated German passages (Artificial Intelligence, Linguistics, and Music) - training data annotated_data.big_set.corrected.txt: Annotated German passages (Mobility) - training data incl_translation_all.csv: Annotated English passages (Artificial Intelligence, Linguistics, and Music) - training data incl_translation_mobility.csv: Annotated German passages (Mobility) - training data ttparagraph_addmob.txt: German corpus (unannotated passages) model_result_extraction.csv: Extracted impact-relevant passages from the German corpus based on the model we trained rf_model.joblib: The random forest model we trained to extract impact-relevant passages Data processing codes can be found at: https://github.com/khan1792/texttransfer

keywords: impact detection; project reports; annotation; mixed-methods; machine learning

published: 2024-03-19

Curtis, Jeffrey H.; Riemer, Nicole; West, Matthew (2024): Data for Explicit stochastic advection algorithms for the regional scale particle-resolved atmospheric aerosol model WRF-PartMC (v1.0). University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3847217_V1

This dataset contains all material required to produce the figures found within the manuscript submitted to Geoscientific Model Development entitled “Explicit stochastic advection algorithms for the regional scale particle-resolved atmospheric aerosol model WRF-PartMC (v1.0)”. The dataset consists of Python Jupyter notebooks and any applicable WRF-PartMC output. This dataset covers the three numerical examples of the manuscript, 1D advection by a uniform constant wind, a 2D rotational flow and a 3D time-evolving WRF simulated flow.

keywords: Atmospheric chemistry; Atmospheric Science; Particle-resolved modeling; Numerical modeling; Advection;

planned publication date: 2025-01-01

Cao, Yanghui; Dietrich, Christopher H.; Dmitriev, Dmitry A.; Kits, Joel H.; Xue, Qingquan; Zhang, Yalin (2025): Datasets for "Phylogeny, Biogeography and Morphological Evolution of the Treehopper-Like Leafhoppers (Hemiptera: Cicadellidae) with Redefinition of Cicadellidae and Revised Status for Megophthalmidae and Ulopidae". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1475719_V1

The following files were used to reconstruct the phylogeny of the Megophthalmidae and Ulopidae. Taxon_sampling.csv: contains the sample IDs (1st column) which were used in the alignments and the taxonomic information (2nd to 6th columns). concatenated_aa_partition.nex: the partitioning schemes for the maximum likelihood analysis using concatenated_aa.phy. This file partitions the 52,474 amino acid positions into 427 character sets. concatenated_aa_.phy: a concatenated amino acid dataset with 52,474 amino acid positions. This dataset was used for the maximum likelihood analysis by IQ-TREE v1.6.12. Hyphens are used to represent gaps. concatenated_nt_partition.nex: the partitioning schemes for the maximum likelihood analysis using concatenated_nt.phy. This file partitions the 158,364 nucleotide positions into 427 character sets. concatenated_nt_.phy: a concatenated nucleotide dataset with 158,364 nucleotide positions. This dataset was used for the maximum likelihood analysis by IQ-TREE v1.6.12. Hyphens are used to represent gaps. Individual_gene_alignment.zip: contains 427 FASTA files, each one represents the nucleotide alignment for a gene. Hyphens are used to represent gaps. These files were used to construct gene trees using IQ-TREE v1.6.12, followed by multispecies coalescent analysis using ASTRAL v 4.10.5 based the consensus trees with a minimum average bootstrap value of 70.

keywords: Cicadellidae; Classification; Phylogenomics; Megophthalminae; Ulopinae

published: 2024-01-31

Wang, Xiudan; Dietrich, Christopher; Zhang, Yalin (2024): Datasets for Phylogeny and historical biogeography of leafhopper subfamily Coelidiinae (Hemiptera: Cicadellidae) based on morphological and molecular data . University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5847605_V1

The included files were used to reconstruct the phylogeny of Coelidiinae using combined morphological and molecular data, estimate divergence times and reconstruct ancestral biogeographic areas as described in the manuscript submitted for publication. The file “Coelidiinae_dna_morph_combined.nex” is a text file in standard NEXUS format used by various phylogenetic analysis programs. This file includes the aligned and concatenated nucleotide sequences or five gene regions (mitochondrial COI and 16S, and nuclear 28S D-2, histone H3, histone H2A and wingless) indicated by standard “ACGT” nucleotide symbols with missing data indicated by “?”, and morphological character data as defined in Table S3 used in the analyses. The data partitions are indicated toward the end of the file by ranges of numbers (“charset Subset 1 – 4” for the DNA data and “charset morph” for the morphological characters) followed by commands for the phylogenetic analysis program MrBayes that specify the model settings for each data partition. Detailed data on species included (as rows) in the dataset, including collection localities and GenBank accession numbers are provided in the Table_S1_Specimen_information.csv file. The file "TablesS2-S4.pdf" lists the primers used for polymerase chain reaction amplification, the list of morphological character definitions, and the morphological character matrix. The file “RASP_Distribution.csv” contains a list of the species included in the phylogenetic dataset (first column) and a code (second column) indicating their distributions as follows: (A) Oriental, (B) Palaearctic, (C) Australian, (D) Afrotropical, (E) Neotropical, and (F) Nearctic. More than one letter indicates that the species occurs in more than one region. The file "infile_for_BEAST.txt" is the input file in XML format used for the molecular divergence time analysis using the program BEAST (Bayesian Evolutionary Analysis by Sampling Trees) as described in the Methods section of the manuscript. This file includes comments that document the steps of the analysis.

keywords: leafhopper; phylogeny; DNA sequence; insect; timetree; biogeography

published: 2024-02-25

Coshic, Kush; Maffeo, Christopher; Winogradoff, David; Aksimentiev, Aleksei (2024): Select trajectories, simulation setup, and analysis for "The structure and physical properties of a packaged bacteriophage particle". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4930709_V1

Simulation trajectory data and scripts for Nature manuscript "The structure and physical properties of a packaged bacteriophage particle" that reports the all-atom structure of a complete HK97 virion, including its entire 39,732 base pair genome, obtained through multi-resolution simulations.

keywords: Virus capsid; Bacteriophage packaging; Multiresolution simulations; all-atom MD simulation

published: 2023-02-23

Peyton, Buddy; Bajjalieh, Joseph; Shalmon, Dan; Martin, Michael; Bonaguro, Jonathan; Soto, Emilio (2023): Cline Center Coup d’État Project Dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9651987_V6

Coups d'État are important events in the life of a country. They constitute an important subset of irregular transfers of political power that can have significant and enduring consequences for national well-being. There are only a limited number of datasets available to study these events (Powell and Thyne 2011, Marshall and Marshall 2019). Seeking to facilitate research on post-WWII coups by compiling a more comprehensive list and categorization of these events, the Cline Center for Advanced Social Research (previously the Cline Center for Democracy) initiated the Coup d'État Project as part of its Societal Infrastructures and Development (SID) project. More specifically, this dataset identifies the outcomes of coup events (i.e. realized or successful coups, unrealized coup attempts, or thwarted conspiracies) the type of actor(s) who initiated the coup (i.e. military, rebels, etc.), as well as the fate of the deposed leader. This current version, Version 2.1.2, adds 6 additional coup events that occurred in 2022 and updates the coding of an attempted coup event in Kazakhstan in January 2022. Version 2.1.1 corrects a mistake in version 2.1.0, where the designation of “dissident coup” had been dropped in error for coup_id: 00201062021. Version 2.1.1 fixes this omission by marking the case as both a dissident coup and an auto-coup. Version 2.1.0 added 36 cases to the data set and removes two cases from the v2.0.0 data. This update also added actor coding for 46 coup events and adds executive outcomes to 18 events from version 2.0.0. A few other changes were made to correct inconsistencies in the coup ID variable and the date of the event. Changes from the previously released data (v2.0.0) also include: 1. Adding additional events and expanding the period covered to 1945-2022 2. Filling in missing actor information 3. Filling in missing information on the outcomes for the incumbent executive 4. Dropping events that were incorrectly coded as coup events Items in this Dataset 1. Cline Center Coup d'État Codebook v.2.1.2 Codebook.pdf - This 16-page document provides a description of the Cline Center Coup d’État Project Dataset. The first section of this codebook provides a summary of the different versions of the data. The second section provides a succinct definition of a coup d’état used by the Coup d’État Project and an overview of the categories used to differentiate the wide array of events that meet the project's definition. It also defines coup outcomes. The third section describes the methodology used to produce the data. Revised February 2023 2. Coup Data v2.1.2.csv - This CSV (Comma Separated Values) file contains all of the coup event data from the Cline Center Coup d’État Project. It contains 29 variables and 981 observations. Revised February 2023 3. Source Document v2.1.2.pdf - This 315-page document provides the sources used for each of the coup events identified in this dataset. Please use the value in the coup_id variable to identify the sources used to identify that particular event. Revised February 2023 4. README.md - This file contains useful information for the user about the dataset. It is a text file written in markdown language. Revised February 2023 Citation Guidelines 1. To cite the codebook (or any other documentation associated with the Cline Center Coup d’État Project Dataset) please use the following citation: Peyton, Buddy, Joseph Bajjalieh, Dan Shalmon, Michael Martin, Jonathan Bonaguro, and Scott Althaus. 2023. “Cline Center Coup d’État Project Dataset Codebook”. Cline Center Coup d’État Project Dataset. Cline Center for Advanced Social Research. V.2.1.2. February 23. University of Illinois Urbana-Champaign. doi: 10.13012/B2IDB-9651987_V6 2. To cite data from the Cline Center Coup d’État Project Dataset please use the following citation (filling in the correct date of access): Peyton, Buddy, Joseph Bajjalieh, Dan Shalmon, Michael Martin, Jonathan Bonaguro, and Emilio Soto. 2023. Cline Center Coup d’État Project Dataset. Cline Center for Advanced Social Research. V.2.1.2. February 23. University of Illinois Urbana-Champaign. doi: 10.13012/B2IDB-9651987_V6

published: 2024-03-09

Mishra, Apratim; Diesner, Jana; Torvik, Vetle I. (2024): Hype - PubMed dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0651259_V1

Hype - PubMed dataset Prepared by Apratim Mishra This dataset captures ‘Hype’ within biomedical abstracts sourced from PubMed. The selection chosen is ‘journal articles’ written in English, published between 1975 and 2019, totaling ~5.2 million. The classification relies on the presence of specific candidate ‘hype words’ and their abstract location. Therefore, each article might have multiple instances in the dataset due to the presence of multiple hype words in different abstract sentences. The candidate hype words are 36 in count: 'major', 'novel', 'central', 'critical', 'essential', 'strongly', 'unique', 'promising', 'markedly', 'excellent', 'crucial', 'robust', 'importantly', 'prominent', 'dramatically', 'favorable', 'vital', 'surprisingly', 'remarkably', 'remarkable', 'definitive', 'pivotal', 'innovative', 'supportive', 'encouraging', 'unprecedented', 'bright', 'enormous', 'exceptional', 'outstanding', 'noteworthy', 'creative', 'assuring', 'reassuring', 'spectacular', and 'hopeful'. File 1: hype_dataset.csv Primary dataset. It has the following columns: 1. PMID: represents unique article ID in PubMed 2. Hype_word: Candidate hype word, such as ‘novel.’ 3. Sentence: Sentence in abstract containing the hype word. 4. Abstract_length: Length of article abstract. 5. Hype_percentile: Abstract relative position of hype word. 6. Hype_value: Propensity of hype based on the hype word, the sentence, and the abstract location. 7. Introduction: The ‘I’ component of the hype word based on IMRaD 8. Methods: The ‘M’ component of the hype word based on IMRaD 9. Results: The ‘R’ component of the hype word based on IMRaD 10. Discussion: The ‘D’ component of the hype word based on IMRaD File 2: hype_removed_phrases.csv Secondary dataset with same columns as File 1. Hype in the primary dataset is based on excluding certain phrases that are rarely hype. The phrases that were removed are included in File 2 and modeled separately. Removed phrases: 1. Major: histocompatibility, component, protein, metabolite, complex, surgery 2. Novel: assay, mutation, antagonist, inhibitor, algorithm, technique, series, method, hybrid 3. Central: catheters, system, design, composite, catheter, pressure, thickness, compartment 4. Critical: compartment, micelle, temperature, incident, solution, ischemia, concentration 5. Essential: medium, features, properties, opportunities 6. Unique: model, amino 7. Robust: regression 8. Vital: capacity, signs, organs, status, structures, staining, rates, cells, information 9. Outstanding: questions, issues, question, challenge, problems, problem, remains 10. Remarkable: properties 11. Definite: radiotherapy, surgery 12. Bright: field

keywords: Hype; PubMed; Abstracts; Biomedicine

published: 2024-03-06

OKeefe, Joy; Bennett, Andrew (2024): Multiplex Metagenomic analyses of North American Bats - DADA2 outputs for Phyloseq. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3079533_V1

These data are the result of analyses of the metagenome of North American bats, including 18s and 16s barcode genes designed to target microorganisms of the gut. These files are Phyloseq import files created by the DADA2 program. Each barcode gene is uploaded separately as the four files required to build a phyloseq object. For each barcode gene, the files include amplicon sequence variant (ASV) sequences, sequence tables (seqtab) which connect individual samples to the ASVs, tax tables (taxtab) which identify the taxa present as determined by a Bayesian RDP classifier, and rooted phylogenetic trees for the ASVs. Additionally, we have included a "sample_data" file which is necessary for sorting of samples across all four sequence analysis data sets by study and species. Some sample information which could identify the location of endangered species has been restricted. Multiple studies are represented in the data which can be accessed using standard methods in the Phyloseq program (e.g. For a study of bats, parasites, and gut microbiome dysregulation by Bennett, Suski, and OKeefe 2024 [in prep March 2024], study specific data can be accessed using the Study variable "DYSBIOMICS." File names include reference to the primer set used to generate them (18s primer sets: G3, G4, G6; 16s primer set: 341F3_806R5).

keywords: metagenomics

published: 2015-12-16

Nguyen, Nam-phuong; Mirarab, Siavash; Kumar, Keerthana; Warnow, Tandy (2015): Data for Ultra-Large Alignments Using Phylogeny-Aware Profiles. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3174395_V1

This dataset contains the data for PASTA and UPP. PASTA data was used in the following articles: Mirarab, Siavash, Nam Nguyen, Sheng Guo, Li-San Wang, Junhyong Kim, and Tandy Warnow. “PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.” Journal of Computational Biology 22, no. 5 (2015): 377–86. doi:10.1089/cmb.2014.0156. Mirarab, Siavash, Nam Nguyen, and Tandy Warnow. “PASTA: Ultra-Large Multiple Sequence Alignment.” Edited by Roded Sharan. Research in Computational Molecular Biology, 2014, 177–91. UPP data was used in: Nguyen, Nam-phuong D., Siavash Mirarab, Keerthana Kumar, and Tandy Warnow. “Ultra-Large Alignments Using Phylogeny-Aware Profiles.” Genome Biology 16, no. 1 (December 16, 2015): 124. doi:10.1186/s13059-015-0688-z.

published: 2017-09-16

Mirarab, Siavash; Warnow, Tandy (2017): Data for 16S and 23S rRNA alignments. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1614388_V1

This dataset contains the data for 16S and 23S rRNA alignments including their reference trees. The original alignments are from the Gutell Lab CRW, currently located at https://crw-site.chemistry.gatech.edu/DAT/3C/Alignment/.

published: 2009-06-19

Liu, Kevin; Raghavan, Sindhu; Nelesen, Serita; Linder, C. Randall; Warnow, Tandy (2009): Data for Rapid and Accurate Large-Scale Coestimation of Sequence Alignments and Phylogenetic Trees. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5139418_V1

This dataset contains the data for SATe-I. SATe-I data was used in the following article: K. Liu, S. Raghavan, S. Nelesen, C. R. Linder, T. Warnow, "Rapid and Accurate Large-Scale Coestimation of Sequence Alignments and Phylogenetic Trees," Science, vol. 324, no. 5934, pp. 1561-1564, 19 June 2009.

published: 2024-02-27

Peyton, Buddy; Bajjalieh, Joseph; Shalmon, Dan; Martin, Michael; Soto, Emilio (2024): Cline Center Coup d’État Project Dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9651987_V7

Coups d'Ètat are important events in the life of a country. They constitute an important subset of irregular transfers of political power that can have significant and enduring consequences for national well-being. There are only a limited number of datasets available to study these events (Powell and Thyne 2011, Marshall and Marshall 2019). Seeking to facilitate research on post-WWII coups by compiling a more comprehensive list and categorization of these events, the Cline Center for Advanced Social Research (previously the Cline Center for Democracy) initiated the Coup d’État Project as part of its Societal Infrastructures and Development (SID) project. More specifically, this dataset identifies the outcomes of coup events (i.e., realized, unrealized, or conspiracy) the type of actor(s) who initiated the coup (i.e., military, rebels, etc.), as well as the fate of the deposed leader. Version 2.1.3 adds 19 additional coup events to the data set, corrects the date of a coup in Tunisia, and reclassifies an attempted coup in Brazil in December 2022 to a conspiracy. Version 2.1.2 added 6 additional coup events that occurred in 2022 and updated the coding of an attempted coup event in Kazakhstan in January 2022. Version 2.1.1 corrected a mistake in version 2.1.0, where the designation of “dissident coup” had been dropped in error for coup_id: 00201062021. Version 2.1.1 fixed this omission by marking the case as both a dissident coup and an auto-coup. Version 2.1.0 added 36 cases to the data set and removed two cases from the v2.0.0 data. This update also added actor coding for 46 coup events and added executive outcomes to 18 events from version 2.0.0. A few other changes were made to correct inconsistencies in the coup ID variable and the date of the event. Version 2.0.0 improved several aspects of the previous version (v1.0.0) and incorporated additional source material to include: • Reconciling missing event data • Removing events with irreconcilable event dates • Removing events with insufficient sourcing (each event needs at least two sources) • Removing events that were inaccurately coded as coup events • Removing variables that fell below the threshold of inter-coder reliability required by the project • Removing the spreadsheet ‘CoupInventory.xls’ because of inadequate attribution and citations in the event summaries • Extending the period covered from 1945-2005 to 1945-2019 • Adding events from Powell and Thyne’s Coup Data (Powell and Thyne, 2011) Items in this Dataset 1. Cline Center Coup d'État Codebook v.2.1.3 Codebook.pdf - This 15-page document describes the Cline Center Coup d’État Project dataset. The first section of this codebook provides a summary of the different versions of the data. The second section provides a succinct definition of a coup d’état used by the Coup d'État Project and an overview of the categories used to differentiate the wide array of events that meet the project's definition. It also defines coup outcomes. The third section describes the methodology used to produce the data. Revised February 2024 2. Coup Data v2.1.3.csv - This CSV (Comma Separated Values) file contains all of the coup event data from the Cline Center Coup d’État Project. It contains 29 variables and 1000 observations. Revised February 2024 3. Source Document v2.1.3.pdf - This 325-page document provides the sources used for each of the coup events identified in this dataset. Please use the value in the coup_id variable to identify the sources used to identify that particular event. Revised February 2024 4. README.md - This file contains useful information for the user about the dataset. It is a text file written in markdown language. Revised February 2024 Citation Guidelines 1. To cite the codebook (or any other documentation associated with the Cline Center Coup d’État Project Dataset) please use the following citation: Peyton, Buddy, Joseph Bajjalieh, Dan Shalmon, Michael Martin, Jonathan Bonaguro, and Scott Althaus. 2024. “Cline Center Coup d’État Project Dataset Codebook”. Cline Center Coup d’État Project Dataset. Cline Center for Advanced Social Research. V.2.1.3. February 27. University of Illinois Urbana-Champaign. doi: 10.13012/B2IDB-9651987_V7 2. To cite data from the Cline Center Coup d’État Project Dataset please use the following citation (filling in the correct date of access): Peyton, Buddy, Joseph Bajjalieh, Dan Shalmon, Michael Martin, Jonathan Bonaguro, and Emilio Soto. 2024. Cline Center Coup d’État Project Dataset. Cline Center for Advanced Social Research. V.2.1.3. February 27. University of Illinois Urbana-Champaign. doi: 10.13012/B2IDB-9651987_V7

published: 2023-12-19

Bush, Daniel; Calla, Bernarda; Berenbaum, May (2023): Data for "An Aspergillus Strain from Bee Bread of the Western Honey Bee (Apis mellifera) Displays Adaptations to Distinctive Features of the Hive Environment". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7212497_V1

Data for the Appendices of Bush et al. article published in Ecology and Evolution. Contains genomic analysis information for a strain of Aspergillus flavus isolated from bee bread in East Central Illinois.

keywords: Excel; UIUC; Evolution and Ecology; Aspergillus flavus; genome

published: 2024-02-15

Hoggatt, Meredith; Starbuck, Clarissa; O'Keefe, Joy (2024): Data for "Acoustic monitoring yields informative bat population density estimates". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7001459_V1

Dataset includes the dataset for estimating bat density from acoustic data and the R code. The data support a publication by Meredith L. Hoggatt, Clarissa A. Starbuck, and Joy M. O'Keefe entitled Acoustic monitoring yields informative bat population density estimates.

keywords: acoustics; bats; monitoring; population density; random encounter model

published: 2024-02-21

Hartman, Jordan H; Corush, Joel B; Larson, Eric R; Tiemann, Jeremy S; Willink, Philip; Davis, Mark A (2024): Data for "Niche conservatism and spread explain hybridization and introgression between native and invasive fish". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6979965_V1

Data associated with the manuscript "Niche conservatism and spread explain hybridization and introgression between native and invasive fish" by Jordan H. Hartman, Joel B. Corush, Eric R. Larson, Jeremy S. Tiemann, Philip Willink, and Mark A. Davis. For this project, we combined results of ecological niche models (ENMs) and next-generation restriction site-associated DNA sequencing (RADseq) to test theories of niche conservatism and biotic resistance on the success of invasion, hybridization, and extent of introgression between native Western Banded Killifish and non-native Eastern Banded Killifish. This dataset provides the sampling locations and number of Banded Killifish in each population, accession numbers for RADseq from the National Center for Biotechnology Information Sequence Read Archive and the assignment of each Banded Killifish, the habitat associations of each population from the ENMs, and the occurrence points used to build the ENMs.

keywords: Banded Killifish; ecological niche model; Fundulus diaphanus; hybrid swarm; invasive species; Laurentian Great Lakes

published: 2024-02-16

Zhang, Mingxiao; Sutton, Bradley (2024): Sample Data for “Measuring CSF Shunt Flow with MRI Using Flow Enhancement of Signal Intensity (FENSI)”. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7252521_V1

Sample data from one typical phantom test and one deidentified shunt patient test (shown in Fig. 8 of the MRM paper), with the corresponding analysis code for the Shunt-FENSI technique. For the MRM paper “Measuring CSF Shunt Flow with MRI Using Flow Enhancement of Signal Intensity (FENSI)”

keywords: Shunt-FENSI; MRM; Hydrocephalus; VP Shunt; Flow Quantification; Pediatric Neurosurgery; Pulse Sequence; Signal Simulation

published: 2024-01-19

Digrado, Anthony; Montes, Christopher; Baxter, Ivan; Ainsworth, Elizabeth (2024): Soybean seed quality response to eCO2 data files. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6453957_V2

This data set is related to a SoyFACE experiment conducted in 2004, 2006, 2007, and 2008 with the soybean cultivars Loda and HS93-4118. The experiment looked at how seed elements were affected by elevated CO2 and yield. In this V2, 2 new files were added per journal requirement. Total there are 5 data files in text format within the digrado_et_al_gcb_data_V2 and 1 readme file. The name of files are listed below. Details about headers are explained in the readme.txt file. 1. ionomic_data.txt file contains the ionomic data (mg/kg) for the two cultivars. The file contains all six technical replicates for each plot. The cultivar, year, treatment, and the plot from which the samples were collected are given for each entry. 2. yield_data.txt file contains the yield data for the two cultivars (seed yield in kg/ha, seed yield in bu/a, Protein (%), Oil (%)). The file contains yield data for every plot. The cultivar, year, treatment, and the plot from which the samples were collected are given for each entry. 3. mineral_pro_oil_yield.txt file contains the yield per hectare for each mineral (g/ha) along with the yield per hectare for protein and oil (t/ha). This was obtained by multiplying the seed content of each element (minerals, protein, and oil) by the total seed yield. The file contains yield data for every plots. The cultivar, year, treatment, and the plot from which the samples were collected are given for each entry. 4. economic_assessment.txt file contains data used to assess the financial impact of altered seed oil content on soybean oil production. 5. meteorological_data.txt file contains the meteorological data recorded by a weather station located ~ 3km from the experimental site (Willard Airport Champaign). Data covering the period between May 28 and September 24 were used for 2004; between May 25 and September 24 were used in 2006; between May 23 and September 17 in 2007; and between June 16 and October 24 in 2008.

keywords: protein; oil; mineral; SoyFACE; nutrient; Glycine max; soybean; yield; CO2; agriculture; climate change

published: 2023-04-06

Yao, Lehan; Lyu, Zhiheng; Li, Jiahui; Chen, Qian (2023): Data for Unsupervised Sinogram Inpainting for Nanoparticle Electron Tomography (UsiNet) for missing wedge correction. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7963044_V1

Example data for https://github.com/chenlabUIUC/UsiNet The data contains computer simulated and experimental tilting series (or sinograms) of gold nanoparticles. Two training data examples are provided: 1. simulated_data.zip 2. experimental_data.zip In each zip folder, we include an image_data.zip and a training_data.zip. The former is for viewing and only the latter is needed for model training. For more details, please refer to our GitHub repository.

keywords: electron tomography; deep learning

Subject Area

Funder

Publication Year

License

Datasets