Illinois Data Bank Dataset Search Results
Results
published:
2021-08-20
von Haden, Adam C.; DeLucia, Evan H.; Yang, Wendy; Burnham, Mark
(2021)
In 2020, early-season extreme precipitation events occurred following the planting of Sorghum bicolor (L.) Moench and Zea mays L. in central Illinois that caused ponding. Following the first rainfall event 50m transects were established to assess the waterlogging effects on seedling emergence and crop yields. Soil moisture, emergence, stem and tiller count, LAI, and yield were measured at various points in the season along these transects.
keywords:
Sorghum; Maize; Emergence; Yield; LAI
published:
2022-02-11
Trivellone, Valeria; Cao, Yanghui; Blackshear, Millon; Kim, Chang-Hyun; Stone, Christopher
(2022)
The Culex_Trivellone_etal.fas fasta file contains the original final sequence alignment used in the haplotype analyses of Trivellone et al. (Frontiers in Public Health, under review). The 492 sequences (from specimens of Culex pipiens complex collected in different habitat types using a BG-sentinel traps) were aligned using PASTA v1.8.5 under default settings. The final dataset contains 686 positions of the cytochrome c oxidase subunit I (COI) mitochondrial gene.
The data analyses are further described in the cited original paper.
keywords:
Culex; Culicidae; COI; mosquito surveillance, species assemblages
published:
2024-03-27
Zheng, Heng; Schneider, Jodi
(2024)
To gather news articles from the web that discuss the Cochrane Review, we used Altmetric Explorer from Altmetric.com and retrieved articles on August 1, 2023. We selected all articles that were written in English, published in the United States, and had a publication date <b>prior to March 10, 2023</b> (according to the “Mention Date” on Altmetric.com). This date is significant as it is when Cochrane issued a statement about the "misleading interpretation" of the Cochrane Review.
The collection of news articles is presented in the Altmetric_data.csv file. The dataset contains the following data that we exported from Altmetric Explorer:
- Publication date of the news article
- Title of the news article
- Source/publication venue of the news article
- URL
- Country
We manually checked and added the following information:
- Whether the article still exists
- Whether the article is accessible
- Whether the article is from the original source
We assigned MAXQDA IDs to the news articles. News articles were assigned the same ID when they were (a) identical or (b) in the case of Article 207, closely paraphrased, paragraph by paragraph. Inaccessible items were assigned a MAXQDA ID based on their "Mention Title".
For each article from Altmetric.com, we first tried to use the Web Collector for MAXQDA to download the article from the website and imported it into MAXQDA (version 22.7.0). If an article could not be retrieved using the Web Collector, we either downloaded the .html file or in the case of Article 128, retrieved it from the NewsBank database through the University of Illinois Library.
We then manually extracted direct quotations from the articles using MAXQDA.
We included surrounding words and sentences, and in one case, a news agency’s commentary, around direct quotations for context where needed. The quotations (with context) are the positions in our analysis.
We also identified who was quoted. We excluded quotations when we could not identify who or what was being quoted. We annotated quotations with codes representing groups (government agencies, other organizations, and research publications) and individuals (authors of the Cochrane Review, government agency representatives, journalists, and other experts such as epidemiologists).
The MAXQDA_data.csv file contains excerpts from the news articles that contain the direct quotations we identified. For each excerpt, we included the following information:
- MAXQDA ID of the document from which the excerpt originates;
- The collection date and source of the document;
- The code with which the excerpt is annotated;
- The code category;
- The excerpt itself.
keywords:
altmetrics; MAXQDA; polylogue analysis; masks for COVID-19; scientific controversies; news articles
published:
2021-11-18
Pan, Chao; Tabatabaei, S Kasra; Tabatabaei Yazdi, S. M. Hossein; Hernandez, Alvaro; Schroeder, Charles; Milenkovic, Olgica
(2021)
This dataset contains sequencing data obtained from Illumina MiSeq device to prove the concept of the proposed 2DDNA framework. Please refer to README.txt for detailed description of each file.
keywords:
machine learning;image processing;computer vision;rewritable storage system;2D DNA-based data storage
published:
2025-01-31
Punyasena, Surangi W.; Romero, Ingrid; Urban, Michael A.
(2025)
Title: Airyscan confocal superresolution images of extant Malvaceae pollen with a focus on Bombacoideae
Authors: Surangi W. Punyasena, Ingrid Romero, Michael A. Urban
Subject: Biological sciences
Keywords: Malvaceae; superresolution microscopy; Zeiss; Bombacacidites; Neotropics; CZI
Funder: NSF-DBI Advances in Bioinformatics (NSF-DBI-1262561)
Corresponding Creator: Surangi W. Punyasena
This dataset includes a total of 430 images of extant specimens of the Malvaceae, with a focus on species that are or have been included within the subfamily Bombacoideae. There are 27 genera included within 26 folders. Each folder is named by genus and contains all the images that correspond to that genus. Note that the genus _Matisia_ is included with _Quararibea_ as detailed in the metadata READ ME file.
The specimens imaged are from the palynological collections of the Swedish Museum of Natural History and Smithsonian Tropical Research Institute, and herbarium specimens from the Smithsonian Herbarium National Museum.
The optical superresolution microscopy images were taken using a Zeiss LSM 880 with Airyscan at 630X magnification (63x/NA 1.4 oil DIC). The images are in the original CZI file format. They can be opened using Zeiss propriety software (Zen, Zen lite) or in ImageJ/FIJI. More information on how to open CZI files can be found here: [https://www.zeiss.com/microscopy/en/products/software/zeiss-zen/czi-image-file-format.html]
Image metadata and file organization are described in the CSV file "METADATA_Malvaceae_Bombacoideae_modern-species.csv". The column headings are:
Folder The folder in which the image file is found
Subfamily The current subfamily determination based on the literature. Note that _Pentaplaris_ and _Septotheca_ have not been assigned a subfamily.
Genus Genus name
Species Species name
Accepted name Accepted species name, updated from the literature
Slide name Species name as denoted on the herbarium slide
Collection Source of the herbarium slide: Sweden National Museum of Natural History or the Smithsonian Tropical Research Institute
File name File name using the species name denoted on the herbarium slide
Slide ID/Herbarium ID Specimen collection number
Please cite this dataset as:
Punyasena, Surangi W.; Romero, Ingrid; Urban, Michael A. (2025): Airyscan confocal superresolution images of extant Malvaceae pollen with a focus on Bombacoideae. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-2968712_V1
keywords:
Malvaceae; superresolution microscopy; Zeiss; Bombacoideae; Neotropics; CZI
published:
2015-12-16
Nguyen, Nam-phuong; Mirarab, Siavash; Kumar, Keerthana; Warnow, Tandy
(2015)
This dataset contains the data for PASTA and UPP.
PASTA data was used in the following articles:
Mirarab, Siavash, Nam Nguyen, Sheng Guo, Li-San Wang, Junhyong Kim, and Tandy Warnow. “PASTA: Ultra-Large Multiple Sequence Alignment for Nucleotide and Amino-Acid Sequences.” Journal of Computational Biology 22, no. 5 (2015): 377–86. doi:10.1089/cmb.2014.0156.
Mirarab, Siavash, Nam Nguyen, and Tandy Warnow. “PASTA: Ultra-Large Multiple Sequence Alignment.” Edited by Roded Sharan. Research in Computational Molecular Biology, 2014, 177–91.
UPP data was used in:
Nguyen, Nam-phuong D., Siavash Mirarab, Keerthana Kumar, and Tandy Warnow. “Ultra-Large Alignments Using Phylogeny-Aware Profiles.” Genome Biology 16, no. 1 (December 16, 2015): 124. doi:10.1186/s13059-015-0688-z.
published:
2025-10-30
Koh, Hyun Gi; Kim, Jinhong; Rao, Christopher V.; Park, Sung-Jin; Jin, Yong-Su
(2025)
A small and efficient DNA mutation-inducing machine was constructed with an array of microplasma jet devices (7 × 1) that can be operated at atmospheric pressure for microbial mutagenesis. Using this machine, we report disruption of a plasmid DNA and generation of mutants of an oleaginous yeast Rhodosporidium toruloides. Specifically, a compact-sized microplasma channel (25 × 20 × 2 mm3) capable of generating an electron density of greater than 1013 cm–3 was constructed to produce reactive species (N2*, N2+, O, OH, and Hα) under helium atmospheric conditions to induce DNA mutagenesis. The length of microplasma channels in the device played a critical role in augmenting both the volume of plasma and the concentration of reactive species. First, we confirmed that microplasma treatment can linearize a plasmid by creating nicks in vitro. Second, we treated R. toruloides cells with a jet device containing 7 microchannels for 5 min; 94.8% of the treated cells were killed, and 0.44% of surviving cells showed different colony colors as compared to their parental colony. Microplasma-based DNA mutation is energy-efficient and can be a safe alternative for inducing mutations compared to conventional methods using toxic mutagens. This compact and scalable device is amenable for industrial strain improvement involving large-scale mutagenesis.
keywords:
Conversion;Genome Engineering
published:
2020-02-12
Asplund, Joshua; Karahalios, Karrie
(2020)
This dataset contains the results of a three month audit of housing advertisements. It accompanies the 2020 ICWSM paper "Auditing Race and Gender Discrimination in Online Housing Markets". It covers data collected between Dec 7, 2018 and March 19, 2019.
There are two json files in the dataset: The first contains a list of json objects representing advertisements separated by newlines. Each object includes the date and time it was collected, the image and title (if collected) of the ad, the page on which it was displayed, and the training treatment it received. The second file is a list of json objects representing a visit to a housing lister separated by newlines. Each object contains the url, training treatment applied, the location searched, and the metadata of the top sites scraped. This metadata includes location, price, and number of rooms.
The dataset also includes the raw images of ads collected in order to code them by interest and targeting. These were captured by selenium and named using a perceptive hash to de-duplicate images.
keywords:
algorithmic audit; advertisement audit;
published:
2020-05-31
Zhang, Chuanyi; El-Kebir, Mohammed; Ochoa, Idoia
(2020)
This repository includes a simulated dataset and related scripts used for the paper "Moss: Accurate Single-Nucleotide Variant Calling from Multiple Bulk DNA Tumor Samples".
keywords:
Somatic Mutations; Bulk DNA Sequencing; Cancer Genomics
published:
2022-02-14
Yao, Yu; Curtis, Jeffrey; Ching, Joseph; Zheng, Zhonghua; Riemer, Nicole
(2022)
This dataset contains simulation results from numerical model PartMC-MOSAIC used in the article "Quantifying the effects of mixing state on aerosol optical properties". This article is submitted to the journal Atmospheric Physics and Chemistry. There are total 100 scenario directories in this dataset, denoted from 00-99. Each scenario contains 25 NetCDF files hourly output from PartMC-MOSAIC simulations containing the simulated gas and particle information.
The data was produced using version 2.5.0 of PartMC-MOSAIC. Instructions to compile and run PartMC-MOSAIC are available at https://github.com/compdyn/partmc. The chemistry code MOSAIC is available by request from Rahul.Zaveri@pnl.gov. For more details of reproducing the cases, please contact nriemer@illinois.edu and yuyao3@illinois.edu.
keywords:
Aerosol mixing state; Aerosol optical properties; Mie calculation; Black Carbon
published:
2025-06-22
Stickley, Samuel; Crawford, John; Peterman, William; Fraterrigo, Jennifer
(2025)
keywords:
terrestrial salamanders, microhabitat, physiology, mechanistic models, ecological niche models, climate change, Great Smoky Mountains National Park
published:
2025-11-24
Maitra, Shraddha; Cheng, Ming-Hsun; Liu, Hui; Cao, Viet Dang; Kannan, Baskaran; Long, Stephen; Shanklin, John; Altpeter, Fredy; Singh, Vijay
(2025)
Development of sustainable and scalable technologies to convert lignocellulosic biomass to biofuels is critical to achieving carbon neutrality. The potential of transgenic bioenergy crops as a renewable source of sugars and lipids has been demonstrated at bench-scale. However, scaling up these processes is important for holistic analysis. Here proof-of-concept for chemical-free hydrothermal pretreatment of transgenic energycane-oilcane line L13 at an industrially relevant scale to recover vegetative lipids along with cellulosic sugars is presented. Pilot-scale processing of 97 kg of transgenic energycane-oilcane L13 stems and high solids pretreatment of bagasse enhanced the recovery of cellulosic glucose and xylose by 5-fold as compared to untreated bagasse and helped in the enrichment of vegetative lipids in the biomass residues which allowed its recovery at the end of the bioprocess. Palmitic and oleic acids were the predominant fatty acids (FAs) extracted from stems and leaves. The processing did not affect lipid composition. The efficiency of lipid recovery from untreated biomass was 75.9% which improved to 88.7% upon pretreatment. The vegetative tissues of transgenic energycane-oilcane L13 contained 0.42 metric tons/hectare of lipids. Processing vegetative tissues yielded 0.38 metric tons/hectare of lipids. This approaches an oil yield similar to soybean (global average 0.44 metric tons/hectare) and is almost twice as high as the oil yield from sugarcane engineered to hyperaccumulate lipids (0.20 metric tons/hectare). The study suggests that further optimization by state-of-the-art metabolic engineering and biomass processing can establish transgenic bioenergy crops for commercial drop-in fuel production.
keywords:
Conversion;Feedstock Production;Biomass Analytics;Energycane;Lipidomics
published:
2020-06-06
Zaya, David N.; Leicht-Young, Stacey A.; Pavlovic, Noel B.; Ashley, Mary V.
(2020)
These data are from an observational study and small experiment investigating reproductive biology and hybridization between two plants, Celastrus scandens L. and Celastrus orbiculatus Thunb. (Celastraceae). These data were collected during the 2008 growing season from the Indiana Dunes National Park (formerly Indiana Dunes National Lakeshore), just east of the municipality of Ogden Dunes, Indiana, USA. The five data files provide information on floral output of the two species, fertilization rate, fruit set rate, hybridization rate at two scales (individual flowers in both species, individual maternal plants in C. scandens), and the results of a hand-pollination experiment that exchanged pollen between the two species.
There are six data files associated with this submission, five data files in comma-separated values format and one text file (‘readme.txt’) that includes detailed explanations of the data files.
keywords:
Celastrus; invasive species; hybridization; heterospecific pollen; hand pollination
published:
2021-04-06
Hadley, Daniel; Abrams, Daniel; Mannix, Devin; Cullen, Cecilia
(2021)
These datasets contain modeling files and GIS data associated with a risk assessment study for the Cambrian-Ordovician sandstone aquifer system in Illinois from predevelopment (1863) to the year 2070. Modeling work was completed using the Illinois Groundwater Flow Model, a regional MODFLOW model developed for water supply planning in Illinois, as a base model. The model is run using the graphical user interface Groundwater Vistas 7.0. The development and technical details of the base Illinois Groundwater Flow Model, including hydraulic property zonation, boundary conditions, hydrostratigraphy, solver settings, and discretization, are described in Abrams et al. (2018). Modifications to this base model (the version presented here) are described in Mannix et al. (2018), Hadley et al. (2020) and Abrams and Cullen (2020). Modifications include removal of particular multi-aquifer wells to improve calibration, changing Sandwich Fault Zone properties to achieve calibration at production wells within and near the fault zone, and the incorporation of demand scenarios based on a participatory modeling project with the Southwest Water Planning Group.
The zipped folder of model files contains MODFLOW input (package) files, Groundwater Vistas files, and a head file for the entire model run. The zipped folder of GIS data contains rasters of: simulated drawdown in the St. Peter sandstone from predevelopment to 2018, simulated drawdown in the Ironton-Galesville sandstone from predevelopment to 2018, simulated head difference between the St. Peter and Ironton-Galesville sandstone units in 2018, simulated head above the top of the St. Peter sandstone for the years 2029, 2050, and 2070, and simulated head above the top of the Ironton-Galesville sandstone for the years 2029, 2050, and 2070. Raster outputs were derived directly from the simulated heads in the Illinois Groundwater Flow Model. Rasters are clipped to the 8 county northeastern Illinois region (Cook, DuPage, Grundy, Kane, Kendall, Lake, McHenry, and Will counties).
Well names, historic and current head targets, and spatial offsets for the Illinois Groundwater Flow Model are available upon request via a data license agreement. Please contact authors to set this up if needed.
keywords:
groundwater; aquifer; sandstone aquifer; risk assessment; depletion; Illinois; MODFLOW; modeling
published:
2023-04-05
Hartman, Jordan H. ; Tiemann, Jeremy S. ; Sherwood, Joshua L.; Willink, Philip W.; Ash, Kurt T. ; Davis, Mark A. ; Larson, Eric
(2023)
Data associated with the manuscript "Eastern banded killifish (Fundulus diaphanus diaphanus) in Lake Michigan and connected watersheds: the invasion of a non-native subspecies" by Jordan H. Hartman, Jeremy S. Tiemann, Joshua L. Sherwood, Philip W. Willink, Kurt T. Ash, Mark A. Davis, and Eric R. Larson. For this project, we sampled 109 locations in Lake Michigan and connected waters and found 821 total banded killifish. Using mitochondrial DNA analysis, we found 31 eastern and 25 western haplotypes which split our banded killifish into 422 eastern banded killifish and 398 western banded killifish. This dataset provides the sampling locations, banded killifish haplotypes, frequency of those haplotypes per location, accession numbers in GenBank, and the associated mitochondrial DNA sequences.
keywords:
intraspecific invasion; Lake Michigan; mtDNA; native transplant
published:
2014-10-29
Nguyen, Nam-phuong; Mirarab, Siavash; Bo, Liu; Pop, Mihai; Warnow, Tandy
(2014)
This dataset provides the data for Nguyen, Nam-phuong, et al. "TIPP: taxonomic identification and phylogenetic profiling." Bioinformatics 30.24 (2014): 3548-3555.
published:
2024-08-13
Maffeo, Christopher; Chhabra, Hemani; Aksimentiev, Aleksei
(2024)
Scripts used to computationally estimate the current through a DNA nanopore, starting from an equilibrated oxDNA configuration, in association with the manuscript "A lumen-tunable triangular DNA nanopore for molecular sensing and cross-membrane transport".
keywords:
DNA origami nanopore; Steric exclusion model; Ionic current
published:
2024-01-04
Kim, Hyunchul; Zhao, Helin; van der Zande, Arend
(2024)
This data set includes all of data related to stretchable TFTs based on 2D heterostructures including optical images of TFTs, Raman and Photoluminescence characteristics data, Transport measurement data, and AFM topography data.
Abstract
Two-dimensional (2D) materials are outstanding candidates for stretchable electronics, but a significant challenge is their heterogeneous integration into stretchable geometries on soft substrates. Here, we demonstrate a strategy for stretchable thin film transistors (2D S-TFT) based on wrinkled heterostructures on elastomer substrates where 2D materials formed the gate, source, drain, and channel, and characterized them with Raman spectroscopy and transport measurements.
keywords:
2D materials; 2D heterstructures; Stretchable electronics; transistors; buckling engineering
published:
2019-11-11
Molloy, Erin K.; Warnow, Tandy
(2019)
This repository includes scripts and datasets for the paper, "FastMulRFS: Fast and accurate species tree estimation under generic gene duplication and loss models."
Note: The results from estimating species trees with ASTRID-multi (included in this repository) are *not* included in the FastMulRFS paper. We estimated species trees with ASTRID-multi in the fall of 2019, but ASTRID-multi had an important bug fix in January 2020. Therefore, the ASTRID-multi species trees in this repository should be ignored.
keywords:
Species tree estimation; gene duplication and loss; statistical consistency; MulRF, FastRFS
published:
2019-07-26
Buckles, Brittany J; Harmon-Threatt, Alexandra
(2019)
Data used in paper published in the Journal of Applied Ecology titled " Bee diversity in tallgrass prairies affected by management and its effects on above- and below-ground resources"
Bee Community file contains info on bees sampled in each site. The first column contain the Tallgrass Prairie Sites sampled all additional columns contain the bee species name in the first row and all individuals recorded.
Plant Community file contains info on plants sampled in each site. The first column contain the Tallgrass Prairie Sites sampled all additional columns contain the plant species name in the first row and all individuals recorded.
Soil PC1 file contains the soil PC1 values used in the analyses. The first column contain the Tallgrass Prairie Sites sampled, the second column contains the calculated soil PC1 values.
keywords:
bee; community; tallgrass prairie; grazing
published:
2023-03-08
Majeed, Fahd; Khanna, Madhu
(2023)
A stochastic domination analysis model was developed to examine the effect that emerging carbon markets can have on the spatially varying returns and risk profiles of bioenergy crops relative to conventional crops. The code is written in MATLAB, and includes the calculated output.
See the README file for instructions to run the code.
keywords:
bioenergy crops; economic modeling; stochastic domination analysis model;
published:
2024-04-18
Liao, Ling-Hsiu; Wu, Wen-Yen; Berenbaum, May
(2024)
Data: Variation in pesticide toxicity in the western honey bee (Apis mellifera) associated with consuming phytochemically different monofloral honeys
Includes:
Identification and quantification of phenolic components of honeys:
Raw_data_JOCE.xlsx – sheet: “HoneyPhytochemicals”
Effects of honey phytochemicals on acute pesticide toxicity:
Raw_data_JOCE.xlsx – sheet: “raw_LD50
Raw_data_JOCE.xlsx – sheet: “raw_LD50_hive_based”
keywords:
Honey; honey bee; phenolic acid; flavonoids; bifenthrin; LD50
published:
2025-11-24
Mohebalhojeh, Matin; Frederick, Samuel; Riemer, Nicole; West, Matthew
(2025)
This dataset contains all the data and notebook files required to generate the figures presented in the manuscript “A metric for quantifying spatial heterogeneity in gridded atmospheric fields”, submitted to Earth and Space Science. The compressed folder Data.tar.gz contains the subdirectories "Emissions_data" and "Coagulation_simulation_data", which consist of netCDF files for EPA emissions and idealized coagulation simulation outputs, respectively. The compressed folder Notebooks contains three Python Jupyter notebooks used to generate the figures in Sections 3 and 4 of the manuscript, along with the necessary functions and classes.
keywords:
Spatial heterogeneity; Atmospheric Science; Aerosols
published:
2020-02-23
Ye, Di; Hill, Alison; Whitehorn (Fulton), Ashley; Schneider, Jodi
(2020)
Citation context annotation for papers citing retracted paper Matsuyama 2005 (RETRACTED: Matsuyama W, Mitsuyama H, Watanabe M, Oonakahara KI, Higashimoto I, Osame M, Arimura K. Effects of omega-3 polyunsaturated fatty acids on inflammatory markers in COPD. Chest. 2005 Dec 1;128(6):3817-27.), retracted in 2008 (Retraction in: Chest (2008) 134:4 (893) <a href="https://doi.org/10.1016/S0012-3692(08)60339-6">https://doi.org/10.1016/S0012-3692(08)60339-6<a/> ). This is part of the supplemental data for Jodi Schneider, Di Ye, Alison Hill, and Ashley Whitehorn. "Continued Citation of a Fraudulent Clinical Trial Report, Eleven Years after it was retracted for Falsifying Data" [R&R under review with Scientometrics].
Overall we found 148 citations to the retracted paper from 2006 to 2019, However, this dataset does not include the annotations described in the 2015. in Ashley Fulton, Alison Coates, Marie Williams, Peter Howe, and Alison Hill. "Persistent citation of the only published randomized controlled trial of omega-3 supplementation in chronic obstructive pulmonary disease six years after its retraction." Publications 3, no. 1 (2015): 17-26.
In this dataset 70 new and newly found citations are listed: 66 annotated citations and 4 pending citations (non-annotated since we don't have full-text).
"New citations" refer to articles published from March 25, 2014 to 2019, found in Google Scholar and Web of Science.
"Newly found citations" refer articles published 2006-2013, found in Google Scholar and Web of Science, but not previously covered in Ashley Fulton, Alison Coates, Marie Williams, Peter Howe, and Alison Hill. "Persistent citation of the only published randomised controlled trial of omega-3 supplementation in chronic obstructive pulmonary disease six years after its retraction." Publications 3, no. 1 (2015): 17-26.
NOTES:
This is Unicode data. Some publication titles & quotes are in non-Latin characters and they may contain commas, quotation marks, etc.
FILES/FILE FORMATS
Same data in two formats:
2006-2019-new-citation-contexts-to-Matsuyama.csv - Unicode CSV (preservation format only)
2006-2019-new-citation-contexts-to-Matsuyama.xlsx - Excel workbook (preferred format)
ROW EXPLANATIONS
70 rows of data - one citing publication per row
COLUMN HEADER EXPLANATIONS
Note - processing notes
Annotation pending - Y or blank
Year Published - publication year
ID - ID corresponding to the network analysis. See Ye, Di; Schneider, Jodi (2019): Network of First and Second-generation citations to Matsuyama 2005 from Google
Scholar and Web of Science. University of Illinois at Urbana-Champaign. <a href="https://doi.org/10.13012/B2IDB-1403534_V2">https://doi.org/10.13012/B2IDB-1403534_V2</a>
Title - item title (some have non-Latin characters, commas, etc.)
Official Translated Title - item title in English, as listed in the publication
Machine Translated Title - item title in English, translated by Google Scholar
Language - publication language
Type - publication type (e.g., bachelor's thesis, blog post, book chapter, clinical guidelines, Cochrane Review, consumer-oriented evidence summary, continuing education journal article, journal article, letter to the editor, magazine article, Master's thesis, patent, Ph.D. thesis, textbook chapter, training module)
Book title for book chapters - Only for a book chapter - the book title
University for theses - for bachelor's thesis, Master's thesis, Ph.D. thesis - the associated university
Pre/Post Retraction - "Pre" for 2006-2008 (means published before the October 2008 retraction notice or in the 2 months afterwards); "Post" for 2009-2019 (considered post-retraction for our analysis)
Identifier where relevant - ISBN, Patent ID, PMID (only for items we considered hard to find/identify, e.g. those without a DOI-based URL)
URL where available - URL, ideally a DOI-based URL
Reference number/style - reference
Only in bibliography - Y or blank
Acknowledged - If annotated, Y, Not relevant as retraction not published yet, or N (blank otherwise)
Positive / "Poor Research" (Negative) - P for positive, N for negative if annotated; blank otherwise
Human translated quotations - Y or blank; blank means Google scholar was used to translate quotations for Translated Quotation X
Specific/in passing (overall) - Specific if any of the 5 quotations are specific [aggregates Specific / In Passing (Quotation X)]
Quotation 1 - First quotation (or blank) (includes non-Latin characters in some cases)
Translated Quotation 1 - English translation of "Quotation 1" (or blank)
Specific / In Passing (Quotation 1) - Specific if "Quotation 1" refers to methods or results of the Matsuyama paper (or blank)
What is referenced from Matsuyama (Quotation 1) - Methods; Results; or Methods and Results - blank if "Quotation 1" not specific, no associated quotation, or not yet annotated
Quotation 2 - Second quotation (includes non-Latin characters in some cases)
Translated Quotation 2 - English translation of "Quotation 2"
Specific / In Passing (Quotation 2) - Specific if "Quotation 2" refers to methods or results of the Matsuyama paper (or blank)
What is referenced from Matsuyama (Quotation 2) - Methods; Results; or Methods and Results - blank if "Quotation 2" not specific, no associated quotation, or not yet annotated
Quotation 3 - Third quotation (includes non-Latin characters in some cases)
Translated Quotation 3 - English translation of "Quotation 3"
Specific / In Passing (Quotation 3) - Specific if "Quotation 3" refers to methods or results of the Matsuyama paper (or blank)
What is referenced from Matsuyama (Quotation 3) - Methods; Results; or Methods and Results - blank if "Quotation 3" not specific, no associated quotation, or not yet annotated
Quotation 4 - Fourth quotation (includes non-Latin characters in some cases)
Translated Quotation 4 - English translation of "Quotation 4"
Specific / In Passing (Quotation 4) - Specific if "Quotation 4" refers to methods or results of the Matsuyama paper (or blank)
What is referenced from Matsuyama (Quotation 4) - Methods; Results; or Methods and Results - blank if "Quotation 4" not specific, no associated quotation, or not yet annotated
Quotation 5 - Fifth quotation (includes non-Latin characters in some cases)
Translated Quotation 5 - English translation of "Quotation 5"
Specific / In Passing (Quotation 5) - Specific if "Quotation 5" refers to methods or results of the Matsuyama paper (or blank)
What is referenced from Matsuyama (Quotation 5) - Methods; Results; or Methods and Results - blank if "Quotation 5" not specific, no associated quotation, or not yet annotated
Further Notes - additional notes
keywords:
citation context annotation, retraction, diffusion of retraction
published:
2024-08-06
Xing, Yuqing; Bae, Seokjin; Madhavan, Vidya
(2024)
This is the raw topographies (without linear background subtraction) related to the publication: https://www.nature.com/articles/s41586-024-07519-5