Displaying 301 - 325 of 886 in total

Illinois Data Bank Dataset Search Results

Dataset Search Results

published: 2020-08-01

Rhoads, Bruce ; Lewis, Quinn; Sukhodolov, Alexander; Constantinescu, George (2020): Mixing Data for Three Small Confluences in East Central Illinois. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1255710_V1

This data set includes information used to determine patterns of mixing at three small confluences in East Central Illinois based on differences in the temperature or turbidity of the two confluent flows.

keywords: mixing; confluences; flow structure

published: 2020-10-15

Khanna, Madhu; Wang, Weiwei; Wang, Michael (2020): BEPAM Model Code and CABBI Simulation Results for "Assessing the Additional Carbon Savings with Biofuel". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4272529_V1

This dataset consists of various input data that are used in the GAMS model. All the data are in the format of .inc which can be read within GAMS or Notepad. Main data sources include: acreage data (acre), crop budget data ($/acre), crop yield data (e.g. bushel/acre), Soil carbon sequestration data (KgCO2/ha/yr). Model details can be found in the "Assessing the Additional Carbon Savings with Biofuel" and GAMS model package. ## File Description (1) GAMS Model.zip: This includes all the input files and scripts for running the model (2) Table*.csv: These files include the data from the tables in the manuscript (3) Figure2_3_4.csv: This contains the data used to create the figures in the manuscript (4) BaselineResults.csv: This includes a summary of the model results. (5) SensitivityResults_*.csv: Model results from the various sensitivity analyses performed (6) LUC_emission.csv: land use change emissions by crop reporting district for changes of pasturelands to annual crops.

keywords: Biogenic carbon intensity; Corn ethanol; Economic model; Dynamic optimization; Anticipated baseline approach; Life cycle carbon intenisty

published: 2020-10-14

Dalling, James W.; Heineman, Katherine D. (2020): Multiple stem and environmental variables dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4045402_V1

Data on permanent plots at Fortuna and the Panama Canal Watershed, Republic of Panama, containing counts and percent of trees with one or more multiple stems >10cm diameter, with and without palms. Accompanying environmental data includes elevation, precipitation, soil type and soil chemical variables (pH, total N, NO3, NO4, resin P, mehlich Ca, K and Mg.

keywords: multiple stems; resprouting; Panama Canal Watershed; Fortuna Forest Reserve

published: 2021-02-18

Wang, Shaowen; Lyu, Fangzheng; Wang, Shaohua; Catlet, Charles; Padmanabhan, Anand; Soltani, Kiumars (2021): Data for Integrating CyberGIS and Urban Sensing for Reproducible Streaming Analytics. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0286574_V1

Increasingly pervasive location-aware sensors interconnected with rapidly advancing wireless network services are motivating the development of near-real-time urban analytics. This development has revealed both tremendous challenges and opportunities for scientific innovation and discovery. However, state-of-the-art urban discovery and innovation are not well equipped to resolve the challenges of such analytics, which in turn limits new research questions from being asked and answered. Specifically, commonly used urban analytics capabilities are typically designed to handle, process, and analyze static datasets that can be treated as map layers and are consequently ill-equipped in (a) resolving the volume and velocity of urban big data; (b) meeting the computing requirements for processing, analyzing, and visualizing these datasets; and (c) providing concurrent online access to such analytics. To tackle these challenges, we have developed a novel cyberGIS framework that includes computationally reproducible approaches to streaming urban analytics. This framework is based on CyberGIS-Jupyter, through integration of cyberGIS and real-time urban sensing, for achieving capabilities that have previously been unavailable toward helping cities solve challenging urban informatics problems. The files included in this dataset functions as follows: 1) Spatial_interpolation.ipynb is a python based Jupyter notebook that enables users to conduct spatial interpolation with AoT data; 2) Urban_Informatics.ipynb is a Jupyter notebook that helps to explore the AoT dataset; 3) chicago-complete.weekly.2019-09-30-to-2019-10-06.tar includes all the high-frequency urban sensing data from AoT sensors from 2019 September 30th to 2019 October 6th collected in Chicago, US; 4) sensors.csv is a processed dataset including information about the temperature in Chicago, and it is used in Spatial_interpolation.ipynb.

keywords: CyberGIS; Urban informatics; Array of Things

published: 2020-05-17

Mishra, Sudhanshu; Prasad, Shivangi; Mishra, Shubhanshu (2020): Trained models for Multilingual Joint Fine-tuning of Transformer models for identifying Trolling, Aggression and Cyberbullying at TRAC 2020. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8882752_V1

Models and predictions for submission to TRAC - 2020 Second Workshop on Trolling, Aggression and Cyberbullying Our approach is described in our paper titled: Mishra, Sudhanshu, Shivangi Prasad, and Shubhanshu Mishra. 2020. “Multilingual Joint Fine-Tuning of Transformer Models for Identifying Trolling, Aggression and Cyberbullying at TRAC 2020.” In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying (TRAC-2020). The source code for training this model and more details can be found on our code repository: https://github.com/socialmediaie/TRAC2020 NOTE: These models are retrained for uploading here after our submission so the evaluation measures may be slightly different from the ones reported in the paper.

keywords: Social Media; Trolling; Aggression; Cyberbullying; text classification; natural language processing; deep learning; open source;

published: 2020-05-12

Eick, Brian (2020): Acceleration and strain data for free vibration of pre-tensioned, partially-submerged beams. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7897650_V1

The data provided herein is accelerometer and strain data taken from free vibration response of pre-tensioned, partially submerged steel beam specimens (modulus of elasticity assumed = 29,000 ksi). The specimens were subjected to various levels of pre-tension, and various levels of submersion in water. The purpose of the testing was to quantify the effects of partial submersion on the vibrating frequencies of pretensioned beams. Three specimens were tested, each with different cross section (but identical cross-sectional area). The different cross sections allow investigation of the effects of specimen width as the specimen vibrates through water. The testing procedure was as follows: 1) Apply a specified level of tension in the beam. Measure tension via 3 strain gages. 2) Submerge the specimens to a specified depth of water 3) Excite the beams with either a hammer impact or a pull-and-release method (physically pull the middle of the bar and quickly release) 4) Measure the free vibration of the beam with 2 accelerometers. Schematic drawings of the test setup and the test specimens are provided, as is a picture of the test setup.

keywords: free vibration; beam; partially-submerged; prestressed;

published: 2020-05-30

Long, Stephen Patrick (2020): Original data for "Light, Not Age, Underlies the Q9 Maladaptation of Maize and Miscanthus Photosynthesis to Self-Shading". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4821336_V1

Original leaf gas exchange and absorptance data used in the Collison et al. (2020) Light, Not Age, Underlies the Q9 Maladaptation of Maize and Miscanthus Photosynthesis to Self-Shading - Frontiers in Plant Science doi: 10.3389/fpls.2020.00783

keywords: C4 photosynthesis; canopy; bioenergy; food security; quantum yield; shade acclimation; photosynthetic light-use efficiency; leaf aging

published: 2020-07-01

Rykhlevskii, Andrei; Huff, Kathryn D. (2020): SaltProc output for TAP MSR and MSBR online reprocessing depletion simulations. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7364919_V1

keywords: molten salt; fuel cycle; reprocessing; refueling

published: 2020-07-15

Molloy, Erin K. (2020): Data from: Supertree-like methods for genome-scale species tree estimation. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4004605_V1

This repository includes scripts and datasets for Chapter 6 of my PhD dissertation, " Supertree-like methods for genome-scale species tree estimation," that had not been published previously. This chapter is based on the article: Molloy, E.K. and Warnow, T. "FastMulRFS: Fast and accurate species tree estimation under generic gene duplication and loss models." Bioinformatics, In press. https://doi.org/10.1093/bioinformatics/btaa444. The results presented in my PhD dissertation differ from those in the Bioinformatics article, because I re-estimated species trees using FastMulRF and MulRF on the same datasets in the original repository (https://doi.org/10.13012/B2IDB-5721322_V1). To re-estimate species trees, (1) a seed was specified when running MulRF, and (2) a different script (specifically preprocess_multrees_v3.py from https://github.com/ekmolloy/fastmulrfs/releases/tag/v1.2.0) was used for preprocessing gene trees (which were then given as input to MulRF and FastMulRFS). Note that this preprocessing script is a re-implementation of the original algorithm for improved speed (a bug fix also was implemented). Finally, it was brought to my attention that the simulation in the Bioinformatics article differs from prior studies, because I scaled the species tree by 10 generations per year (instead of 0.9 years per generation, which is ~1.1 generations per year). I re-simulated datasets (true-trees-with-one-gen-per-year-psize-10000000.tar.gz and true-trees-with-one-gen-per-year-psize-50000000.tar.gz) using 0.9 years per generation to quantify the impact of this parameter change (see my PhD dissertation or the supplementary materials of Bioinformatics article for discussion).

keywords: Species tree estimation; gene duplication and loss; statistical consistency; MulRF, FastRFS

published: 2021-10-11

Peng, Jianhao; Ochoa, Idoia (2021): ClonalKinetic Data and Intermediate Results of SimiC. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3975180_V1

This dataset contains the ClonalKinetic dataset that was used in SimiC and its intermediate results for comparison. The Detail description can be found in the text file 'clonalKinetics_Example_data_description.txt' and 'ClonalKinetics_filtered.DF_data_description.txt'. The required input data for SimiC contains: 1. ClonalKinetics_filtered.clustAssign.txt => cluster assignment for each cell. 2. ClonalKinetics_filtered.DF.pickle => filtered scRNAseq matrix. 3. ClonalKinetics_filtered.TFs.pickle => list of driver genes. The results after running SimiC contains: 1. ClonalKinetics_filtered_L10.01_L20.01_Ws.pickle => inferred GRNs for each cluster 2. ClonalKinetics_filtered_L10.01_L20.01_AUCs.pickle => regulon activity scores for each cell and each driver gene. NOTE: “ClonalKinetics_filtered.rds” file which is mentioned in “ClonalKinetics_filtered.DF_data_description.txt” is an intermediate file and the authors have put all the processed in the pickle/txt file as described in the filtered data text.

keywords: GRNs;SimiC;RDS;ClonalKinetic

published: 2024-05-23

Xing, Yuqing; Bae, Seokjin; Ritz, Ethan; Yang, Fan; Birol, Turan; Salinas , Andrea N. Capa ; Ortiz, Brenden R.; Wilson , Stephen D.; Wang, Ziqiang; Fernandes, Rafael M.; Madhavan, Vidya (2024): Data for manuscript entitled "Optical Manipulation of the Charge Density Wave state in RbV3Sb5". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4197245_V1

This dataset consists of all the figure files that are part of the main text and supplementary of the manuscript titled "Optical manipulation of the charge density wave state in RbV3Sb5". For detailed information on the individual files refer to the readme file.

keywords: kagome superconductor; optics; charge density wave

published: 2019-11-11

Molloy, Erin K.; Warnow, Tandy (2019): Data from: FastMulRFS: Statistically consistent polynomial time species tree estimation under gene duplication. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-5721322_V1

This repository includes scripts and datasets for the paper, "FastMulRFS: Fast and accurate species tree estimation under generic gene duplication and loss models." Note: The results from estimating species trees with ASTRID-multi (included in this repository) are *not* included in the FastMulRFS paper. We estimated species trees with ASTRID-multi in the fall of 2019, but ASTRID-multi had an important bug fix in January 2020. Therefore, the ASTRID-multi species trees in this repository should be ignored.

keywords: Species tree estimation; gene duplication and loss; statistical consistency; MulRF, FastRFS

published: 2020-02-01

Williams, Benjamin R.; Benson, Thomas J. (2020): Habitat Use of Spring Migrating Dabbling Ducks in the Wabash River Valley. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7017235_V1

This data describes habitat use, availability, landscape level influences, and daily movement of dabbling ducks in the Wabash River Valley of southeastern Illinois and southwestern Indiana. It contains triangulated locations of individual ducks, associated habitat assignments of those locations, flood survey data to determine water availability, and randomly generated points to assess landscape level questions.

keywords: waterfowl; ducks; dabbling; mallard; teal; habitat

published: 2020-12-12

Jones, Todd M.; Benson , Thomas J.; Ward, Michael P. (2020): Jones et al. FE-2019-01175. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1999427_V1

Dataset associated with Jones et al FE-2019-01175 submission: Does the size and developmental stage of traits at fledging reflect juvenile flight ability among songbirds? Excel CSV files with all of the data used in analyses and file with descriptions of each column. The flight ability variable in this dataset was derived from fledgling drop tests, examples of which can be found in the related dataset: Jones, Todd M.; Benson, Thomas J.; Ward, Michael P. (2019): Flight Ability of Juvenile Songbirds at Fledgling: Examples of Fledgling Drop Tests. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2044905_V1.

keywords: body condition; fledgling; flight ability; locomotor ability; post-fledging; songbirds; wing development; wing emergence

published: 2020-06-03

Zachwieja, Alexandra (2020): Ecological niche models of Late Pleistocene human land preference: an Australasian test case. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0065911_V1

This dataset provides files for use in analysis of human land preference across Australasia, and in a localized analysis of land preference in Laos and Vietnam. All files can be imported into ArcGIS for visualization, and re-analyzed using the open source Maxent species distribution modeling program. CSV files contain known human presence sites for model validation. ASC files contain geographically coded environmental data for mean annual temperature and mean annual precipitation during the Last Glacial Maximum, as well as downward slope data. All ASC files are in the WGS 1984 Mercator map projection for visualization in ArcGIS and can be opened as text files in text editors supporting large file sizes.

keywords: human dispersal; ecological niche modeling; Australasia; Late Pleistocene; land preference

published: 2020-02-05

Zahniser, James; Dietrich, Christopher (2020): NEXUS data file for phylogenetic analysis of Deltocephalinae (Hemiptera: Cicadellidae) . University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7290912_V1

The Delt_Comb.NEX text file contains the original data used in the phylogenetic analyses of Zahniser & Dietrich, 2013 (European Journal of Taxonomy, 45: 1-211). The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The first nine lines of the file indicate the file type (Nexus), that 152 taxa were analyzed, that a total of 3971 characters were analyzed, the format of the data, and specification for two symbols used in the dataset. There are four datasets separated into blocks, one each for: 28S rDNA gene, Histone H3 gene, morphology, and insertion/deletion characters scored based on the alignment of the 28S rDNA dataset. Descriptions of the morphological characters and more details on the species and specimens included in the dataset are provided in the publication using this dataset. A text file, Delt_morph_char.txt, is available here that states the morphological characters and characters states that were scored in the Delt_Comb.NEX dataset. The original DNA sequence data are available from NCBI GenBank under the accession numbers indicated in publication. Chromatogram files for each sequencing read are available from the first author upon request.

keywords: phylogeny; DNA sequence; morphology; parsimony analysis; Insecta; Hemiptera; Cicadellidae; leafhopper; evolution; 28S rDNA; histone H3; bayesian analysis

published: 2020-02-12

Price, Edward; Spyreas, Greg; Matthews, Jeffrey (2020): Biotic homogenization of wetland vegetation in the conterminous United States driven by Phalaris arundinacea and anthropogenic disturbance. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7128075_V1

This is the dataset used in the Landscape Ecology publication of the same name. This dataset consists of the following files: NWCA_Int_Veg.txt NWCA_Reg_Veg.txt NWCA_Site_Attributes.txt NWCA_Int_Veg.txt is a site and plot by species matrix. Column labeled SITES consists of site IDs. Column labeled Plots consist of Plot ID numbers. All other columns represent species abundances (estimates of percent cover, summed across five plots). NWCA_Reg_Veg.txt is a site by species matrix of species abundances. Column labeled SITES consist of site IDs. All other columns represent species abundances (estimates of percent cover within individual plots). NWCA_Site_Attributes.txt is a matrix of site attributes. Column labeled SITES consist of site IDs. Column labeled AA_CENTER_LAT consist of latitudinal coordinates for the Assessment Area center point in decimal degrees. Column labeled AA_CENTER_LONG consist of longitudinal coordinates for the Assessment Area center point in decimal degrees. Column REFPLUS_NWCA represents disturbance gradient classes including MIN (minimally disturbed), L (least disturbed), I (intermediate), M (most disturbed). Column REFPLUS_NWCA2 represents revised disturbance gradient classes based on protocols described in the article. These revised classes were used for analysis. Column labeled STRESS_HEAVYMETAL represents heavy metal stressor classes, used to ascertain which wetlands were missing soil data. Classes in the STRESS_HEAVYMETAL column include Low, Moderate, High, and Missing. Sites with Missing STRESS_HEAVYMETAL classes were removed from analysis. More information about this dataset: All of the data used in this analysis was gathered from the National Wetlands Condition Assessment. Wetland surveys were conducted from 4/4/2011 to 11/2/2011. The entire National Wetlands Condition Assessment Dataset, which includes 3640 unique taxonomic identities of plants, can be found at: https://www.epa.gov/national-aquatic-resource-surveys/data-national-aquatic-resource-surveys

keywords: Anthropogenic disturbance; β-Diversity; Biotic homogenization; Phalaris arundinacea; reed canary grass; Wetlands

published: 2020-02-12

XSEDE-Extreme Science and Engineering Discovery Environment (2020): XSEDE: Allocations Awards for the NSF Cyberinfrastructure Portfolio, 2004-2019. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0757799_V1

The XSEDE program manages the database of allocation awards for the portfolio of advanced research computing resources funded by the National Science Foundation (NSF). The database holds data for allocation awards dating to the start of the TeraGrid program in 2004 to present, with awards continuing through the end of the second XSEDE award in 2021. The project data include lead researcher and affiliation, title and abstract, field of science, and the start and end dates. Along with the project information, the data set includes resource allocation and usage data for each award associated with the project. The data show the transition of resources over a fifteen year span along with the evolution of researchers, fields of science, and institutional representation.

keywords: allocations; cyberinfrastructure; XSEDE

published: 2020-02-27

Clem, Scott; Sparbanie, Taylor; Luro, Alec; Harmon-Threatt, Alexandra (2020): Data for: Anthophilous hover flies (Diptera: Syrphidae) may visually discriminate neonicotinoid insecticides in sucrose solution: a choice experiment. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0490928_V1

These data were collected for an experiment examining effects of neonicotinoid (clothianidin) presence on hover fly (Diptera: Syrphidae) behavior. Hover flies of two species (Eristalis arbustorum and Toxomerus marginatus) were offered a choice to feed on artificial flowers laced with sucrose solution that was either contaminated (CLO) or not contaminated (CON) with clothianidin. Two different concentrations of clothianidin in 0.5 M sucrose solution were tested: 2.5 ppb and 150 ppb. We conducted four sets of 10 trials, each trial set examining a different combination of species and clothianidin dose. Across 6 hours of video for each trial we recorded 1) number of visits to each flower that resulted in feeding, and 2) amount of time spent feeding during each visit. We found that while neither species fed significantly longer on either of the solutions, E. arbustorum appeared to avoid flowers with clothianidin particularly at high rates. In the paper, we attribute this avoidance response, partially, to hover fly-visible spectral differences between the two flower choices and discuss potential implications for field and lab-based studies. In the enclosed zip file we have included all data for this project and code scripts from R. * Note: Data folder contains 4 files (instead of 6 as mentioned in Readme): e.tenax_photoreceptors.csv; hoverfly_data_UPDATE.csv; number_visits_UPDATE.csv; and Original 2018 hover fly choice test data_Clem2020.xlsx

keywords: Syrphidae; hoverfly; Eristalis; Toxomerus; Choice Experiment; Neonicotinoid; Clothianidin

published: 2020-08-01

Horna Munoz, Daniel; Constantinescu, George; Rhoads, Bruce ; Lewis, Quinn; Sukhodolov, Alexander (2020): Confluence Density Effects Simulation Database. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6257171_V1

This data set shows how density effects have an important influence on mixing at a small river confluence. The data consist of results of simulations using a detached eddy simulation model.

keywords: confluence; flow dynamics; density effects

published: 2020-03-03

Schneider, Jodi; Ye, Di (2020): Network of First and Second-generation citations to Matsuyama 2005 from Google Scholar and Web of Science. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1403534_V2

This second version (V2) provides additional data cleaning compared to V1, additional data collection (mainly to include data from 2019), and more metadata for nodes. Please see NETWORKv2README.txt for more detail.

keywords: citations; retraction; network analysis; Web of Science; Google Scholar; indirect citation

published: 2020-04-07

Larsen, Ryan; Charles, Hillman; Kramer, Arthur; Cohen, Neal; Barbey, Aron (2020): Dataset for "Body mass and cardiorespiratory fitness are associated with altered brain metabolism". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9371397_V1

Baseline data from a multi-modal intervention study conducted at the University of Illinois at Urbana-Champaign. Data include results from a cardiorespiratory fitness assessment (maximal oxygen consumption, VO2max), a body composition assessment (Dual-Energy X-ray Absorptiometry, DXA), and Magnetic Resonance Spectroscopy Imaging. Data set includes data from 435 participants, ages 18-44 years.

keywords: Magnetic Resonance Spectroscopy; N-acetyl aspartic acid (NAA); Body Mass Index; cardiorespiratory fitness; body composition

published: 2020-05-15

Mishra, Shubhanshu; Agarwal, Sneha; Guo, Jinlong ; Phelps , Kirstin ; Picco, Johna ; Diesner , Jana (2020): Tweet IDs annotated for enthusiasm and support towards social causes: CTE, cyberbullying, and LGBT. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2603648_V1

This data has tweets collected in paper Shubhanshu Mishra, Sneha Agarwal, Jinlong Guo, Kirstin Phelps, Johna Picco, and Jana Diesner. 2014. Enthusiasm and support: alternative sentiment classification for social movements on social media. In Proceedings of the 2014 ACM conference on Web science (WebSci '14). ACM, New York, NY, USA, 261-262. DOI: https://doi.org/10.1145/2615569.2615667 The data only contains tweet IDs and the corresponding enthusiasm and support labels by two different annotators.

keywords: Twitter; text classification; enthusiasm; support; social causes; LGBT; Cyberbullying; NFL

published: 2019-09-25

Wong, Tony; Hughes, A; Tokuda, K; Indebetouw, R; Onishi, T; Bandurski, J. B.; Chen, C. H. R.; Fukui, Y; Glover, S. C. O.; Klessen, R. S.; Pineda, J. L.; Roman-Duval, J.; Sewilo, M.; Wojciechowski, E.; Zahorecz, S. (2019): Data for: Relations Between Molecular Cloud Structure Sizes and Line Widths in the Large Magellanic Cloud. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7090706_V1

12CO and 13CO maps for six molecular clouds in the Large Magellanic Cloud, obtained with the Atacama Large Millimeter/submillimeter Array (ALMA). See the associated article in the Astrophysical Journal, and README files within each ZIP archive. Please cite the article if you use these data.

keywords: Radio astronomy

published: 2020-02-23

Ye, Di; Hill, Alison; Whitehorn (Fulton), Ashley; Schneider, Jodi (2020): Citation context annotation for new and newly found citations (2006-2019) to retracted paper Matsuyama 2005. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8150563_V1

Citation context annotation for papers citing retracted paper Matsuyama 2005 (RETRACTED: Matsuyama W, Mitsuyama H, Watanabe M, Oonakahara KI, Higashimoto I, Osame M, Arimura K. Effects of omega-3 polyunsaturated fatty acids on inflammatory markers in COPD. Chest. 2005 Dec 1;128(6):3817-27.), retracted in 2008 (Retraction in: Chest (2008) 134:4 (893) <a href="https://doi.org/10.1016/S0012-3692(08)60339-6">https://doi.org/10.1016/S0012-3692(08)60339-6<a/> ). This is part of the supplemental data for Jodi Schneider, Di Ye, Alison Hill, and Ashley Whitehorn. "Continued Citation of a Fraudulent Clinical Trial Report, Eleven Years after it was retracted for Falsifying Data" [R&R under review with Scientometrics]. Overall we found 148 citations to the retracted paper from 2006 to 2019, However, this dataset does not include the annotations described in the 2015. in Ashley Fulton, Alison Coates, Marie Williams, Peter Howe, and Alison Hill. "Persistent citation of the only published randomized controlled trial of omega-3 supplementation in chronic obstructive pulmonary disease six years after its retraction." Publications 3, no. 1 (2015): 17-26. In this dataset 70 new and newly found citations are listed: 66 annotated citations and 4 pending citations (non-annotated since we don't have full-text). "New citations" refer to articles published from March 25, 2014 to 2019, found in Google Scholar and Web of Science. "Newly found citations" refer articles published 2006-2013, found in Google Scholar and Web of Science, but not previously covered in Ashley Fulton, Alison Coates, Marie Williams, Peter Howe, and Alison Hill. "Persistent citation of the only published randomised controlled trial of omega-3 supplementation in chronic obstructive pulmonary disease six years after its retraction." Publications 3, no. 1 (2015): 17-26. NOTES: This is Unicode data. Some publication titles & quotes are in non-Latin characters and they may contain commas, quotation marks, etc. FILES/FILE FORMATS Same data in two formats: 2006-2019-new-citation-contexts-to-Matsuyama.csv - Unicode CSV (preservation format only) 2006-2019-new-citation-contexts-to-Matsuyama.xlsx - Excel workbook (preferred format) ROW EXPLANATIONS 70 rows of data - one citing publication per row COLUMN HEADER EXPLANATIONS Note - processing notes Annotation pending - Y or blank Year Published - publication year ID - ID corresponding to the network analysis. See Ye, Di; Schneider, Jodi (2019): Network of First and Second-generation citations to Matsuyama 2005 from Google Scholar and Web of Science. University of Illinois at Urbana-Champaign. <a href="https://doi.org/10.13012/B2IDB-1403534_V2">https://doi.org/10.13012/B2IDB-1403534_V2</a> Title - item title (some have non-Latin characters, commas, etc.) Official Translated Title - item title in English, as listed in the publication Machine Translated Title - item title in English, translated by Google Scholar Language - publication language Type - publication type (e.g., bachelor's thesis, blog post, book chapter, clinical guidelines, Cochrane Review, consumer-oriented evidence summary, continuing education journal article, journal article, letter to the editor, magazine article, Master's thesis, patent, Ph.D. thesis, textbook chapter, training module) Book title for book chapters - Only for a book chapter - the book title University for theses - for bachelor's thesis, Master's thesis, Ph.D. thesis - the associated university Pre/Post Retraction - "Pre" for 2006-2008 (means published before the October 2008 retraction notice or in the 2 months afterwards); "Post" for 2009-2019 (considered post-retraction for our analysis) Identifier where relevant - ISBN, Patent ID, PMID (only for items we considered hard to find/identify, e.g. those without a DOI-based URL) URL where available - URL, ideally a DOI-based URL Reference number/style - reference Only in bibliography - Y or blank Acknowledged - If annotated, Y, Not relevant as retraction not published yet, or N (blank otherwise) Positive / "Poor Research" (Negative) - P for positive, N for negative if annotated; blank otherwise Human translated quotations - Y or blank; blank means Google scholar was used to translate quotations for Translated Quotation X Specific/in passing (overall) - Specific if any of the 5 quotations are specific [aggregates Specific / In Passing (Quotation X)] Quotation 1 - First quotation (or blank) (includes non-Latin characters in some cases) Translated Quotation 1 - English translation of "Quotation 1" (or blank) Specific / In Passing (Quotation 1) - Specific if "Quotation 1" refers to methods or results of the Matsuyama paper (or blank) What is referenced from Matsuyama (Quotation 1) - Methods; Results; or Methods and Results - blank if "Quotation 1" not specific, no associated quotation, or not yet annotated Quotation 2 - Second quotation (includes non-Latin characters in some cases) Translated Quotation 2 - English translation of "Quotation 2" Specific / In Passing (Quotation 2) - Specific if "Quotation 2" refers to methods or results of the Matsuyama paper (or blank) What is referenced from Matsuyama (Quotation 2) - Methods; Results; or Methods and Results - blank if "Quotation 2" not specific, no associated quotation, or not yet annotated Quotation 3 - Third quotation (includes non-Latin characters in some cases) Translated Quotation 3 - English translation of "Quotation 3" Specific / In Passing (Quotation 3) - Specific if "Quotation 3" refers to methods or results of the Matsuyama paper (or blank) What is referenced from Matsuyama (Quotation 3) - Methods; Results; or Methods and Results - blank if "Quotation 3" not specific, no associated quotation, or not yet annotated Quotation 4 - Fourth quotation (includes non-Latin characters in some cases) Translated Quotation 4 - English translation of "Quotation 4" Specific / In Passing (Quotation 4) - Specific if "Quotation 4" refers to methods or results of the Matsuyama paper (or blank) What is referenced from Matsuyama (Quotation 4) - Methods; Results; or Methods and Results - blank if "Quotation 4" not specific, no associated quotation, or not yet annotated Quotation 5 - Fifth quotation (includes non-Latin characters in some cases) Translated Quotation 5 - English translation of "Quotation 5" Specific / In Passing (Quotation 5) - Specific if "Quotation 5" refers to methods or results of the Matsuyama paper (or blank) What is referenced from Matsuyama (Quotation 5) - Methods; Results; or Methods and Results - blank if "Quotation 5" not specific, no associated quotation, or not yet annotated Further Notes - additional notes

keywords: citation context annotation, retraction, diffusion of retraction