Illinois Data Bank Dataset Search Results
Results
published:
2024-07-31
LaBonte, Nicholas R.; Zerpa-Catanho, Dessiree P.; Liu, Siyao; Xiao, Liang; Dong, Hongxu; Clark, Lindsay V.; Sacks, Erik J.
(2024)
This dataset contains all data and supplementary materials from "Improving precision and accuracy of genetic mapping with genotyping-by-sequencing data in outcrossing species". An Excel file a list of all QTLs and linkage group length (in cM) obtained with two different SNP-calling methods (Tassel-Uneak and Tassel-GBS), genetic map-construction method (linkage-only and reference order-corrected) and depth filters (12x, 20x, 30x and 40x) for genetic mapping of 18 biomass yield traits in a biparental Miscanthus sinensis population using RAD-Seq SNPs is provided as "Supplementary file 1". A Perl script with the code for filtering VCF and HapMap-formatted data files is provided as “Supplementary file 2”. Phenotype data used for QTL mapping is provided as “Supplementary File 3”. A Perl script with the code for the simulation study is provided as “Supplementary file 4”.
keywords:
HapMapParser; GenotypingSimulator
published:
2025-04-25
Tassitano, Rafael; Chakraborty, Shreyonti
(2025)
This is an Excel file containing data about the physical environments of four Brazilian schools and the average daily minutes/day of physical activity and sedentary behavior exhibited by schoolchildren during school hours.
The Following Key describes the basic variables:
Subject IDs and Characteristics
Subject_ID: ID of Subject
total_days: Total number of days subject participated in experiment
Gender : Gender of subject
Age: Age of subject
School IDs and Characteristics
ID_School = ID of School
school1 = 1 if ID_School = 1, else = 0
school2 = 1 if ID_School = 2, else = 0
school3 = 1 if ID_School = 3, else = 0
school4 = 1 if ID_School = 4, else = 0
TotalSiteArea: Total Site Area on School Campus
PatioArea: Area of Patio(s)
CourtyardArea: Area of Courtyard(s)
TotalOpenArea: Total Area of Open Spaces on Campus
Class: Number of Sections in the School
Population: Total Number of Students Enrolled in the School
keywords:
school environment; physical activity
published:
2025-11-06
Sweedler, Jonathan; Rosado Rosa, Joenisse M.
(2025)
SCiLS MSI data files, images used in the figures and table contents for the tables found in the manuscript. The figures are labeled by figure and by their title on each figure set, including those found in the Supplementary Information. The tables are in an MS Excel sheet with the corresponding contents. The tables list the metabolites found in the images. To reduce the number of images in the manuscript, the tables complete the metabolite information not observed in the images. The images can be found using the SCiLS data files. A software license is needed to open these files. The SCiLS data files contains the processed MSI data for all obtained images. All files in the corresponding SCiLS data file must be present to open the individual data file. The feature list used for MSI analysis should be saved on the attached bookmark inside the SCiLS file so it should be available once the file is opened. SCiLS files can only be opened with the Bruker SCiLS software. If using an outdated version (before Version 13.01.17218), the files may not open or show poor quality.
keywords:
Tendrils; Pyocyanin; Quinolones; Spatiochemical; Metabolomics
published:
2022-12-05
Ng, Yee Man Margaret ; Taneja, Harsh
(2022)
These are similarity matrices of countries based on dfferent modalities of web use. Alexa website traffic, trending vidoes on Youtube and Twitter trends. Each matrix is a month of data aggregated
keywords:
Global Internet Use
published:
2021-05-07
Prepared by Vetle Torvik 2021-05-07
The dataset comes as a single tab-delimited Latin-1 encoded file (only the City column uses non-ASCII characters).
• How was the dataset created?
The dataset is based on a snapshot of PubMed (which includes Medline and PubMed-not-Medline records) taken in December, 2018. (NLMs baseline 2018 plus updates throughout 2018). Affiliations are linked to a particular author on a particular article. Prior to 2014, NLM recorded the affiliation of the first author only. However, MapAffil 2018 covers some PubMed records lacking affiliations that were harvested elsewhere, from PMC (e.g., PMID 22427989), NIH grants (e.g., 1838378), and Microsoft Academic Graph and ADS (e.g. 5833220). Affiliations are pre-processed (e.g., transliterated into ASCII from UTF-8 and html) so they may differ (sometimes a lot; see PMID 27487542) from PubMed records. All affiliation strings where processed using the MapAffil procedure, to identify and disambiguate the most specific place-name, as described in:
Torvik VI. MapAffil: A bibliographic tool for mapping author affiliation strings to cities and their geocodes worldwide. D-Lib Magazine 2015; 21 (11/12). 10p
• Look for Fig. 4 in the following article for coverage statistics over time:
Palmblad, M., Torvik, V.I. Spatiotemporal analysis of tropical disease research combining Europe PMC and affiliation mapping web services. Trop Med Health 45, 33 (2017). <a href="https://doi.org/10.1186/s41182-017-0073-6">https://doi.org/10.1186/s41182-017-0073-6</a>
Expect to see big upticks in coverage of PMIDs around 1988 and for non-first authors in 2014.
• The code and back-end data is periodically updated and made available for query by PMID at http://abel.ischool.illinois.edu/cgi-bin/mapaffil/search.py
• What is the format of the dataset?
The dataset contains 52,931,957 rows (plus a header row). Each row (line) in the file has a unique PMID and author order, and contains the following eighteen columns, tab-delimited. All columns are ASCII, except city which contains Latin-1.
1. PMID: positive non-zero integer; int(10) unsigned
2. au_order: positive non-zero integer; smallint(4)
3. lastname: varchar(80)
4. firstname: varchar(80); NLM started including these in 2002 but many have been harvested from outside PubMed
5. initial_2: middle name initial
6. orcid: From 2019 ORCID Public Data File https://orcid.org/ and from PubMed XML
7. year: year of the publication
8. journal: name of journal that the publication is published
9. affiliation: author's affiliation??
10. disciplines: extracted from departments, divisions, schools, laboratories, centers, etc. that occur on at least unique 100 affiliations across the dataset, some with standardization (e.g., 1770799), English translations (e.g., 2314876), or spelling corrections (e.g., 1291843)
11. grid: inferred using a high-recall technique focused on educational institutions (but, for experimental purposes, includes a few select hospitals, national institutes/centers, international companies, governmental agencies, and 200+ other IDs [RINGGOLD, Wikidata, ISNI, VIAF, http] for institutions not in GRID). Based on 2019 GRID version https://www.grid.ac/
12. type: EDU, HOS, EDU-HOS, ORG, COM, GOV, MIL, UNK
13. city: varchar(200); typically 'city, state, country' but could include further subdivisions; unresolved ambiguities are concatenated by '|'
14. state: Australia, Canada and USA (which includes territories like PR, GU, AS, and post-codes like AE and AA)
15. country
16. lat: at most 3 decimals (only available when city is not a country or state)
17. lon: at most 3 decimals (only available when city is not a country or state)
18. fips: varchar(5); for USA only retrieved by lat-lon query to https://geo.fcc.gov/api/census/block/find
keywords:
PubMed, MEDLINE, Digital Libraries, Bibliographic Databases; Author Affiliations; Geographic Indexing; Place Name Ambiguity; Geoparsing; Geocoding; Toponym Extraction; Toponym Resolution; institution name disambiguation
published:
2021-05-14
This is the complete dataset for the "Anomalous density fluctuations in a strange metal" Proceedings of the National Academy of Sciences publication (https://doi.org/10.1073/pnas.1721495115). This is an integration of the Zenodo dataset which includes raw M-EELS data.
<b>METHODOLOGICAL INFORMATION</b>
1. Description of methods used for collection/generation of data: Data have been collected with a M-EELS instrument and according to the data acquisition protocol described in the original PNAS publication and in SciPost Phys. 3, 026 (2017) (doi: 10.21468/SciPostPhys.3.4.026)
2. Methods for processing the data: Raw data were collected with a channeltron-based M-EELS apparatus described in the reference PNAS publication and analyzed according to the procedure outlined both in the PNAS paper and in SciPost Phys. 3, 026 (2017) (doi: 10.21468/SciPostPhys.3.4.026). The raw M-EELS spectra at each momentum have been subject to minor data processing involving:
(a) averaging of different acquisitions at the same conditions,
(b) energy binning,
(c) division of an effective Coulomb matrix element (which yields a structure factor S(q,\omega)),
(d) antisymmetrization (which yields the imaginary chi)
All these procedures are described in the PNAS paper.
3. Instrument- or software-specific information needed to interpret the data: These data are simple .txt or .dat files which can be read with any standard data analysis software, notably Python notebooks, MatLab, Origin, IgorPro, and others. We do not include scripts in order to provide maximum flexibility.
4. Relationship between files, if important: We divided in different folders raw data, structure factors and imaginary chi.
<b>DATA-SPECIFIC INFORMATION</b>
There are 8 folders within the Data_public_deposition_v1.zip. Each folder contain data needed to create the corresponding figure in the publication.
<b>1. Fig1:</b> This folder contains 21 DAT files needed to plot the theory data in panels C and D, following this naming conventions:
[chiA]or[chiB]or[Pi]_q_number.dat
With chiA is the imaginary RPA charge susceptibility with a Coulomb interaction of electronically weakly coupled layers
chiB is the imaginary RPA charge susceptibility with the usual 4\pi e^2/q^2 Coulomb interaction.
Pi is the imaginary Lindhard polarizability.
q is momentum in reciprocal lattice units
Number is the numerical momentum value in reciprocal lattice units
<b>2. Fig2:</b> Files needed to plot Fig. 2 of the PNAS paper. Contains 3 folders as listed below. The files in this folder are named following this convention: Bi2212_295K_(1,-1)_50eV_161107_q_number_2.16_avg.dat,
295K is the sample temperature
(1,-1) is the momentum direction in reciprocal lattice units
50 eV is the incident e beam energy
161107 is the start date of the experiment in yymmdd format
Q is the momentum
Number is the momentum in reciprocal lattice units
2.16 is the energy range covered by the data in eV
Avg identifies averaged data
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
<b>3. Fig3:</b> Files needed to plot Fig. 3 of the PNAS paper. OP/ OD prefix identifies optimally doped or overdosed sample data, respectively.
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
<b>4. Fig4:</b> Files needed to plot Fig. 4 of the PNAS paper. The _fit_parameters.dat file contains the fit parameters extracted according to the fit procedure described in the manuscript and at all momenta.
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
<b>5. FigS1:</b> Files needed to plot Fig. S1 of the PNAS paper. There are 5 files in this folder. DAT files are M-EELS data following the prior naming convention, while the two .txt files are digitized data from N. Nücker, U. Eckern, J. Fink, and P. Müller, Long-Wavelength Collective Excitations of Charge Carriers in High-Tc Superconductors, Phys. Rev. B 44, 7155(R) (1991), and K. H. G. Schulte, The interplay of Spectroscopy and Correlated Materials, Ph.D. thesis, University of Groningen (2002).
<b>6. FigS2:</b> Files needed to plot Fig. S2 of the PNAS paper.
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
<b>7. FigS3:</b> Files needed to plot Fig. S3 of the PNAS paper. There are 2 files in this folder:
20K_phi_0_q_0.dat: is a M-EELS raw intensity at zero momentum transfer on Bi2212 at 20 K
295K_phi_0_q_0.dat: is a M-EELS raw intensity at zero momentum transfer on Bi2212 at 295 K
<b>8. FigS4:</b> Files needed to plot Fig. S4 of the PNAS paper. The _fit_parameters.dat file contains the fit parameters extracted according to the fit procedure described in the manuscript and at all momenta.
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
keywords:
Momentum resolved electron energy loss spectroscopy (M-EELS); cuprates; plasmons; strange metal
published:
2025-10-14
Jagtap, Sujit Sadashiv; Deewan, Anshu; Liu, Jing-Jing; Walukiewicz, Hanna E.; Yun, Eun Ju; Jin, Yong-Su; Rao, Christopher V.
(2025)
Rhodosporidium toruloides is an oleaginous yeast capable of producing a variety of biofuels and bioproducts from diverse carbon sources. Despite numerous studies showing its promise as a platform microorganism, little is known about its metabolism and physiology. In this work, we investigated the central carbon metabolism in R. toruloides IFO0880 using transcriptomics and metabolomics during growth on glucose, xylose, acetate, or soybean oil. These substrates were chosen because they can be derived from plants. Significant changes in gene expression and metabolite concentrations were observed during growth on these four substrates. We mapped these changes onto the governing metabolic pathways to better understand how R. toruloides reprograms its metabolism to enable growth on these substrates. One notable finding concerns xylose metabolism, where poor expression of xylulokinase induces a bypass leading to arabitol production. Collectively, these results further our understanding of central carbon metabolism in R. toruloides during growth on different substrates. They may also help guide the metabolic engineering and development of better models of metabolism for R. toruloides.
keywords:
Conversion;Metabolomics;Transcriptomics
published:
2022-10-14
Zhou, Shan; Li, Jiahui; Lu, Jun; Liu, Haihua; Kim, Ji-Young; Kim, Ahyoung; Yao, Lehan; Liu, Chang; Qian, Chang; Hood, Zachary D. ; Lin, Xiaoying; Chen, Wenxiang; Gage, Thomas E. ; Arslan, Ilke; Travesset, Alex; Sun, Kai; Kotov, Nicholas A.; Chen, Qian
(2022)
This dataset is the raw data including SEM, TEM, PINEM images and FDTD simulation as well as pairwise interaction calculation results.
published:
2025-10-14
Jia, Yuyao; Kumar, Deepak; Winkler-Moser, Jill K.; Dien, Bruce S.; Rausch, Kent D.; Tumbleson, M.E.; Singh, Vijay
(2025)
Efforts to engineer high-productivity crops to accumulate oils in their vegetative tissue present the possibility of expanding biodiesel production. However, processing the new crops for lipid recovery and ethanol production from cell wall saccharides is challenging and expensive. In a previous study using corn germ meal as a model substrate, we reported that liquid hot water (LHW) pretreatment enriched the lipid concentration by 2.2 to 4.2 fold. This study investigated combining oil recovery with ethanol production by extracting oil following LHW and simultaneous saccharification and co-fermentation (SSCF) of the biomass. Corn germ meal was again used to model the oil-bearing energy crops. Pretreated germ meal hydrolysate or solids (160 °C and 180 °C for 10 minutes) were fermented, and lipids were extracted from both the spent fermentation whole broth and fermentation solids, which were recovered by centrifugation and convective drying. Lipid contents in spent fermentation solids increased 3.7 to 5.7 fold compared to the beginning germ meal. The highest lipid yield achieved after fermentation was 36.0 mg lipid g−1 raw biomass; the maximum relative amount of triacylglycerol (TAG) was 50.9% of extracted oil. Although the fermentation step increased the lipid concentration of the recovered solids, it did not improve the lipid yields of pretreated biomass and detrimentally affected oil compositions by increasing the relative concentrations of free fatty acids.
keywords:
Conversion;Hydrolysate;Lipidomics
published:
2025-10-03
Kang, Nam Kyu; Lee, Jaewon; Ort, Donald; Jin, Yong-Su
(2025)
L-malic acid is widely used in the food, chemical, and pharmaceutical industries. Here, we report on production of malic acid from xylose, the second most abundant sugar in lignocellulosic hydrolysates, by engineered Saccharomyces cerevisiae. To enable malic acid production in a xylose-assimilating S. cerevisiae, we overexpressed PYC1 and PYC2, coding for pyruvate carboxylases, a truncated MDH3 coding for malate dehydrogenase, and SpMAE1, coding for a Schizosaccharomyces pombe malate transporter. Additionally, both the ethanol- and glycerol-producing pathways were blocked to enhance malic acid production. The resulting strain produced malic acid from both glucose and xylose, but it produced much higher titers of malic acid from xylose than glucose. Interestingly, the engineered strain had higher malic acid yield from lower concentrations (10 g L‒1) of xylose, with no ethanol production, than from higher xylose concentrations (20 and 40 g L‒1). As such, a fed-batch culture maintaining xylose concentrations at low levels was conducted and 61.2 g L‒1 of malic acid was produced, with a productivity of 0.32 g L‒1 h. These results represent successful engineering of S. cerevisiae for the production of malic acid from xylose, confirming that that xylose offers the efficient production of various biofuels and chemicals by engineered S. cerevisiae.
keywords:
Conversion;Feedstock Production;Genome Engineering
published:
2023-12-23
Rodriguez-Zas, Sandra
(2023)
Supplemental document corresponding to a submission to Physiological Genomics (Data supplements and source materials must now be deposited in a community-recognized data repository or to a generalist public access repository if no community resource is available. See "Author/Production Requirements" for more information.) https://pg.msubmit.net/
keywords:
Supplemental, Physiological Genomics
published:
2025-03-20
This dataset contains white-tailed deer (Odocoileus virginianus) land cover utility score (deer LCU score) data for every TRS (township, range, and section), township-range, and county in Illinois, USA, based on annual National Land Cover Database (NLCD) data released for all years between 2000 and 2023. LCU data is provided in CSV files for each spatial scale, with TRS data split into 2 CSV files due to size limits. Rasters (TIF) showing all deer habitat in Illinois are also provided to show the location, quality, and quantity of deer habitat. A metadata file is also included for additional information.
keywords:
habitat; white-tailed deer; deer; Odocoileus virginianus; land cover; land classification; landscape; habitat suitability index; ecology; environment
published:
2025-11-03
von Haden, Adam C.; Eddy, William; Burnham, Mark B.; Brzostek, Edward; Yang, Wendy; DeLucia, Evan H.
(2025)
Root exudation is a key process for plant nutrient acquisition, but the controls on root exudation and its relationship to soil C and N processes in agroecosystems are unclear. We hypothesized that root exudation rates would be related to root morphological traits, N fertilization, and soil moisture. We also anticipated that root exudation would be correlated with bulk soil enzyme activity. Root exudation, root traits, and bulk soil extracellular enzyme activity were assessed in maize (Zea mays L.), soybean (Glycine max (L.) Merr.), biomass sorghum (Sorghum bicolor (L.) Moench), giant miscanthus (Miscanthus × giganteus), and switchgrass (Panicum virgatum L.). Measurements were taken in situ during two growing seasons with contrasting precipitation regimes, and N fertilization rate was varied in sorghum during one year. Specific root exudation (per unit root surface area) was negatively related to root diameter and was generally higher in annuals than perennials. Sorghum N fertilization did not affect root exudation rates, and soil moisture regime had no effect on annual root exudation rates within maize, sorghum, and miscanthus. Specific root exudation was negatively related to bulk soil C- and N-degrading soil enzyme activities. Intrinsic plant characteristics appeared more important than environmental variables in controlling in situ root exudation rates. The relationships between root diameter, root exudation, and soil C and N processes link root morphological traits to soil functions and demonstrate the potential tradeoffs among plant nutrient acquisition strategies in agroecosystems.
keywords:
Sustainability;Biomass Analytics;Field Data
published:
2021-10-27
de Jesús Astacio, Luis Miguel ; Prabhakara, Kaumudi Hassan; Li, Zeqian; Mickalide, Harry; Kuehn , Seppe
(2021)
Shared dataset consists of 16S sequencing data of microbial communities. Each community is composed of heterotrophic bacteria derived from one of two soil samples and the model algae Chlamydomonas reinhardtii. Each comunity was placed in a materially closed environment with an initial supply of carbon in the media and subjected to light-dark cycles. The closed microbial ecosystems (CES) survived via carbon cycling. Each CES was subjected to rounds of dilution, after which the community was sequenced (data provided here). The shared dataset allowed us to conclude that CES consistently self-assembled to cycle carbon (data not provided) via conserved metabolic capabilites (data not provided) dispite differences in taxonomic composition (data provided).
---------------------------
Naming convention:
[soil sample = A or B][CES replicate = 1,2,3, or 4]_[round number = 1,2,3,or 4]_[reverse read = R or forward read = F]_filt.fastq
Example -- A1_r1_F_filt.fastq means soil sample A, CES replicate 1, end of round1, forward read
keywords:
16S seq; .fastq; closed microbial ecosystems; carbon cycling
published:
2022-09-07
Jiang, Chongya; Guan, Kaiyu; Khanna, Madhu; Chen, Luoye; Peng, Jian
(2022)
The availability of economically marginal land for energy crops is identified using the Cropland Data Layer and other soil, wind, climate data resources. All data are recognized on a 30m spatial resolution across the continental United States.
keywords:
marginal land; biofuel production; remote sensing; land use change; Cropland Data Layer
published:
2020-05-20
Origin Ventures Academy for Entrepreneurial Leadership, Gies College of Business
(2020)
This dataset is a snapshot of the presence and structure of entrepreneurship education in U.S. four-year colleges and universities in 2015, including co-curricular activities and related infrastructure. Public, private not-for-profit and for-profit institutions are included, as are specialized four-year institutions. The dataset provides insight into the presence of entrepreneurship education both within business units and in other units of college campuses. Entrepreneurship is defined broadly, to include small business management and related career-focused options.
keywords:
Entrepreneurship education; Small business education; Ewing Marion Kauffman Foundation; csv
published:
2025-11-04
Berardi, Danielle; Hartman, Melannie; Brzostek, Edward; Bernacchi, Carl; DeLucia, Evan H.; von Haden, Adam C.; Kantola, Ilsa B.; Moore, Caitlin; Yang, Wendy; Hudiburg, Tara; Parton, William J.
(2025)
Globally, soils hold approximately half of ecosystem carbon and can serve as a source or sink depending on climate, vegetation, management, and disturbance regimes. Understanding how soil carbon dynamics are influenced by these factors is essential to evaluate proposed natural climate solutions and policy regarding net ecosystem carbon balance. Soil microbes play a key role in both carbon fluxes and stabilization. However, biogeochemical models often do not specifically address microbial-explicit processes. Here, we incorporated microbial-explicit processes into the DayCent biogeochemical model to better represent large perennial grasses and mechanisms of soil carbon formation and stabilization. We also take advantage of recent model improvements to better represent perennial grass structural complexity and life-history traits. Specifically, this study focuses on: 1) a plant sub-model that represents perennial phenology and more refined plant chemistry with downstream implications for soil organic matter (SOM) cycling though litter inputs, 2) live and dead soil microbe pools that influence routing of carbon to physically protected and unprotected pools, 3) Michaelis-Menten kinetics rather than first-order kinetics in the soil decomposition calculations, and 4) feedbacks between decomposition and live microbial pools. We evaluated the performance of the plant sub-model and two SOM cycling sub-models, Michaelis-Menten (MM) and first-order (FO), using observations of net ecosystem production, ecosystem respiration, soil respiration, microbial biomass, and soil carbon from long-term bioenergy research plots in the mid-western United States. The MM sub-model represented seasonal dynamics of soil carbon fluxes better than the FO sub-model which consistently overestimated winter soil respiration. While both SOM sub-models were similarly calibrated to total, physically protected, and physically unprotected soil carbon measurements, the models differed in future soil carbon response to disturbance and climate, most notably in the protected pools. Adding microbial-explicit mechanisms of soil processes to ecosystem models will improve model predictions of ecosystem carbon balances but more data and research are necessary to validate disturbance and climate change responses and soil pool allocation.
keywords:
Sustainability;Field Data;Modeling;Plant-Soil Microbiome
published:
2025-02-20
Zhou, Xiaoran; Zheng, Heng
(2025)
To gather news articles from the web that discuss the Cochrane Review (DOI: 10.1002/14651858.CD006207.pub6), we retrieved articles on August 1, 2023 from used Altmetric.com's Altmetric Explorer. We selected all articles that were written in English, published in the United States, and had a publication date <b>on or after March 10, 2023</b> (according to the "Mention Date" from Altmetric.com). This date is significant as it is when Cochrane issued a statement (https://www.cochrane.org/news/statement-physical-interventions-interrupt-or-reduce-spread-respiratory-viruses-review) about the "misleading interpretation" of the Cochrane Review made by news articles.
A previously published dataset for "Arguing about Controversial Science in the News: Does Epistemic Uncertainty Contribute to Information Disorder?" (DOI: 10.13012/B2IDB-4781172_V1) contains annotation of the news articles published before March 10, 2023. Our dataset annotates the news published on or after March 10, 2023.
The Altmetric_data.csv describes the selected news articles with both data exported from Altmetric Explorer and data we manually added
Data exported from Altmetric Explorer:
- Publication date of the news article
- Title of the news article
- Source/publication venue of the news article
- URL
- Country
Data we manually added:
- Whether the article is accessible
- The date we checked the article
- The corresponding ID of the article in MAXQDA
For each article from Altmetric.com, we first tried to use the Web Collector for MAXQDA to download the article from the website and imported it into MAXQDA (version 22.8.0).
We manually extracted direct quotations from the articles using MAXQDA.
We included surrounding words and sentences around direct quotations for context where needed.
We manually added codes and code categories in MAXQDA to identify the individuals (chief editors of the Cochrane Review, government agency representatives, journalists, and other experts such as physicians) or organizations (government agencies, other organizations, and research publications) who were quoted.
The MAXQDA_data.csv file contains excerpts from the news articles that contain the direct quotations we annotated.
For each excerpt, we included the following information:
- MAXQDA ID of the document from which the excerpt originates
- The collection date and source of the document
- The code we assigned to the excerpt
- The code category
- The excerpt itself
keywords:
altmetrics; MAXQDA; masks for COVID-19; scientific controversies; news articles
published:
2025-10-10
Sun, Liang; Atkinson, Christine A.; Lee, Ye-Gi; Jin, Yong-Su
(2025)
β‐Carotene is a natural pigment and health‐promoting metabolite, and has been widely used in the nutraceutical, feed, and cosmetic industries. Here, we engineered a GRAS yeast Saccharomyces cerevisiae to produce β‐carotene from xylose, the second most abundant and inedible sugar component of lignocellulose biomass. Specifically, a β‐carotene biosynthetic pathway containing crtYB, crtI, and crtE from Xanthophyllomyces dendrorhous was introduced into a xylose‐fermenting S. cerevisiae. The resulting strain produced β‐carotene from xylose at a titer threefold higher than from glucose. Interestingly, overexpression of tHMG1, which has been reported as a critical genetic perturbation to enhance metabolic fluxes in the mevalonate pathway and β‐carotene production in yeast when glucose is used, did not further improve the production of β‐carotene from xylose. Through fermentation profiling, metabolites analysis, and transcriptional studies, we found the advantages of using xylose as a carbon source, instead of glucose, for β‐carotene production to be a more respiratory feature of xylose consumption, a larger cytosolic acetyl‐CoA pool, and an upregulated expression level of rate‐limiting genes in the β‐carotene‐producing pathway, including ACS1 and HMG1. As a result, 772.8 mg/L of β‐carotene was obtained in a fed‐batch bioreactor culture with xylose feeding. Considering the inevitable large scale production of xylose when cellulosic biomass‐based bioeconomy is implemented, our results suggest xylose utilization is a promising strategy for overproduction of carotenoids and other isoprenoids in engineered S. cerevisiae.
keywords:
Conversion;Genome Engineering
published:
2025-11-10
Raj, Tirath; Dien, Bruce; Singh, Vijay
(2025)
Sugarcane is being enhanced as a bioenergy crop by engineering it to accumulate and store lipids along with polymeric sugars in vegetative tissues. However, there is no existing process that allows for processing this new crop to recover both lipid and cellulosic sugars from the oilcane bagasse. Therefore, a comprehensive investigation of two pretreatment methods—natural deep eutectic solvents (NADES) and chemical-free hydrothermal pretreatment (HT) was conducted to judge their suitability for recovering fermentable sugars, lipids, and lignin from bagasse. Two NADES, i.e., choline chloride: lactic acid (ChCl:LA) and betaine: lactic acid (BT:LA) were prepared using a 1:2 M ratio and were evaluated for pretreatment of oilcane bagasse at 10, 20, and 50 % (w/w) solids, followed by enzymatic hydrolysis at 10 % (w/w) solids. Notably, ChCl:LA NADES treatment at 10 % (w/w) solids at 140 °C for 2 h, solubilized 78.8 % of lignin and 80.4 % of hemicellulose and allowed 82.7 % enzymatic conversion of glucans to glucose. In contrast, HT pretreatment removed approximately 87.6 % of the hemicellulose and provided an enzymatic glucose yield of 69.7 %. Furthermore, ChCl:LA operated at 50 % solids loading the enriched lipids 2.6-fold (9.2 wt%) in recovered solids compared to HT (6.4 %) and BT:LA (5.1 %) pretreatment processes. NMR-HSQC and GPC analysis showed that ChCl:LA also cleaved the most lignin β–O–4 linkages and demonstrated lower molecular weight compared to HT. This study demonstrates that NADES pretreatment is an effective green processing method for recovering lipids, sugars, and lignin from bioenergy crops at high solid loading (50 % w/w) within the context of an integrated biorefinery.
keywords:
Conversion;Hydrolysate;Lipidomics
published:
2022-08-06
Carson, Dawn; Kopsco, Heather; Gronemeyer, Peg; Mateus-Pinilla, Nohra; Smith, Genee; Sandstrom, Emma; Smith, Rebecca
(2022)
An online knowledge, attitudes, and practices survey on ticks and tick-borne diseases was distributed to medical professionals in Illinois during summer 2020 to fall 2021. These are the raw data associated with that survey and the survey questions used. Age, gender, and county of practice have been removed for identifiability. We have added calculated values (columns 165 to end), including: the tick knowledge score, TBD knowledge score, and total knowledge score, which are the sum of the total number of correct answers in each category, and score percent, which are the proportion of correct answers in each category; region, which is determined from the county of practice; TBD relevant practice, which separates the practice variable into TBD primary, secondary, and non-responders; and several variables which group categories.
keywords:
ticks; medicine; tick-borne disease; survey
published:
2018-10-03
Das, Anupam; Acar, Gunes; Borisov, Nikita; Pradeep, Amogh
(2018)
This dataset is the result of three crawls of the web performed in May 2018. The data contains raw crawl data and instrumentation captured by OpenWPM-Mobile, as well as analysis that identifies which scripts access mobile sensors, which ones perform some of browser fingerprinting, as well as clustering of scripts based on their intended use. The dataset is described in the included README.md file; more details about the methodology can be found in our ACM CCS'18 paper: Anupam Das, Gunes Acar, Nikita Borisov, Amogh Pradeep. The Web's Sixth Sense: A Study of Scripts Accessing Smartphone Sensors. In Proceedings of the 25th ACM Conference on Computer and Communications Security (CCS), Toronto, Canada, October 15–19, 2018. (Forthcoming)
keywords:
mobile sensors; web crawls; browser fingerprinting; javascript
published:
2022-11-02
This dataset contains the behavioral, metabolic, and capture data which is reported within the manuscript Data for Capture is predicted by behavior and size, not metabolism, in Muskellunge
published:
2022-02-07
Karakoc, Deniz Berfin; Wang, Junren; Konar, Megan
(2022)
This dataset provides estimates of agricultural and food commodity flows [kg] between all county pairs within the United States for the years 2007, 2012, and 2017. The database provides 206.3 million data points, since pairwise information is provided between 3134 counties, for 7 commodity categories, and 3 time periods. The commodity categories correspond to the Standardized Classification of Transported Goods and are:
- SCTG 1: Iive animals and fish
- SCTG 2: cereal grains
- SCTG 3: agricultural products (except for animal feed, cereal grains, and forage products)
- SCTG 4: animal feed, eggs, honey, and other products of animal origin
- SCTG 5: meat, poultry, fish, seafood, and their preparations
- SCTG 6: milled grain products and preparations, and bakery products
- SCTG 7: other prepared foodstuffs, fats and oils
For additional information, please see the related paper by Karakoc et al. (2022) in Environmental Research Letters.
keywords:
food flows; high-resolution; county-scale; time-series; United States
published:
2021-08-27
The dataset shows all poison frogs (superfamily Dendrobatoidea) in private U.S. collections during 1990–2020. For each species and color morph, there is a date of arrival, the way it arrived in U.S. collections, and detailed notes related to its presence in the pet trade.
keywords:
pet trade; amphibians; Dendrobatidae