Illinois Data Bank Dataset Search Results
Results
published:
2022-07-25
A set of species entity mentions derived from an NERC dataset analyzing 900 synthetic biology articles published by the ACS. This data is associated with the Synthetic Biology Knowledge System repository (https://web.synbioks.org/). The data in this dataset are raw mentions from the NERC data.
keywords:
synthetic biology; NERC data; species mentions
published:
2025-11-07
Ahmed, Md Wadud; Esquerre, Carlos A.; Eilts, Kristen; Allen, Dylan P.; McCoy, Scott M.; Varela, Sebastian; Singh, Vijay; Leakey, Andrew; Kamruzzaman, Mohammad
(2025)
Compositional characterization of biomass is vital for the biofuel industry. Traditional wet chemistry-based methods for analyzing biomass composition are laborious, time-consuming, and require extensive use of chemical reagents as well as highly skilled personnel. In this study, near-infrared (NIR) spectroscopy was used to quickly assess the composition of above-ground vegetative biomass from 113 diverse, photoperiod-sensitive, biomass-type sorghum (Sorghum bicolor) accessions cultivated under field conditions in Central Illinois. Biomass samples were analyzed using NIR spectra collected in the spectral range of 867–2536 nm, with their chemical compositions determined following the National Renewable Energy Laboratory (NREL) protocol. Advanced spectral pre-treatment and band selection techniques were utilized to develop calibration models using partial least squares regression (PLSR). The models’ effectiveness was assessed through cross-validation and independent data tests. The predictions for moisture, ash, extractives, glucan, xylan, acid-soluble lignin (ASL), acid-insoluble lignin (AIL), and total lignin were accurate and reliable, demonstrating the capability of NIR spectroscopy to provide rapid and precise characterization of sorghum biomass. The results demonstrated that NIR spectroscopy is an efficient tool for rapidly characterizing sorghum biomass, making it a sustainable option for screening desirable feedstock for biofuel or bioproduct production.
keywords:
Conversion;Feedstock Production;Biomass Analytics;Modeling
published:
2023-07-20
Atallah, Shady; Huang, Ju-Chin; Leahy, Jessica; Bennett, Karen P.
(2023)
This is a dataset from a choice experiment survey on family forest landowner preferences for managing invasive species.
keywords:
ecosystem services, forests, invasive species control, neighborhood effect
published:
2025-11-10
Banerjee, Shivali; Eilts, Kristen; Singh, Vijay
(2025)
Oilcane is an engineered sugarcane with the ability to hyper-accumulate vegetative lipids. It is processed to obtain juice and bagasse as a potential substrate for the production of biofuels and biochemicals. The juice comprises solid particles that are separated as waste mud before the fermentation of the juice. In this study, the oilcane waste mud (OWM) generated from 1000 liters of oilcane juice was quantified and evaluated as a potential resource for recovering biobased waxes. Hexane and ethyl acetate were evaluated as two different solvents for extracting waxes from OWM followed by its purification using acetone. The extracted biobased wax samples were characterized for their chemical and thermal profiles which were then compared with commercial natural waxes. Detailed mass balance shows that 53.6 ± 2.6 kg (dry basis) of solid OWM gets generated upon processing 1000 L (~1068 kg) of oilcane juice. Hexane and ethyl acetate led to a crude wax yield of 25.6 ± 0.2% and 16.6 ± 0.4% (wt/wt, dry basis) respectively from OWM at the end of 8 h. The relative purification of the wax samples was reported in the range of 58%–65% (wt/wt). The purified OWM wax has a melting point of 74.7°C. The waste mud was valorized as a source of biobased waxes with characteristic chemical and thermal profiles comparable to commercial natural waxes (carnauba and beeswax). Considering the decline in the supply of petroleum wax in the future coupled with the switch to “greener” alternative products by consumers, OWM could be a valuable source of natural wax in the industrial sector reducing the dependence on petroleum waxes. Eventually, recovering biobased wax as a co-product from OWM would bring in an additional stream of revenue leading to the development of a zero-waste biorefinery based on bioenergy crops.
keywords:
Conversion;Biomass Analytics;Feedstock Bioprocessing;Hydrolysate
published:
2021-12-28
Xia, Yushu; Wander, Michelle
(2021)
*Updates for this V3: added a few more records and rearranged the sequence of the tables in order to support our new paper "Evaluation of Indirect and Direct Scoring Methods to Relate Biochemical Soil Quality Indicators to Ecosystem Services" accepted by the Soil Science Society of America Journal.
We summarize peer reviewed literature reporting associations between for three soil quality indicators (SQIs) (β-glucosidase (BG), fluorescein diacetate (FDA) hydrolysis, and permanganate oxidizable carbon (POXC)) and crop yield and greenhouse gas emissions. Peer-reviewed articles published between January of 1990 and May 2018 were searched using the Thomas Reuters Web of Science database (Thomas Reuters, Philadelphia, Pennsylvania) and Google Scholar to identify studies reporting results for: “β-glucosidase”, “permanganate oxidizable carbon”, “active carbon”, “readily oxidizable carbon”, or “fluorescein diacetate hydrolysis”, together with one or more of the following: “crop yield”, “productivity”, “greenhouse gas’, “CO2”, “CH4”, or “N2O”.
Meta-data for records include the following descriptor variables and covariates useful for scoring function development: 1) identifying factors for the study site (location, duration of the experiment), 2) soil textural class, pH, and SOC, 3) depth of soil sampling, 4) units used in published works (i.e.: equivalent mass, concentration), 5) SQI abundances and measured ecosystem functions, and 6) summary statistics for correlation between SQIs and functions (yield and greenhouse gas emissions).
*Note: Blank values in tables are considered unreported data.
keywords:
Soil health promoting practices; Soil quality indicators; β-glucosidase; fluorescein diacetate hydrolysis; Permanganate oxidizable carbon; Greenhouse gas emissions; Scoring curves; Soil Management Assessment Framework
published:
2025-01-06
Shilikbay, Temirlan; Nawaz, Aatiqa; Doon, Megan; Ceman, Stephanie
(2025)
The complete data for the publication "RNA helicase MOV10 suppresses fear memory and dendritic arborization and regulates microtubule dynamics in hippocampal neurons," excluding sequencing data deposited in GEO, is provided here.
keywords:
MOV10; NUMA1; hippocampal neurons; behavior; cytoskeleton; tiff; czi; dv; mp4; mpg; ndpi; csv; xlsx; R
published:
2025-09-08
Hudson, Matthew; Zhao, Huimin; Sweedler, Jonathan; Shanklin, John; Cahoon, Edgar; Root, Mike; Burgess, Steven; Park, Kiyoul; Zhou, Shuaizhen; Blanford, Jantana; Lane, Stephan; Croslow, Seth; Dong, Jia
(2025)
Plant bioengineering is a time-consuming and labor-intensive process with no guarantee of achieving desired traits. Here, we present a fast, automated, scalable, high-throughput pipeline for plant bioengineering (FAST-PB) in maize (Zea mays) and Nicotiana benthamiana. FAST-PB enables genome editing and product characterization by integrating automated biofoundry engineering of callus and protoplast cells with single-cell matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS). We first demonstrated that FAST-PB could streamline Golden Gate cloning, with the capacity to construct 96 vectors in parallel. Using FAST-PB in protoplasts, we found that PEG2050 increased transfection efficiency by over 45%. For proof-of-concept, we established a reporter-gene-free method for CRISPR editing and phenotyping via mutation of high chlorophyll fluorescence 136. We show that diverse lipids were enhanced up to 6-fold using CRISPR activation of lipid controlling genes. In callus cells, an automated transformation platform was employed to regenerate plants with enhanced lipid traits through introducing multigene cassettes. Lastly, FAST-PB enabled high-throughput single-cell lipid profiling by integrating MALDI-MS with the biofoundry, protoplast, and callus cells, differentiating engineered and unengineered cells using single-cell lipidomics. These innovations massively increase the throughput of synthetic biology, genome editing, and metabolic engineering and change what is possible using single-cell metabolomics in plants.
keywords:
AI/ML; genome engineering; metabolic engineering; phenotyping
published:
2024-04-10
Konar, Megan; Ruess, Paul J.; Wanders, Niko; Bierkens, Marc F.P.
(2024)
This dataset provides estimates of total Irrigation Water Use (IWU) by crop, county, water source, and year for the Continental United States. Total irrigation from Surface Water Withdrawals (SWW), total Groundwater Withdrawals (GWW), and nonrenewable Groundwater Depletion (GWD) is provided for 20 crops and crop groups from 2008 to 2020 at the county spatial resolution.
In total, there are nearly 2.5 million data points in this dataset (3,142 counties; 13 years; 3 water sources; and 20 crops). This dataset supports the paper by Ruess et al (2024) "Total irrigation by crop in the Continental United States from 2008 to 2020", Scientific Data, doi: 10.1038/s41597-024-03244-w
When using, please cite as:
Ruess, P.J., Konar, M., Wanders, N., and Bierkens, M.F.P. (2024) Total irrigation by crop in the Continental United States from 2008 to 2020, Scientific Data, doi: 10.1038/s41597-024-03244-w
keywords:
water use; irrigation; surface water; groundwater; groundwater depletion; counties; crops; time series
published:
2024-10-31
Liu, Shanshan; Vlachokostas, Alex; Kontou, Eleftheria
(2024)
School buses transport 20 million students annually and are currently undergoing electrification in the US. With Vehicle-to-Building (V2B) technology, electric school buses (ESBs) can supply energy to school buildings during power outages, ensuring continued operation and safety. This study proposes assessing the resilience of secondary schools during outages by leveraging ESB fleets as backup power across various US climate regions. The findings indicate that the current fleet of ESBs in representative cities across different climate regions in the US is insufficient to meet the power demands of an entire school or even its HVAC system. However, we estimated the number of ESBs required to support the school's power needs, and we showed that the use of V2B technology significantly reduces carbon emissions compared to backup diesel generators. While adjusting HVAC setpoints and installing solar panels have limited impacts on enhancing school resilience, gathering students in classrooms during outages significantly improved resilience in our case study in Houston, Texas. Given the ongoing electrification of school buses, it is essential for schools to complement ESBs with stationary batteries and other backup power sources, such as solar and/or diesel generators, to effectively address prolonged outages. Determining the deployment of direct current fast and Level 2 chargers can reduce infrastructure costs while maintaining the resilience benefits of ESBs. This dataset includes the simulation process and results of this study.
keywords:
Electric school bus; Power outages,;Vehicle-to-Building technology; Carbon emission reduction; Backup power source
published:
2025-11-19
Xu, Hao; Shi, Longyuan; Boob, Aashutosh; Park, Wooyoung; Tan, Shih-I; Tran, Vinh; Schultz, J. Carl; Zhao, Huimin
(2025)
Rhodotorula toruloides is a non-model, oleaginous yeast uniquely suited to produce acetyl-CoA-derived chemicals. However, the lack of well-characterized genomic integration sites has impeded the metabolic engineering of this organism. Here we report a set of computationally predicted and experimentally validated chromosomal integration sites in R. toruloides. We first implemented an in silico platform by integrating essential gene information and transcriptomic data to identify candidate sites that meet stringent criteria. We then conducted a full experimental characterization of these sites, assessing integration efficiency, gene expression levels, impact on cell growth, and long-term expression stability. Among the identified sites, 12 exhibited integration efficiencies of 50% or higher, making them sufficient for most metabolic engineering applications. Using selected high-efficiency sites, we achieved simultaneous double and triple integrations and efficiently integrated long functional pathways (up to 14.7 kb). Additionally, we developed a new inducible marker recycling system that allows multiple rounds of integration at our characterized sites. We validated this system by performing five sequential rounds of GFP integration and three sequential rounds of MaFAR integration for fatty alcohol production, demonstrating, for the first time, precise gene copy number tuning in R. toruloides. These characterized integration sites should significantly advance metabolic engineering efforts and future genetic tool development in R. toruloides.
keywords:
Conversion;Metabolic Engineering;Software;Transcriptomics
published:
2022-04-21
This dataset was created based on the publicly available microdata from PNS-2019, a national health survey conducted by the Instituto Brasileiro de Geografia e Estatistica (IBGE, Brazilian Institute of Geography and Statistics). IBGE is a federal agency responsible for the official collection of statistical information in Brazil – essentially, the Brazilian census bureau. Data on selected variables focusing on biopsychosocial domains related to pain prevalence, limitations and treatment are available. The Fundação Instituto Oswaldo Cruz has detailed information about the PNS, including questionnaires, survey design, and datasets (www.pns.fiocruz.br). The microdata can be found on the IBGE website (https://www.ibge.gov.br/estatisticas/downloads-estatisticas.html?caminho=PNS/2019/Microdados/Dados).
keywords:
back pain; health status disparities; biopsychosocial; Brazil
published:
2025-11-19
Petersen, Bryan; Emran, Shah-Al; Miguez, Fernando; Heaton, Emily; VanLoocke, Andy
(2025)
Various works have quantitatively characterized the effects of environmental and management factors on Miscanthus x giganteus Greef et Deu (mxg) yield and, therefore, anticipated land requirement per unit production. However, little work has addressed the effects of cutting height, which may significantly contribute to the difference between the standing aboveground biomass at harvest (i.e., biological yield) and harvested yield. This study quantitatively characterized the effect of cutting height using a replicated nitrogen trial of a 5-year-old mxg stand in southeast Iowa and related this information to observations of cutting height in nearby commercial fields. Nitrogen fertilizer did not significantly change the relationship of the stem segment mass to length, and overall, a 1-cm stem segment contributes 0.5% of the total stem biomass within the bottom 44 cm of the stem. This results in an average harvest loss of 15% of the aboveground standing biomass when cutting at 30 cm, typically seen in commercial mxg fields in eastern Iowa. Cutting height should be considered when accurately predicting commercial mxg harvest yields and changes in soil organic carbon in a commercial mxg agroecosystem.
keywords:
Feedstock Production;Sustainability;Biomass Analytics;Miscanthus;Modeling
published:
2025-08-13
Tang, Wenhan; Arabas, Sylwester; Curtis, Jeffrey H.; Knopf, Daniel A.; West, Matthew; Riemer, Nicole
(2025)
This dataset contains the values directly shown in the figures of the article "The impact of aerosol mixing state on immersion freezing: Insights from classical nucleation theory and particle-resolved simulations". This article is in preparation for submission to the journal Atmospheric Chemistry and Physics. The dataset consists of 15 NetCDF files processed from the raw output of the PartMC model. It does not include the theoretical values of frozen fraction, which can be computed using the equations provided in the paper.
keywords:
Aerosol mixing state; Ice nucleating particles; Classical nucleation theory
published:
2025-10-27
Deshavath, Narendra Naik; Dien, Bruce; Slininger, Patricia J.; Jin, Yong-Su; Singh, Vijay
(2025)
A wide range of inorganic and organic chemicals are used during the pretreatment and enzymatic hydrolysis of lignocellulosic biomass to produce biofuels. Developing an industrially relevant 2G biorefinery process using such chemicals is challenging and requires more unit operations for downstream processing. A sustainable process has been developed to achieve industrially relevant titers of bioethanol with significant ethanol yield. The pretreatment of sorghum biomass was performed by a continuous pilot-scale hydrothermal reactor followed by disk milling. Enzymatic hydrolysis was performed without washing the pretreated biomass. Moreover, citrate buffer strength was reduced to 100-fold (50 mM to 0.5 mM) during the enzymatic hydrolysis. Enzymatic hydrolysis at 0.5 mM citrate buffer strength showed that significant sugar concentrations of 222 ± 2.3 to 241 ± 2.3 g/L (glucose + xylose) were attained at higher solids loadings of 50 to 60% (w/v). Furthermore, hydrolysates were fermented to produce bioethanol using two different xylose-fermenting Saccharomyces cerevisiae strains and a co-culture of xylose-fermenting and non-GMO yeast cultures. Bioethanol titer of 81.7 g/L was achieved with an ethanol yield of 0.48 gp/gs. Additionally, lipids were produced using the oleaginous yeast Rhodosporidium toruloides, yielding 13.2 g/L lipids with cellular lipid accumulation of 38.5% w/w from 100 g/L of sugar concentration. In summary, reducing the strength of the citrate buffer during enzymatic hydrolysis and omitting inorganic chemicals from the pretreatment process enhances the fermentability of hydrolysates and can also reduce operating costs.
keywords:
Conversion;Hydrolysate;Lipidomics
published:
2024-07-28
Xing, Yuqing; Bae, Seokjin; Madhavan, Vidya
(2024)
This is a set of topographies to study the magnetic field response of RbV3Sb5 (related to Fig.4 of https://www.nature.com/articles/s41586-024-07519-5)
published:
2025-06-04
These datasets contain the complete output from a Monte Carlo simulation of the number of wild cervids to test for chronic wasting disease (CWD) depending on true prevalence. Five CSVs of the simulation results are provided, split due to limitations in file size. The R code used to run the simulation and process the data is included. The data to replicated Table 1 and the data used to compare the simulation results to the CWD surveillance efforts of the Illinois Department of Natural Resources (IDNR) are also provided.
keywords:
chronic wasting disease; cwd; cervid; test; sample size; diagnostic testing; surveillance
published:
2019-06-13
Rezapour, Rezvaneh; Diesner, Jana
(2019)
This lexicon is the expanded/enhanced version of the Moral Foundation Dictionary created by Graham and colleagues (Graham et al., 2013).
Our Enhanced Morality Lexicon (EML) contains a list of 4,636 morality related words.
This lexicon was used in the following paper - please cite this paper if you use this resource in your work.
Rezapour, R., Shah, S., & Diesner, J. (2019). Enhancing the measurement of social effects by capturing morality. Proceedings of the 10th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA). Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Minneapolis, MN.
In addition, please consider citing the original MFD paper:
<a href="https://doi.org/10.1016/B978-0-12-407236-7.00002-4">Graham, J., Haidt, J., Koleva, S., Motyl, M., Iyer, R., Wojcik, S. P., & Ditto, P. H. (2013). Moral foundations theory: The pragmatic validity of moral pluralism. In Advances in experimental social psychology (Vol. 47, pp. 55-130)</a>.
keywords:
lexicon; morality
published:
2022-07-25
This dataset is derived from the raw dataset (https://doi.org/10.13012/B2IDB-4163883_V1) and collects entity mentions that were manually determined to be noisy, non-chemical entities.
keywords:
synthetic biology; NERC data; chemical mentions, noisy entities
published:
2025-09-12
Dong, Hongxu; Clark, Lindsay; Lipka, Alexander; Brummer, Joe E.; Głowacka, Katarzyna; Hall, Megan C.; Heo, Kweon; Jin, Xiaoli; Peng, Junhua; Yamada, Toshihiko; Ghimire, Bimal Kumar; Yoo, Ji Hye; Yu, Chang Yeon; Zhao, Hua; Long, Stephen; Sacks, Erik
(2025)
Overwintering ability is an important selection criterion for Miscanthus breeding in temperate regions. Insufficient overwintering ability of the currently leading Miscanthus biomass cultivar, M. ×giganteus (M×g) ‘1993–1780′, in regions where average annual minimum temperatures are −26.1°C (USDA hardiness zone 5) or lower poses a pressing need to develop new cultivars with superior cold tolerance. To facilitate breeding of Miscanthus, this study characterized phenotypic and genetic variation of overwintering ability in an M. sinensis germplasm panel consisting of 564 accessions, evaluated in field trials at three locations in North America and two in Asia. Genome‐wide association (GWA) and genomic prediction analyses were performed. The Korea/N China M. sinensis genetic group is a valuable gene pool for cold tolerance. The Yangtze‐Qinling, Southern Japan, and Northern Japan genetic groups were also potential sources of cold tolerance. A total of 73 marker–trait associations were detected for overwintering ability. Estimated breeding value for overwintering ability based on these 73 markers could explain 55% of the variation for first winter overwintering ability among M. sinensis. Average genomic prediction ability for overwintering ability across 50 fivefold cross‐validations was high (~0.73) after accounting for population structure. Common genomic regions for overwintering ability were detected by GWA analyses and a previous parallel QTL mapping study using three interconnected biparental F1 populations. One QTL on Miscanthus LG 8 encompassed five GWA hits and a known cold‐responsive gene, COR47. The other overwintering ability QTL on Miscanthus LG 11 contained two GWA hits and three known cold stress‐related genes, carboxylesterase 13 (CEX13), WRKY2 transcription factor, and cold shock domain (CSDP1). Miscanthus accessions collected from high latitude locations with cold winters had higher rates of overwintering, and more alleles for overwintering, than accessions collected from southern locations with mild winters.
keywords:
Feedstock Production;Biomass Analytics;Genomics
published:
2021-11-03
Liu, Baqiao; Warnow, Tandy
(2021)
This dataset contains re-estimated gene trees from the ASTRAL-II [1] simulated datasets. The re-estimated variants of the datasets are called MC6H and MC11H -- they are derived from the MC6 and MC11 conditions from the original data (the MC6 and MC11 names are given by ASTRID [2]). The uploaded files contain the sequence alignments (half-length their original alignments), and the re-estimated species trees using FastTree2.
Note:
- "mc6h.tar.gz" and "mc11h.tar.gz" contain the sequence alignments and the re-estimated gene trees for the two conditions
- the sequence alignments are in the format "all-genes.phylip.splitted.[i].half" where i means that this alignment is for the i-th alignment of the original dataset, but truncating the alignment halving its length
- "g1000.trees" under each replicate contains the newline-separated re-estimated gene trees. The gene trees were estimated from the above described alignments using FastTree2 (version 2.1.11) command "FastTree -nt -gtr"
[1]: Mirarab, S., & Warnow, T. (2015). ASTRAL-II: coalescent-based species tree estimation with many hundreds of taxa and thousands of genes. Bioinformatics, 31(12), i44-i52.
[2]: Vachaspati, P., & Warnow, T. (2015). ASTRID: accurate species trees from internode distances. BMC genomics, 16(10), 1-13.
keywords:
simulated data; ASTRAL; alignments; gene trees
published:
2022-05-16
Clem, Scott; Hobson, Keith; Harmon-Threatt, Alexandra
(2022)
This dataset is for the publication "Do Nearctic hover flies (Diptera: Syrphidae) engage in long-distance migration? An assessment of evidence and mechanisms." It consists of 11 Excel spreadsheets and 4 R scripts which correspond to the analyses which were conducted.
Paper abstract:
Long-distance insect migration is poorly understood despite its tremendous ecological and economic importance. As a group, Nearctic hover flies (Diptera: Syrphidae: Syrphinae), which are crucial pollinators as adults and biological control agents as larvae, are almost entirely unrecognized as migratory despite examples of highly migratory behavior among several Palearctic species. Here, we examined evidence and mechanisms of migration for four hover fly species (Allograpta obliqua, Eupeodes americanus, Syrphus rectus, and Syrphus ribesii) common throughout eastern North America using stable hydrogen isotope (δ2H) measurements of chitinous tissue, morphological assessments, abundance estimations, and cold-tolerance assays. While further studies are needed, non-local isotopic values obtained from hover fly specimens collected in central Illinois support the existence of long-distance fall migratory behavior in Eu. americanus, and to a lesser extent S. ribesii and S. rectus. Elevated abundance of Eu. americanus during the expected autumn migratory period further supports the existence of such behavior. Moreover, high phenotypic plasticity of morphology associated with dispersal coupled with significant differences between local and non-local specimens suggest that Eu. americanus exhibits a unique suite of morphological traits that decrease costs associated with long-distance flight. Finally, compared to the ostensibly non-migratory A. obliqua, Eu. americanus was less cold tolerant, a factor that may be associated with migratory behavior. Collectively, our findings imply that fall migration occurs in Nearctic hover flies, but we consider methodological limitations of our study in addition to potential ecological and economic consequences of these novel findings.
keywords:
Insect migration; hover fly; Syrphidae; stable isotopes; deuterium; morphometrics; cold tolerance
published:
2020-06-12
Fu, Yuanxi; Hsiao, Tzu-Kun
(2020)
This is a network of 14 systematic reviews on the salt controversy and their included studies. Each edge in the network represents an inclusion from one systematic review to an article. Systematic reviews were collected from Trinquart (Trinquart, L., Johns, D. M., & Galea, S. (2016). Why do we think we know what we know? A metaknowledge analysis of the salt controversy. International Journal of Epidemiology, 45(1), 251–260. https://doi.org/10.1093/ije/dyv184 ).
<b>FILE FORMATS</b>
1) Article_list.csv - Unicode CSV
2) Article_attr.csv - Unicode CSV
3) inclusion_net_edges.csv - Unicode CSV
4) potential_inclusion_link.csv - Unicode CSV
5) systematic_review_inclusion_criteria.csv - Unicode CSV
6) Supplementary Reference List.pdf - PDF
<b>ROW EXPLANATIONS</b>
1) Article_list.csv - Each row describes a systematic review or included article.
2) Article_attr.csv - Each row is the attributes of a systematic review/included article.
3) inclusion_net_edges.csv - Each row represents an inclusion from a systematic review to an article.
4) potential_inclusion_link.csv - Each row shows the available evidence base of a systematic review.
5) systematic_review_inclusion_criteria.csv - Each row is the inclusion criteria of a systematic review.
6) Supplementary Reference List.pdf - Each item is a bibliographic record of a systematic review/included paper.
<b>COLUMN HEADER EXPLANATIONS</b>
<b>1) Article_list.csv:</b>
ID - Numeric ID of a paper
paper assigned ID - ID of the paper from Trinquart et al. (2016)
Type - Systematic review / primary study report
Study Groupings - Groupings for related primary study reports from the same report, from Trinquart et al. (2016) (if applicable, otherwise blank)
Title - Title of the paper
year - Publication year of the paper
Attitude - Scientific opinion about the salt controversy from Trinquart et al. (2016)
Doi - DOIs of the paper. (if applicable, otherwise blank)
Retracted (Y/N) - Whether the paper was retracted or withdrawn (Y). Blank if not retracted or withdrawn.
<b>2) Article_attr.csv:</b>
ID - Numeric ID of a paper
year - Publication year
Attitude - Scientific opinion about the salt controversy from Trinquart et al. (2016)
Type - Systematic review/ primary study report
<b>3) inclusion_net_edges.csv:</b>
citing_ID - The numeric ID of a systematic review
cited_ID - The numeric ID of the included articles
<b>4) potential_inclusion_link.csv:</b>
This data was translated from the Sankey diagram given in Trinquart et al. (2016) as Web Figure 4. Each row indicates a systematic review and each column indicates a primary study. In the matrix, "p" indicates that a given primary study had been published as of the search date of a given systematic review.
<b>5)systematic_review_inclusion_criteria.csv:</b>
ID - The numeric IDs of systematic reviews
paper assigned ID - ID of the paper from Trinquart et al. (2016)
attitude - Its scientific opinion about the salt controversy from Trinquart et al. (2016)
No. of studies included - Number of articles included in the systematic review
Study design - Study designs to include, per inclusion criteria
population - Populations to include, per inclusion criteria
Exposure/Intervention - Exposures/Interventions to include, per inclusion criteria
outcome - Study outcomes required for inclusion, per inclusion criteria
Language restriction - Report languages to include, per inclusion criteria
follow-up period - Follow-up period required for inclusion, per inclusion criteria
keywords:
systematic reviews; evidence synthesis; network visualization; tertiary studies
published:
2023-07-27
Feng, Ling; Takiya, Daniela; Krishnankutty, Sindhu; Dietrich, Christopher; Zhang, Yalin
(2023)
The text file contains the original aligned DNA nucleotide sequence data used in the phylogenetic analyses of Feng et al. (in review), comprising the 3 protein-coding genes (histone H3, cytochrome oxidase I and 2) and 2 ribosomal genes (28S D8 and 16S). The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The first six lines of the file identify the file as NEXUS, indicate that the file contains data for 257 taxa (species) and 2995 characters (nucleotide positions), indicate that the characters are DNA sequence, that gaps inserted into the DNA sequence alignment are indicated by a dash, and that missing data are indicated by a question mark. The remainder of the file contains the aligned nucleotide sequence data for the five genes. Data partitions, representing the individual genes and different codon positions of the protein-coding genes, are indicated by the lines beginning "charset" near the end of the file. Two supplementary tables in the provided PDF file provide additional information on the species in the dataset, including the GenBank accession numbers for the sequence data (Table S1) and the DNA substitution models used for each of the data partitions used for analyses in the phylogenetic analysis program IQ-Tree (version 1.6.8) (Table S3), as described in the Methods section of the paper. The supplemental tables will also be linked to the article upon publication at the journal website.
keywords:
Insect; leafhopper; dispersal; vicariance; evolution
published:
2022-08-31
Chen, Wenxiang; Zhan, Xun; Yuan, Renliang; Pidaparthy, Saran; Yong, Adrian Xiao Bin; An, Hyosung; Tang, Zhichu; Yin, Kaijun; Patra, Arghya; Jeong, Heonjae; Zhang, Cheng; Ta, Kim; Riedel, Zachary; Stephens, Ryan; Shoemaker, Daniel; Yang, Hong; Gewirth, Andrew; Braun, Paul; Ertekin, Elif; Zuo, Jian-Min; Chen, Qian
(2022)
These datasets are for the four-dimensional scanning transmission electron microscopy (4D-STEM) and electron energy loss spectroscopy (EELS) experiments for cathode nanoparticles at different cutoff voltages and in different electrolytes. The raw 4D-STEM experiment datasets were collected by TEM image & analysis software (FEI) and were saved as SER files. The raw 4D-STEM datasets of SER files can be opened and viewed in MATLAB using our analysis software package of imToolBox available at <a href="https://github.com/flysteven/imToolBox">https://github.com/flysteven/imToolBox</a>. The raw EELS datasets were collected by DigitalMicrograph software and were saved as DM4 files. The raw EELS datasets can be opened and viewed in DigitalMicrograph software or using our analysis codes available at <a href="https://github.com/chenlabUIUC/OrientedPhaseDomain">https://github.com/chenlabUIUC/OrientedPhaseDomain</a>. All the datasets are from the work "Formation and impact of nanoscopic oriented phase domains in electrochemical crystalline electrodes" (2022).
The 4D-STEM experiment data include four example datasets for cathode nanoparticles collected at different cutoff voltages and in different electrolytes as described below. Each dataset contains a stack of diffraction patterns collected at different probe positions scanned across the cathode nanoparticle.
1. Pristine cathode particle: "Pristine particle 4D-STEM.ser"
2. Cathode particle at the cutoff voltage of 0.09V during discharge at C/10 in the aqueous electrolyte: "Intermediate cutoff0_09V discharge (aqueous) 4D-STEM.ser"
3. Fully discharged cathode particle at C/10 in the aqueous electrolyte: "Fully discharged particle 4D-STEM.ser"
4. Fully discharged cathode particle at C/10 in the dry organic electrolyte: "Fully discharge particle (dry organic electrolyte).ser"
The EELS experiment data includes three example datasets for cathode nanoparticles collected at different cutoff voltages during discharge in the aqueous electrolyte (in "EELS datasets.zip") as described below. Each EELS dataset contains the zero-loss and core-loss EELS spectra collected at different probe positions scanned across the cathode nanoparticle.
1. Pristine cathode particle: "Pristine particle EELS.zip"
2. Cathode particle at the cutoff voltage of 0.09V during discharge at C/10 in the aqueous electrolyte: "intermediate discharge (aqueous) EELS.zip"
3. Fully discharged cathode particle at C/10 in the aqueous electrolyte: "fully discharge (aqueous) EELS.zip"
The details of the software package and codes that can be used to analyze the 4D-STEM datasets and EELS datasets are available at: https://github.com/chenlabUIUC/OrientedPhaseDomain. Once our paper is formally published, we will update the relationship of these datasets with our paper.
keywords:
4D-STEM; microstructure; phase transformation; strain; cathode; nanoparticle; energy storage
published:
2025-09-18
Cao, Mingfeng; Fatma, Zia; Song, Xiaofei; Hsieh, Ping-Hung; Tran, Vinh G.; Lyon, William L.; Sayadi, Maryam; Shao, Zengyi; Yoshikuni, Yasuo; Zhao, Huimin
(2025)
The nonconventional yeast Issatchenkia orientalis can grow under highly acidic conditions and has been explored for production of various organic acids. However, its broader application is hampered by the lack of efficient genetic tools to enable sophisticated metabolic manipulations. We recently constructed an episomal plasmid based on the autonomously replicating sequence (ARS) from Saccharomyces cerevisiae (ScARS) in I. orientalis and developed a CRISPR/Cas9 system for multiplexed gene deletions. Here we report three additional genetic tools including: (1) identification of a 0.8 kb centromere-like (CEN-L) sequence from the I. orientalis genome by using bioinformatics and functional screening; (2) discovery and characterization of a set of constitutive promoters and terminators under different culture conditions by using RNA-Seq analysis and a fluorescent reporter; and (3) development of a rapid and efficient in vivo DNA assembly method in I. orientalis, which exhibited ~100% fidelity when assembling a 7 kb-plasmid from seven DNA fragments ranging from 0.7 kb to 1.7 kb. As proof of concept, we used these genetic tools to rapidly construct a functional xylose utilization pathway in I. orientalis.
keywords:
Conversion;Genome Engineering;Genomics;Transcriptomics