Illinois Data Bank Dataset Search Results
Results
published:
2018-06-18
Clark, Lindsay V.; Jin, Xiaoli; Petersen, Karen K.; Anzoua, Kossanou G.; Bagmet, Larissa; Chebukin, Pavel; Deuter, Martin; Dzyubenko, Elena; Dzyubenko, Nicolay; Heo, Kweon; Johnson, Douglas A.; Jørgensen, Uffe; Kjeldsen, Jens B.; Nagano, Hironori; Peng, Junhua; Sabitov, Andrey; Yamada, Toshihiko; Yoo, Ji Hye; Yu, Chang Yeon; Long, Stephen P.; Sacks, Erik J.
(2018)
This repository contains datasets and R scripts that were used in a study of the population structure of Miscanthus sacchariflorus in its native range across East Asia. Notably, genotypes of 764 individuals at 34,605 SNPs, called from reduced-representation DNA sequencing using a non-reference bioinformatics pipeline, are provided. Two similar SNP datasets, used for identifying clonal duplicates and for determining the ancestry of ornamental and hybrid Miscanthus plants identified in previous studies respectively, are also provided. There is also a spreadsheet listing the provenance and ploidy of all individuals along with their plastid (chloroplast) haplotypes. Software output for Structure, Treemix, and DIYABC is also included. See README.txt for more information about individual files. Results of this study are described in a manuscript in revision in Annals of Botany by the same authors, "Population structure of Miscanthus sacchariflorus reveals two major polyploidization events, tetraploid-mediated unidirectional introgression from diploid Miscanthus sinensis, and diversity centered around the Yellow Sea."
keywords:
Miscanthus; restriction site-associated DNA sequencing (RAD-seq); single nucleotide polymorphism (SNP); population genetics; Miscanthus xgiganteus; Miscanthus sacchariflorus; R scripts; germplasm; plastid haplotype
published:
2025-12-08
Li, Shuai; Moller, Christopher; Mitchell, Noah G.; Martin, Duncan; Sacks, Erik; Saikia, Sampurna; Labonte, Nicholas R.; Baldwin, Brian S.; Morrison, Jesse; Ferguson, John; Leakey, Andrew; Ainsworth, Elizabeth
(2025)
The leaf economics spectrum (LES) describes multivariate correlations in leaf structural, physiological and chemical traits, originally based on diverse C3 species grown under natural ecosystems. However, the specific contribution of C4 species to the global LES is studied less widely. C4 species have a CO2 concentrating mechanism which drives high rates of photosynthesis and improves resource use efficiency, thus potentially pushing them towards the edge of the LES. Here, we measured foliage morphology, structure, photosynthesis, and nutrient content for hundreds of genotypes of the C4 grass Miscanthus × giganteus grown in two common gardens over two seasons. We show substantial trait variations across M. × giganteus genotypes and robust genotypic trait relationships. Compared to the global LES, M. × giganteus genotypes had higher photosynthetic rates, lower stomatal conductance, and less nitrogen content, indicating greater water and photosynthetic nitrogen use efficiency in the C4 species. Additionally, tetraploid genotypes produced thicker leaves with greater leaf mass per area and lower leaf density than triploid genotypes. By expanding the LES relationships across C3 species to include C4 crops, these findings highlight that M. × giganteus occupies the boundary of the global LES and suggest the potential for ploidy to alter LES traits.
keywords:
Feedstock Production;Biomass Analytics;Field Data
published:
2025-09-15
Cheng, Ming-Hsun; Dien, Bruce; Lee, D. K.; Singh, Vijay
(2025)
Chemical-free pretreatments are attracting increased interest because they generate less inhibitor in hydrolysates. In this study, pilot-scaled continuous hydrothermal (PCH) pretreatment followed by disk refining was evaluated and compared to laboratory-scale batch hot water (LHW) pretreatment. Bioenergy sorghum bagasse (BSB) was pretreated at 160-190 °C for 10 min with and without subsequent disk milling. Hydrothermal pretreatment and disk milling synergistically improved glucose and xylose release by 10-20% compared to hydrothermal pretreatment alone. Maximum yields of glucose and xylose of 82.55% and 70.78%, respectively were achieved, when BSB was pretreated at 190 °C and 180 °C followed by disk milling. LHW pretreated BSB had 5-15% higher sugar yields compared to PCH for all pretreatment conditions. The surface area improvement was also performed. PCH pretreatment combined with disk milling increased BSB surface area by 31.80-106.93%, which was greater than observed using LHW pretreatment.
keywords:
Conversion;Sustainability;Genomics;Hydrolysate
published:
2017-08-11
Schiffer, Peter; Le, Brian L.
(2017)
Enclosed in this dataset are transport data of kagome connected artificial spin ice networks composed of permalloy nanowires. The data herein are reproductions of the data seen in Appendix B of the dissertation titled "Magnetotransport of Connected Artificial Spin Ice". Field sweeps with the magnetic field applied in-plane were performed in 5 degree increments for armchair orientation kagome artificial spin ice and zigzag orientation kagome artificial spin ice.
keywords:
Magnetotransport; artificial spin ice; nanowires
published:
2020-09-07
Chen, Luoye; Blanc-Betes, Elena; Hudiburg, Tara; Hellerstein, Daniel; Wallander, Steven; DeLucia, Evan; Khanna, Madhu
(2020)
This dataset contains BEPAM model code and input data to the replicate the results for "Assessing the Returns to Land and Greenhouse Gas Savings from Producing Energy Crops on Conservation Reserve Program Land."
The dataset consists of:
(1) The replication codes and data for the BEPAM model. The code file is named as output_0213-2020_Complete_daycent-agversion-[rental payment level]%_[biomass price].gms. (BEPAM-CRP model-Sep2020.zip)
(2) Simulation results from the BEPAM model (BEPAM_Simulation_Results.csv)
* Item (1) is in GAMS format. Item (2) is in text format.
keywords:
Miscanthus; Switchgrass; soil carbon sequestration; greenhouse gas savings; rental payments; biomass price
published:
2025-09-15
Butler, Nathaniel; Voytas, Daniel; Starker, Colby
(2025)
Recent advancements in monocot transformation, using leaf tissue as explant material, have expanded the number of grass species capable of transgenesis. However, the complexity of vectors and reliance on inducible excision of essential morphogenic regulators have so far limited widespread application. Plant RNA viruses, such as Foxtail Mosaic Virus (FoMV), present a unique opportunity to express morphogenic regulator genes, such as Babyboom (Bbm), Wuschel2 (Wus2), Wuschel-like homeobox protein 2a (Wox2a) and the GROWTH-REGULATING FACTOR 4 (GRF4) GRF-INTERACTING FACTOR 1 (GIF1) fusion protein transiently in leaf explant tissues. Furthermore, altruistic delivery of conventional and viral vectors could provide opportunities to simplify vectors used for leaf transformation—facilitating vector optimization and reducing reliance on morphogenic regulator gene integration. In this study, both viral and conventional T-DNA vectors were tested for their ability to promote the formation of embryonic calli, a critical step in leaf transformation protocols, using Sorghum bicolor leaf explants. Although conventional leaf transformation vectors yielded viable embryonic calli (43.2 ± 2.9%: GRF4-GIF1, 50.2 ± 3%: Bbm/Wus2), altruistic conventional vectors employing the GRF4-GIF1 morphogenic regulator resulted in improved efficiencies (61.3 ± 4.7%). Altruistic delivery was further enhanced with the use of viral vectors employing both GRF4-GIF1 and Bbm/Wus2 regulators, resulting in 75.1 ± 2.3% and 79.2 ± 2.5% embryonic calli formation, respectively. Embryonic calli generated from both conventional and viral vectors produced shoots expressing fluorescent reporters, which were confirmed using molecular analysis. This work provides an important proof-of-concept for the use of both altruistic vectors and viral-expressed morphogenic regulators for improving plant transformation.
keywords:
gene editing; sorghum
published:
2023-04-06
Warnow, Tandy; Park, Minhyuk
(2023)
This is a simulated sequence dataset generated using INDELible and processed via a sequence fragmentation procedure.
keywords:
sequence length heterogeneity;indelible;computational biology;multiple sequence alignment
published:
2022-03-20
Lee, Sangjun; Huang, Edwin W.; Johnson, Thomas A.; Guo, Xuefei; Husain, Ali A.; Mitrano, Matteo; Lu, Kannan; Zakrzewski, Alexander V.; de la Pena, Gilberto A.; Peng, Yingying; Huang, Hai; Lee, Sang-Jun; Jang, Hoyoung; Lee, Jun-Sik; Joe, Young Il; Doriese, William B.; Szypryt, Paul; Swetz, Daniel S.; Chi, Songxue; Aczel, Adam A.; MacDougall, Gregory J.; Kivelson, Steven A. ; Fradkin, Eduardo; Abbamonte, Peter
(2022)
Data for "Generic character of charge and spin density waves in superconducting cuprates".
- Neutron scattering data for SDW
- RSXS scans of CDW of LESCO x=0.10, 0.125, 0.15, 0.17, 0.20 at various temperatures.
- Temperature dependence of CDW peak intensity, correlation length, Qcdw (Lorentzian fit, S(q,T) fit, Landau-Ginzburg fit)
- XAS data of LESCO x=0.10, 0.125, 0.15, 0.17, 0.20
published:
2021-12-09
Burnham, Mark; Simon, Sandra; Lee, DK; Kent, Angela; DeLucia, Evan; Yang, Wendy
(2021)
These data were collected in 2018 and 2019 at the University of Illinois Energy Farm (N 40.063607, W 88.206926). During each growing season, bulk and rhizosphere soil were collected from replicate Sorghum bicolor nitrogen use efficiency trial plots at three separate time points (approximately July 1, August 1, and September 1). We measured soil moisture, pH, soil nitrate and ammonium, potential nitrification, potential denitrification, and extracted and sequenced the V4 region of the 16S rRNA gene for microbial community analysis. All microbial sequence data is archived in the National Center for Biotechnology Information’s (NCBI) Sequence Read Archive (accession number SRP326979, project number PRJNA741261).
keywords:
soil nitrogen; nitrification; nitrogen cycle; sorghum; bioenergy; Center for Advanced Bioenergy and Bioproducts Innovation
published:
2026-01-09
Schultz, J Carl; Cao, Mingfeng; Zhao, Huimin
(2026)
Rhodotorula toruloides has been increasingly explored as a host for bioproduction of lipids, fatty acid derivatives and terpenoids. Various genetic tools have been developed, but neither a centromere nor an autonomously replicating sequence (ARS), both necessary elements for stable episomal plasmid maintenance, has yet been reported. In this study, cleavage under targets and release using nuclease (CUT&RUN), a method used for genome-wide mapping of DNA–protein interactions, was used to identify R. toruloides IFO0880 genomic regions associated with the centromeric histone H3 protein Cse4, a marker of centromeric DNA. Fifteen putative centromeres ranging from 8 to 19 kb in length were identified and analyzed, and four were tested for, but did not show, ARS activity. These centromeric sequences contained below average GC content, corresponded to transcriptional cold spots, were primarily nonrepetitive and shared some vestigial transposon-related sequences but otherwise did not show significant sequence conservation. Future efforts to identify an ARS in this yeast can utilize these centromeric DNA sequences to improve the stability of episomal plasmids derived from putative ARS elements.
keywords:
Genome Engineering; Genomics
published:
2019-07-27
Clark, Lindsay V.; Dwiyanti, Maria Stefanie; Anzoua, Kossonou G.; Brummer, Joe E.; Glowacka, Katarzyna; Hall, Megan; Heo, Kweon; Jin, Xiaoli; Lipka, Alexander E.; Peng, Junhua; Yamada, Toshihiko; Yoo, Ji Hye; Yu, Chang Yeon; Zhao, Hua; Long, Stephen P.; Sacks, Erik J.
(2019)
Genotype calls are provided for a collection of 583 Miscanthus sinensis clones across 1,108,836 loci mapped to version 7 of the Miscanthus sinensis reference genome. Sequence and alignment information for all unique RAD tags is also provided to facilitate cross-referencing to other genomes.
keywords:
variant call format (VCF); sequence alignment/map format (SAM); miscanthus; single nucleotide polymorphism (SNP); restriction site-associated DNA sequencing (RAD-seq); bioenergy; grass
published:
2025-10-10
Yang, Pan; Cai, Ximing; Leibensperger, Carrie; Khanna, Madhu
(2025)
The success of a bioenergy policy relies largely on the wide adoption of perennial energy crops at the farm scale. This study uses survey data to examine potential adoption decisions by farmers in the U.S. Midwest and the causal effects of various direct and indirect influencing factors, especially heterogeneous preferences of farmers. A Bayesian network (BN) model is developed to delineate the causal relationship between farmers adoption decisions and the influencing factors. We find a dominating role of economic factors and a non-negligible impact of non-economic factors, such as the perceived environmental benefits and the extent of familiarity with perennial energy crops. To examine the effect of heterogeneity in farmer preferences, we classify the surveyed farmers into four categories based on their attitudes toward the economic, social, and environmental dimensions of perennial energy crops. We identified statistically significant between-group differences in the responses of the four types of farmers to the various influencing factors. Our findings contribute to disentangling the complicated motivations that will influence perennial energy crop adoption decisions and provide implications for more targeted policy development that need to consider the heterogeneous drivers of farmer decisions about land use.
keywords:
Sustainability;Modeling
published:
2018-12-31
Sixty undergraduate STEM lecture classes were observed across 14 departments at the University of Illinois Urbana-Champaign in 2015 and 2016. We selected the classes to observe using purposive sampling techniques with the objectives of (1) collecting classroom observations that were representative of the STEM courses offered; (2) conducting observations on non-test, typical class days; and (3) comparing these classroom observations using the Class Observation Protocol for Undergraduate STEM (COPUS) to record the presence and frequency of active learning practices utilized by Community of Practice (CoP) and non-CoP instructors.
Decimal values are the result of combined observations. All COPUS codes listed are from Smith (2013) "The Classroom Observation Protocol for Undergraduate STEM (COPUS): A New Instrument to Characterize STEM Classroom Practices" paper.
For more information on the data collection process, see "Evidence that communities of practice are associated with active learning in large STEM lectures" by Tomkin et. al. (2019) in the International Journal of STEM Education.
keywords:
COPUS, Community of Practice
published:
2023-07-05
Njuguna, Joyce; Clark, Lindsay; Lipka, Alexander; Anzoua, Kossonou; Bagmet, Larisa; Chebukin, Pavel; Dwiyanti, Maria; Dzyubenko, Elena; Dzyubenko, Nicolay; Ghimire, Bimal; Jin, Xiaoli; Johnson, Douglas; Kjeldsen, Jens; Nagano, Hironori; Oliveira, Ivone; Peng, Junhua; Petersen, Karen; Sabitov, Andrey; Seong, Eun; Yamada, Toshihiko; Yoo, Ji; Yu, Chang; Zhao, Hu; Munoz, Patricio; Long, Stephen; Sacks, Erik
(2023)
This dataset contains all data used in the paper "Impact of genotype-calling methodologies on genome-wide association and genomic prediction in polyploids". The dataset includes genotypes and phenotypic data from two autotetraploid species Miscanthus sacchariflorus and Vaccinium corymbosum that was used used for genome wide association studies and genomic prediction and the scripts used in the analysis.
In this V2, 2 files have the raw data are added:
"Miscanthus_sacchariflorus_RADSeq.vcf" is the VCF file with the raw SNP calls of the Miscanthus sacchariflorus data used for genotype calling using the 6 genotype calling methods.
"Blueberry_data_read_depths.RData" is the a RData file with the read depth data that was used for genotype calling in the Blueberry dataset.
keywords:
Polyploid; allelic dosage; Bayesian genotype-calling; Genome-wide association; Genomic prediction
published:
2025-09-29
Zhai, Zhiyang; Liu, Hui; Shanklin, John
(2025)
During the transformation of wild-type (WT) Arabidopsis thaliana, a T-DNA containing OLEOSIN-GFP (OLE1-GFP) was inserted by happenstance within the GBSS1 gene, resulting in significant reduction in amylose and increase in leaf oil content in the transgenic line (OG). The synergistic effect on oil accumulation of combining gbss1 with the expression of OLE1-GFP was confirmed by transforming an independent gbss1 mutant (GABI_914G01) with OLE1-GFP. The resulting OLE1-GFP/gbss1 transgenic lines showed higher leaf oil content than the individual OLE1-GFP/WT or single gbss1 mutant lines. Further stacking of the lipogenic factors WRINKLED1, Diacylglycerol O-Acyltransferase (DGAT1), and Cys-OLEOSIN1 (an engineered sesame OLEOSIN1) in OG significantly elevated its oil content in mature leaves to 2.3% of dry weight, which is 15 times higher than that in WT Arabidopsis. Inducible expression of the same lipogenic factors was shown to be an effective strategy for triacylglycerol (TAG) accumulation without incurring growth, development, and yield penalties.
keywords:
Feedstock Production;Biomass Analytics
published:
2011-09-20
Swenson, M. Shel; Suri, Rahul; Linder, C. Randal; Warnow, Tandy; Nguyen, Nam-puhong; Mirarab, Siavash; Neves, Diogo Telmo; Sobral, João Luís; Pingali, Keshav; Nelesen, Serita; Liu, Kevin; Wang, Li-San
(2011)
This page provides the data for SuperFine, DACTAL, and BeeTLe publications.
- Swenson, M. Shel, et al. "SuperFine: fast and accurate supertree estimation." Systematic biology 61.2 (2012): 214.
- Nguyen, Nam, Siavash Mirarab, and Tandy Warnow. "MRL and SuperFine+ MRL: new supertree methods." Algorithms for Molecular Biology 7 (2012): 1-13.
- Neves, Diogo Telmo, et al. "Parallelizing superfine." Proceedings of the 27th Annual ACM Symposium on Applied Computing. 2012.
- Nelesen, Serita, et al. "DACTAL: divide-and-conquer trees (almost) without alignments." Bioinformatics 28.12 (2012): i274-i282.
- Liu, Kevin, and Tandy Warnow. "Treelength optimization for phylogeny estimation." PLoS One 7.3 (2012): e33104.
published:
2025-09-23
Zhao, Huimin; Chen, Li-Qing; Martin, Teresa; Xue, Xueyi; Singh, Nilmani; Tan, Shi-I; Boob, Aashutosh
(2025)
Mitochondria play a key role in energy production and metabolism, making them a promising target for metabolic engineering and disease treatment. However, despite the known influence of passenger proteins on localization efficiency, only a few protein-localization tags have been characterized for mitochondrial targeting. To address this limitation, we leverage a Variational Autoencoder to design novel mitochondrial targeting sequences. In silico analysis reveals that a high fraction of the generated peptides (90.14%) are functional and possess features important for mitochondrial targeting. We characterize artificial peptides in four eukaryotic organisms and, as a proof-of-concept, demonstrate their utility in increasing 3-hydroxypropionic acid titers through pathway compartmentalization and improving 5-aminolevulinate synthase delivery by 1.62-fold and 4.76-fold, respectively. Moreover, we employ latent space interpolation to shed light on the evolutionary origins of dual-targeting sequences. Overall, our work demonstrates the potential of generative artificial intelligence for both fundamental research and practical applications in mitochondrial biology.
keywords:
AI/ML; metabolic engineering; modeling; software
published:
2020-12-15
Khanna, Madhu; Chen, Xiaoguang; Wang, Weiwei; Oliver, Anthony
(2020)
The dataset consists of results and various input data that are used in the GAMS model for the publication "Repeal of the Clean Power Plan: Social Cost and Distributional Implications". All the data are either excel files or in the .inc format which can be read within GAMS or Notepad. Main data sources include: agriculture, transportation and electricity data. Model details can be found in the paper and the GAMS model package.
keywords:
carbon abatement; welfare cost; electricity sector; partial equilibrium model
published:
2025-09-15
Zhao, Yang; Kim, Jae Y.; Karan, Ratna; Jung, Je Hyeong; Pathak, Bhuvan; Williamson, Bruce; Kannan, Baskaran; Wang, Duoduo; Fan, Chunyang; Yu, Wenjin; Dong, Shujie; Srivastava, Vibha; Altpeter, Fredy
(2025)
Sugarcane, a tropical C4 grass in the genus Saccharum (Poaceae), accounts for nearly 80% of sugar produced worldwide and is also an important feedstock for biofuel production. Generating transgenic sugarcane with predictable and stable transgene expression is critical for crop improvement. In this study, we generated a highly expressed single copy locus as landing pad for transgene stacking. Transgenic sugarcane lines with stable integration of a single copy nptII expression cassette flanked by insulators supported higher transgene expression along with reduced line to line variation when compared to single copy events without insulators by NPTII ELISA analysis. Subsequently, the nptII selectable marker gene was efficiently excised from the sugarcane genome by the FLPe/FRT site-specific recombination system to create selectable marker free plants. This study provides valuable resources for future gene stacking using site-specific recombination or genome editing tools.
keywords:
Feedstock Production;Biomass Analytics;Genomics
published:
2019-10-23
Ouldali, Hadjer; Sarthak, Kumar; Ensslen, Tobias; Piguet, Fabien; Manivet, Philippe; Pelta, Juan; Behrends, Jan C.; Aksimentiev, Aleksei; Oukhaled, Abdelghani
(2019)
Raw MD simulation trajectory, input and configuration files, SEM current data, and experimental raw data accompanying the publication, "Electrical recognition of the twenty proteinogenic amino acids using an aerolysin nanopore". README.md contains a description of all associated files.
keywords:
molecular dynamics; protein sequencing; aerolysin; nanopore sequencing
published:
2019-12-03
This is the data set associated with the manuscript titled "Extensive host-switching of avian feather lice following the Cretaceous-Paleogene mass extinction event." Included are the gene alignments used for phylogenetic analyses and the cophylogenetic input files.
keywords:
phylogenomics, cophylogenetics, feather lice, birds
published:
2020-08-31
Chen, Luoye; Khanna, Madhu; Debnath, Deepayan; Zhong, Jia; Ferin, Kelsie; VanLoocke, Andy
(2020)
This dataset contains BEPAM model code and input data to replicate the outcomes for "The Economic and Environmental Costs and Benefits of the Renewable Fuel Standard".
The dataset consists of:
(1) The replication codes and data for the BEPAM model. The code file is named as output.gms. (BEPAM-Social cost model-ERL.zip)
(2) Simulation results from the BEPAM model (BEPAM_Simulation_Results.csv)
* Item (1) is in GAMS format. Item (2) is in text format.
keywords:
Social Cost of Carbon; Social Cost of Nitrogen; Cost-Benefit Analysis; Indirect Land-Use Change