Illinois Data Bank Dataset Search Results
Results
published:
2025-10-17
Cao, Mingfeng; Tran, Vinh G.; Qin, Jiansong; Olson, Andrew; Mishra, Shekhar; Schultz, J. Carl; Huang, Chunshuai; Xie, Dongming; Zhao, Huimin
(2025)
The plant-sourced polyketide triacetic acid lactone (TAL) has been recognized as a promising platform chemical for the biorefinery industry. However, its practical application was rather limited due to low natural abundance and inefficient cell factories for biosynthesis. Here, we report the metabolic engineering of oleaginous yeast Rhodotorula toruloides for TAL overproduction. We first introduced a 2-pyrone synthase gene from Gerbera hybrida (GhPS) into R. toruloides and investigated the effects of different carbon sources on TAL production. We then systematically employed a variety of metabolic engineering strategies to increase the flux of acetyl-CoA by enhancing its biosynthetic pathways and disrupting its competing pathways. We found that overexpression of ATP-citrate lyase (ACL1) improved TAL production by 45% compared to the GhPS overexpressing strain, and additional overexpression of acetyl-CoA carboxylase (ACC1) further increased TAL production by 29%. Finally, we characterized the resulting strain I12-ACL1-ACC1 using fed-batch bioreactor fermentation in glucose or oilcane juice medium with acetate supplementation and achieved a titer of 28 or 23 g/L TAL, respectively. This study demonstrates that R. toruloides is a promising host for the production of TAL and other acetyl-CoA-derived polyketides from low-cost carbon sources.
keywords:
Conversion;Metabolic Engineering
published:
2019-03-25
Clark, Lindsay V.; Dwiyanti, Maria Stefanie; Anzoua, Kossonou G.; Brummer, Joe E.; Ghimire, Bimal Kumar; Głowacka, Katarzyna; Hall, Megan; Heo, Kweon; Jin, Xiaoli; Lipka, Alexander E.; Peng, Junhua; Yamada, Toshihiko; Yoo, Ji Hye; Yu, Chang Yeon; Zhao, Hua; Long, Stephen P.; Sacks, Erik J.
(2019)
This dataset contains genotypic and phenotypic data, R scripts, and the results of analysis pertaining to a multi-location field trial of Miscanthus sinensis. Genome-wide association and genomic prediction were performed for biomass yield and 14 yield-component traits across six field trial locations in Asia and North America, using 46,177 single-nucleotide polymorphism (SNP) markers mined from restriction site-associated DNA sequencing (RAD-seq) and 568 M. sinensis accessions. Genomic regions and candidate genes were identified that can be used for breeding improved varieties of M. sinensis, which in turn will be used to generate new M. xgiganteus clones for biomass.
keywords:
miscanthus; genotyping-by-sequencing (GBS); genome-wide association studies (GWAS); genomic selection
published:
2024-08-16
Halligan, Susannah; Schummer, Michael; Fournier, Auriel; Musni, Vergie; Davis, J. Brian; Downs, Cynthia; Lavretsky, Philip
(2024)
Dataset used for the paper entitled "Morphological differences between wild and game-farm Mallards in North America".
Large-scale releases of domesticated, game-farm Mallards to supplement wild populations have resulted in wide-spread introgressive hybridization that changed the genetic constitution of wild populations in eastern North America. The resulting gene flow is well-documented between game-farm and wild Mallards, but the mechanistic consequences from such interactions remain unknown in North America. We provide the first study to characterize and investigate potential differences in morphology between genetically known, wild and game-farm Mallards in North America. We used nine morphological measurements to discriminate between wild and game-farm Mallards with 96% accuracy. Compared to their wild counterparts, game-farm Mallards had longer bodies and tarsi, shorter heads and wings, and shorter, wider, and taller bills. The nail on the end of the bill of game-farm Mallards was longer, and game-farm Mallard bills had a greater lamellae:bill length ratio than wild Mallards. Differences in body morphologies between wild and game-farm Mallards are consistent with an artificial, terrestrial life whereby game-farm Mallards are fed pelleted foods resulting in artificial selection for a more “goose-like” bill. We posit that 1) game-farm Mallards have diverged from their wild ancestral traits of flying and filter feeding towards becoming optimized to run and peck for food; 2) game-farm morphological traits optimized over the last 400 years in domestic environments are likely to be maladaptive in the wild; and 3) the introgression of such traits into wild populations is likely to reduce fitness. Understanding effects of game-farm Mallard introgression requires analysis of various game-farm × wild hybrid generations to determine how domestically-derived traits persist or diminish with each generation.
keywords:
Mallard; Game Farm; Morphology; Waterfowl; Duck
published:
2025-09-29
Li, Shuai; Moller, Christopher; Mitchell, Noah G.; Lee, DoKyoung; Ainsworth, Elizabeth
(2025)
Elevated tropospheric ozone concentration (O3) significantly reduces photosynthesis and productivity in several C4 crops including maize, switchgrass and sugarcane. However, it is unknown how O3 affects plant growth, development and productivity in sorghum (Sorghum bicolor L.), an emerging C4 bioenergy crop. Here, we investigated the effects of elevated O3 on photosynthesis, biomass and nutrient composition of a number of sorghum genotypes over two seasons in the field using free-air concentration enrichment (FACE), and in growth chambers. We also tested if elevated O3 altered the relationship between stomatal conductance and environmental conditions using two common stomatal conductance models. Sorghum genotypes showed significant variability in plant functional traits, including photosynthetic capacity, leaf N content and specific leaf area, but responded similarly to O3. At the FACE experiment, elevated O3 did not alter net CO2 assimilation (A), stomatal conductance (gs), stomatal sensitivity to the environment, chlorophyll fluorescence and plant biomass, but led to reductions in the maximum carboxylation capacity of phosphoenolpyruvate and increased stomatal limitation to A in both years. These findings suggest that bioenergy sorghum is tolerant to O3 and could be used to enhance biomass productivity in O3 polluted regions.
keywords:
Feedstock Production;Sustainability;Field Data
published:
2025-09-22
Curtis, Amanda; Harpey, Lynsey; Davis, Mark; Larson, Eric
(2025)
Environmental DNA metabarcoding data for fish communities at 50 sites in the Tennessee River watershed of northern Alabama, United States collected in summer 2018 used in the calculation of an Index of Biotic Integrity for biological monitoring.
* New in this V2: In response to peer review at a journal and associated revised statistical analyses, we have added four variables to the file Curtis_etal_IBImetrics.csv and edited the Curtis_etal_readme.txt to explain these variables. The files Curtis_etal_FishDetections.csv and Curtis_etal_FishReadsbySite.csv remain unchanged.
- 4 new variables in Curtis_etal_IBImetrics.csv are: fishIBI_noDELT, MaxHab, Stressor, and Distance.
keywords:
Alabama; biological monitoring; environmental DNA; fish; Index of Biotic Integrity; water quality
published:
2024-01-01
Supplementary data tables for the dissertation "Hybridization dynamics and population genomics of a Manacus hybrid zone." This work focuses on the dynamics of hybridization over time in two species of tropical birds, the golden-collared manakin (Manacus vitellinus) and white-collared manakin (Manacus candei) comparing data from historical museum samples and contemporary wild-caught birds. Table A1 contains the sample metadata for the Manacus Restriction site-associated DNA sequencing dataset used in the dissertation with associated NCBI Biosample Accession numbers, Smithsonian Museum of Natural History number (where applicable), sample IDs, sampling site locations, and sample information of year the sample was taken, age, and sex. Table A6 contains phenotypic measurements of male plumage traits of manakins used in cline analyses to assess hybrid zone movement over time in historical and contemporary datasets, including beard length (mm), epaulet width (mm), tail length (mm), collar color (nm), and belly color (nm). Table A7 contains a summary of male plumage measurements across the hybrid zone. Table C1 contains a list of annotated protein coding genes in candidate regions of interest in Manacus genomes using outlier regions of genomic divergence, linkage disequilibrium, and enrichment of parental private alleles.
keywords:
csv; manacus; manakin; genomics; dissertation
published:
2025-10-27
Jindra, Michael A.; Choe, Kisurb; Chowdhury, Ratul; Kong, Ryan; Ghaffari, Soodabeh; Sweedler, Jonathan; Pfleger, Brian
(2025)
The dominant strategy for tailoring the chain-length distribution of free fatty acids (FFA) synthesized by heterologous hosts is expression of a selective acyl-acyl carrier protein (ACP) thioesterase. However, few of these enzymes can generate a precise (greater than 90% of a desired chain-length) product distribution when expressed in a microbial or plant host. The presence of alternative chain-lengths can complicate purification in situations where blends of fatty acids are not desired. We report the assessment of several strategies for improving the dodecanoyl-ACP thioesterase from the California bay laurel to exhibit more selective production of medium-chain free fatty acids to near exclusivity. We demonstrated that matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-ToF MS) was an effective library screening technique for identification of thioesterase variants with favorable shifts in chain-length specificity. This strategy proved to be a more effective screening technique than several rational approaches discussed herein. With this data, we isolated four thioesterase variants which exhibited a more selective FFA distribution over wildtype when expressed in the fatty acid accumulating E. coli strain, RL08. We then combined mutations from the MALDI isolates to generate BTE-MMD19, a thioesterase variant capable of producing free fatty acids consisting of 90% of C12 products. Of the four mutations which conferred a specificity shift, we noted that three affected the shape of the binding pocket, while one occurred on the positively charged acyl carrier protein landing pad. Finally, we fused the maltose binding protein (MBP) from E. coli to the N – terminus of BTE-MMD19 to improve enzyme solubility and achieve a titer of 1.9 g per L of twelve-carbon fatty acids in a shake flask.
keywords:
Conversion;Genomics
published:
2020-08-01
Xu, Ye; Dietrich, Christopher H.; Zhang, Yalin; Dmitriev, Dmitry; Zhang, Li; Wang, Yi-Mei; Lu, Si-Han; Qin, Dao-Zheng
(2020)
The Empoascini_morph_data.nex text file contains the original data used in the phylogenetic analyses of Xu et al. (Systematic Entomology, in review). The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The first nine lines of the file indicate the file type (Nexus), that 110 taxa were analyzed, that a total of 99 characters were analyzed, the format of the data, and specification for symbols used in the dataset to indicate different character states. For species that have more than one state for a particular character, the states are enclosed in square brackets. Question marks represent missing data.The pdf file, Appendix1.pdf, is available here and describes the morphological characters and character states that were scored in the dataset. The data analyses are described in the cited original paper.
keywords:
Hemiptera; Cicadellidae; morphology; biogeography; evolution
published:
2021-04-19
Xia, Yushu; Wander, Michelle
(2021)
Dataset compiled by Yushu Xia and Michelle Wander for the Soil Health Institute.
Data were recovered from peer reviewed literature reporting results for three soil quality indicators (SQIs) (β-glucosidase (BG), fluorescein diacetate (FDA) hydrolysis, and permanganate oxidizable carbon (POXC)) in terms of their relative response to management where soils under grassland cover, no-tillage, cover crops, residue return and organic amendments were compared to conventionally managed controls. Peer-reviewed articles published between January of 1990 and May 2018 were searched using the Thomas Reuters Web of Science database (Thomas Reuters, Philadelphia, Pennsylvania) and Google Scholar to identify studies reporting results for: “β-glucosidase”, “permanganate oxidizable carbon”, “active carbon”, “readily oxidizable carbon”, and “fluorescein diacetate hydrolysis”, together with one or more of the following: “management practice”, “tillage”, “cover crop”, “residue”, “organic fertilizer”, or “manure”. Records were tabulated to compare SQI abundance in soil maintained under a control and soil aggrading practice with the intent to contribute to SQI databases that will support development of interpretive frameworks and/or algorithms including pedo-transfer functions relating indicator abundance to management practices and site specific factors.
Meta-data include the following key descriptor variables and covariates useful for development of scoring functions: 1) identifying factors for the study site (location, year of initiation of study and year in which data was reported), 2) soil textural class, pH, and SOC, 3) depth and timing of soil sampling, 4) analytical methods for SQI quantification, 5) units used in published works (i.e. equivalent mass, concentration), 6) SQI abundances, and 7) statistical significance of difference comparisons.
*Note: Blank values in tables are considered unreported data.
keywords:
Soil health promoting practices; Soil quality indicators; β-glucosidase; fluorescein diacetate hydrolysis; Permanganate oxidizable carbon; Greenhouse gas emissions; Scoring curves; Soil Management Assessment Framework
published:
2025-05-10
Bakken, George; O'Keefe, Joy
(2025)
This dataset provides instructions for procedures to use heat transfer analyses to estimate thermal conditions in artificial roosts for bats. The dataset contains scripts to employ in the program GNU Octave, example meteorology data, and example text files specifying roost dimensions and material properties.
keywords:
Bat box; design; heat storage; heat transfer analysis; insulation; temperature
published:
2018-05-06
Sukenik, Shahar; Salam, Mohammed; Wang, Yuhan; Gruebele, Martin
(2018)
This deposit contains all raw data and analysis from the paper "In-cell titration of small solutes controls protein stability and aggregation". Data is collected into several types:
1) analysis*.tar.gz are the analysis scripts and the resulting data for each cell. The numbers correspond to the numbers shown in Fig.S1. (in publication)
2) scripts.tar.gz contains helper scripts to create the dataset in bash format.
3) input.tar.gz contains headers and other information that is fed into bash scripts to create the dataset.
4) All rawData*.tar.gz are tarballs of the data of cells in different solutes in .mat files readable by matlab, as follows:
- Each experiment included in the publication is represented by two matlab files: (1) a calibration jump under amber illumination (_calib.mat suffix) (2) a full jump under blue illumination (FRET data)
- Each file contains the following fields:
coordleft - coordinates of cropped and aligned acceptor channel on the original image
coordright - coordinates of cropped and aligned donor channel on the original image]
dataleft - a 3d 12-bit integer matrix containing acceptor channel flourescence for each pixel and time step. Not available in _calib files
dataright - a 3d 12-bit integer matrix containing donor channel flourescence for each pixel and time step. This will be mCherry in _calib files and AcGFP in data files.
frame1 - original image size
imgstd - cropped dimensions
numFrames - number of frames in dataleft and dataright
videos - a structure file containing camera data. Specifically, videos.TimeStamp includes the time from each frame.
keywords:
Live cell; FRET microscopy; osmotic challenge; intracellular titrations; protein dynamics
published:
2025-09-26
Dong, Hongxu; Clark, Lindsay; Jin, Xiaoli; Anzoua, Kossonou; Bagmet, Larisa; Chebukin, Pavel; Dzyubenko, Elena; Dzyubenko, Nicolay; Ghimire, Bimal Kumar; Heo, Kweon; Johnson, Douglas A.; Nagano, Hironori; Sabitov, Andrey; Peng, Junhua; Yamada, Toshihiko; Yoo, Ji Hye; Yu, Chang Yeon; Zhao, Hua; Long, Stephen P.; Sacks, Erik
(2025)
Miscanthus is a close relative of saccharum and a potentially valuable genetic resource for improving sugarcane. Differences in flowering time within and between miscanthus and saccharum hinders intra- and interspecific hybridizations. A series of greenhouse experiments were conducted over three years to determine how to synchronize flowering time of saccharum and miscanthus genotypes. We found that day length was an important factor influencing when miscanthus and saccharum flowered. Sugarcane could be induced to flower in a central Illinois greenhouse using supplemental lighting to reduce the rate at which days shortened during the autumn and winter to 1 min d-1, which allowed us to synchronize the flowering of some sugarcane genotypes with Miscanthus genotypes primarily from low latitudes. In a complementary growth chamber experiment, we evaluated 33 miscanthus genotypes, including 28 M. sinensis, 2 M. floridulus, and 3 M. ×giganteus collected from 20.9° S to 44.9° N for response to three day lengths (10 h, 12.5 h, and 15 h). High latitude-adapted M. sinensis flowered mainly under 15 h days, but unexpectedly, short days resulted in short, stocky plants that did not flower; in some cases, flag leaves developed under short days but heading did not occur. In contrast, for M. sinensis and M. floridulus from low latitudes, shorter day lengths typically resulted in earlier flowering, and for some low latitude genotypes, 15 h days resulted in no flowering. However, the highest ratio of reproductive shoots to total number of culms was typically observed for 12.5 h or 15 h days. Latitude of origin was significantly associated with culm length, and the shorter the days, the stronger the relationship. Nearly all entries achieved maximal culm length under the 15 h treatment, but the nearer to the equator an accession originated, the less of a difference in culm length between the short-day treatments and the 15 h day treatment. Under short days, short culms for high-latitude accessions was achieved by different physiological mechanisms for M. sinensis genetic groups from the mainland in comparison to those from Japan; for mainland accessions, the mechanism was reduced internode length, whereas for Japanese accessions the phyllochron under short days was greater than under long days. Thus, for M. sinensis, short days typically hastened floral induction, consistent with the expectations for a facultative short-day plant. However, for high latitude accessions of M. sinensis, days less than 12.5 h also signaled that plants should prepare for winter by producing many short culms with limited elongation and development; moreover, this response was also epistatic to flowering. Thus, to flower M. sinensis that originates from high latitudes synchronously with sugarcane, the former needs day lengths >12.5 h (perhaps as high as 15 h), whereas that the latter needs day lengths <12.5 h.
keywords:
Feedstock Production;Phenomics
published:
2022-12-21
Sherwood, Joshua; Tiemann, Jeremy; Stein, Jeffrey
(2022)
This dataset is associated with a larger manuscript published in 2022 in the Illinois Natural History Survey Bulletin that summarized the Fishes of Champaign County project from 2012-2015. With data spanning over 120 years, the Fishes of Champaign County is a comprehensive, long-term investigation into the changing fish communities of east-central Illinois. Surveys first occurred in Champaign County in the late 1880s (40 sites), with subsequent surveys in 1928–1929 (125 sites), 1959–1960 (143 sites), and 1987–1988 (141 sites). Between 2012 and 2015, we resampled 122 sites across Champaign County. The combined data from these five surveys have produced a unique perspective into not only the fish communities of the region, but also insight into in-stream habitat changes during the past 120 years.
The dataset is in Microsoft Access format, with five data tables, one for each time period surveyed. Field names are self-explanatory, with some variation in data types collected during different surveys as follows: Forbes & Richardson (1880s) collected presence/absence only. Thompson & Hunt (1928-1929) collected abundance only, Larimore & Smith (1959-1960) collected length and weight for some samples, but only presence/absence at others. In some cases, fish of the same species were weighed in bulk, with the fields “LOW” and “HIGH” indicating the lower and upper limits of total length in the batch, and weight indicating the gross weight of all fish in the batch. Larimore and Bayley (1987-1988) collected length and weight for all surveys, and Sherwood and Stein (2012-2015) collected length and weight for all surveys except for cases where extremely abundant single species where subsampled. Lengths are reported in millimeters, and weight in grams. Two lookup tables provide information about species codes used in the data tables and sample site location and notes.
keywords:
fishes of Champaign County; streams; anthropogenic disturbances; long-term dataset
published:
2024-09-19
Klimasmith, Isaac; Kent, Angela
(2024)
The use of potentially beneficial microorganisms in agriculture (microbial inoculants) has rapidly accelerated in recent years. For microbial inoculants to be effective as agricultural tools, these organisms must be able to survive and persist in novel environments while not destabilizing the resident community or spilling over into adjacent natural ecosystems. Here, we adapt a macroecological propagule pressure model to a microbial scale and present an experimental approach for testing the role of propagule pressure in microbial inoculant introductions. We experimentally determined the risk-release relationship for an IAA-expressing Pseudomonas simiae inoculant in a model monocot system. We then used this relationship to simulate establishment outcomes under a range of application frequencies (propagule number) and inoculant concentrations (propagule size). Our simulations show that repeated inoculant applications may increase establishment, even when increased inoculant concentration does not alter establishment probabilities.
The dataset filed here includes the experimemtal datafile, and a RMarkdown file that includes all the code used in in both the modeling and anaylsis.
keywords:
microbial inoculants; invasion ecology; propagule pressure; agriculture; modeling
published:
2024-06-17
Stuchiner, Emily; Jernigan, Wyatt; Zhang, Ziliang; Eddy, William; DeLucia, Evan; Yang, Wendy
(2024)
Data includes carbon mineralization rates, potential denitrification rates, net nitrous oxide fluxes, and soil chemical properties from a laboratory incubation of soil samples collected from 20 locations across an Illinois maize field.
keywords:
denitrification; nitrous oxide; dissolved organic carbon; maize
published:
2025-10-30
Dwivedi, Nidhi; Yamamoto, Senri; Zhao, Yunjun; Hou, Guichuan; Bowling, Forrest; Tobimatsu, Yuki; Liu, Chang-Jun
(2025)
Grass lignocelluloses feature complex compositions and structures. In addition to the presence of conventional lignin units from monolignols, acylated monolignols and flavonoid tricin also incorporate into lignin polymer; moreover, hydroxycinnamates, particularly ferulate, cross-link arabinoxylan chains with each other and/or with lignin polymers. These structural complexities make grass lignocellulosics difficult to optimize for effective agro-industrial applications. In the present study, we assess the applications of two engineered monolignol 4-O-methyltransferases (MOMTs) in modifying rice lignocellulosic properties. Two MOMTs confer regiospecific para-methylation of monolignols but with different catalytic preferences. The expression of MOMTs in rice resulted in differential but drastic suppression of lignin deposition, showing more than 50% decrease in guaiacyl lignin and up to an 90% reduction in syringyl lignin in transgenic lines. Moreover, the levels of arabinoxylan-bound ferulate were reduced by up to 50%, and the levels of tricin in lignin fraction were also substantially reduced. Concomitantly, up to 11 μmol/g of the methanol-extractable 4-O-methylated ferulic acid and 5–7 μmol/g 4-O-methylated sinapic acid were accumulated in MOMT transgenic lines. Both MOMTs in vitro displayed discernible substrate promiscuity towards a range of phenolics in addition to the dominant substrate monolignols, which partially explains their broad effects on grass phenolic biosynthesis. The cell wall structural and compositional changes resulted in up to 30% increase in saccharification yield of the de-starched rice straw biomass after diluted acid-pretreatment. These results demonstrate an effective strategy to tailor complex grass cell walls to generate improved cellulosic feedstocks for the fermentable sugar-based production of biofuel and bio-chemicals.
keywords:
Feedstock Production;Biomass Analytics;Genome Engineering
published:
2025-10-10
Singh, Ramkrishna; Liu, Hui; Shanklin, John; Singh, Vijay
(2025)
Lipids accumulated in the vegetative tissues of cellulosic feedstocks can be a potential raw material for biodiesel and bioethanol production. In this work, bagasse of genetically engineered sorghum was subjected to liquid hot-water pretreatment at 170, 180, and 190 °C for different reaction time. Under the optimal pretreatment condition (170 °C, 20 min), the residue was enriched in glucan (57.39 ± 2.63 % w/w) and xylan (13.38 ± 0.49 % w/w). The total lipid content of the pretreated residue was 6.81% w/w, similar to that observed in untreated bagasse (6.30% w/w). Pretreatment improved the enzymatic digestibility of bagasse, allowing a recovery of 79% w/w and 86% w/w of glucose and xylose, respectively. The pretreatment and enzymatic saccharification resulted in a 2-fold increase in total lipid in enzymatic residue compared to the original bagasse. Thus, pretreatment and enzymatic hydrolysis enabled high sugar recovery while concentrating triglycerides and free fatty acids in the residue.
keywords:
Conversion;Feedstock Production;Feedstock Bioprocessing
published:
2018-11-21
Clark, Lindsay V.; Lipka, Alexander E.; Sacks, Erik J.
(2018)
This set of scripts accompanies the manuscript describing the R package polyRAD, which uses DNA sequence read depth to estimate allele dosage in diploids and polyploids. Using several high-confidence SNP datasets from various species, allelic read depth from a typical RAD-seq dataset was simulated, then genotypes were estimated with polyRAD and other software and compared to the true genotypes, yielding error estimates.
keywords:
R programming language; genotyping-by-sequencing (GBS); restriction site-associated DNA sequencing (RAD-seq); polyploidy; single nucleotide polymorphism (SNP); Bayesian genotype calling; simulation
published:
2020-06-02
Xue, Qingquan; Dietrich, Christopher; Zhang, Yalin
(2020)
The text file contains the original data used in the phylogenetic analyses of Xue et al. (2020: Systematic Entomology, in press). The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The first six lines of the file identify the file as NEXUS, indicate that the file contains data for 89 taxa (species) and 2676 characters, indicate that the first 2590 characters are DNA sequence and the last 86 are morphological, that gaps inserted into the DNA sequence alignment and inapplicable morphological characters are indicated by a dash, and that missing data are indicated by a question mark. The file contains aligned nucleotide sequence data for 5 gene regions and 86 morphological characters. The positions of data partitions are indicated in the mrbayes block of commands for the phylogenetic program MrBayes at the end of the file (Subset1 = 16S gene; Subset2 = 28S gene; Subset3 = COI gene; Subset 4 = Histone H3 and H2A genes). The mrbayes block also contains instructions for MrBayes on various non-default settings for that program. These are explained in the original publication. Descriptions of the morphological characters and more details on the species and specimens included in the dataset are provided in the supplementary document included as a separate pdf, also available from the journal website. The original raw DNA sequence data are available from NCBI GenBank under the accession numbers indicated in the supplementary file.
keywords:
phylogeny; DNA sequence; morphology; Insecta; Hemiptera; Cicadellidae; leafhopper; evolution; 28S rDNA; 16S rDNA; histone H3; histone H2A; cytochrome oxidase I; Bayesian analysis
published:
2025-06-30
Mori, Jameson; Skowron, Nicholas; Barr, Daniel; Johnson, Ben; Novakofski, Jan; Mateus-Pinilla, Nohra
(2025)
This dataset contains measurements of water loss as white-tailed deer (Odocoileus virginianus) retroypharyngeal lymph nodes air-dried in a refrigerator for 31 days. Daily weights for lymph nodes are recorded every 24 hours, as are the variables "firmness" and "surface wetness". "Firmness" is a categorical variable measuring how much the tissue deforms to the touch (soft, medium, or hard). "Surface wetness" is the amount of visible moisture on the outside of the lymph node (all, some, or none). Lymph node weights were measured until their weights stabilized for 3 consecutive days at two decimal places (ex. 3.02, 3.02, 3.02) or until the weights fluctuated only by 0.01 (ex. 3.02, 3.03, 3.02). Lymph nodes were from northern Illinois white-tailed deer collected as part of the Illinois Department of Natural Resources' ongoing chronic wasting disease (CWD) management efforts.
keywords:
cervid; lymph node; chronic wasting disease; cwd; diagnostic testing; dessication; drying; tissue
published:
2025-10-24
Choe, Kisurb; Jindra, Michael A.; Hubbard, Susan; Pfleger, Brian; Sweedler, Jonathan
(2025)
Creating controlled lipid unsaturation locations in oleochemicals can be a key to many bioengineered products. However, evaluating the effects of modifications to the acyl-ACP desaturase on lipid unsaturation is not currently amenable to high-throughput assays, limiting the scale of redesign efforts to <200 variants. Here, we report a rapid mass spectrometry (MS) assay for profiling the positions of double bonds on membrane lipids produced by Escherichia coli colonies after treatment with ozone gas. By MS measurement of the ozonolysis products of Δ6 and Δ8 isomers of membrane lipids from colonies expressing recombinant Thunbergia alata desaturase, we screened a randomly mutagenized library of the desaturase gene at 5 s per sample. Two variants with altered regiospecificity were isolated, indicated by an increase in 16:1 Δ8 proportion. We also demonstrated the ability of these desaturase variants to influence the membrane composition and fatty acid distribution of E. coli strains deficient in the native acyl-ACP desaturase gene, fabA. Finally, we used the fabA deficient chassis to concomitantly express a non-native acyl-ACP desaturase and a medium-chain thioesterase from Umbellularia californica, demonstrating production of only saturated free fatty acids.
keywords:
Conversion;Lipidomics;Mass Spectrometry
published:
2025-11-06
Deshavath, Narendra Naik; Woodruff, William; Eller, Fred; Susanto, Vionna; Yang, Cindy; Rao, Christopher V.; Singh, Vijay
(2025)
Microbial oils are a sustainable biomass-derived substitute for liquid fuels and vegetable oils. Oilcane, an engineered sugarcane with superior feedstock characteristics for biodiesel production, is a promising candidate for bioconversion. This study describes the processing of oilcane stems into juice and hydrothermally pretreated lignocellulosic hydrolysate and their valorization to ethanol and microbial oil using Saccharomyces cerevisiae and engineered Rhodosporidium toruloides strains, respectively. A bioethanol titer of 106 g/L was obtained from S. cerevisiae grown on oilcane juice in a 3 L fermenter, and a lipid titer of 8.8 g/L was obtained from R. toruloides grown on oilcane hydrolysate in a 75 L fermenter. Oil was extracted from the R. toruloides cells using supercritical CO2, and the observed fatty acid profile was consistent with previous studies on this strain. These results demonstrate the feasibility of pilot-scale lipid production from oilcane hydrolysate as part of an integrated bioconversion strategy.
keywords:
Conversion;Bioproducts;Feedstock Bioprocessing;Hydrolysate
published:
2026-01-15
Singh, Ramkrishna; Bhagwat, Sarang; Viswanathan, Mothi Bharath; Cortes-Pena, Yoel; Eilts, Kristen; Mingfeng, Cao; Guest, Jeremy; Zhao, Huimin; Singh, Vijay
(2026)
Triacetic acid lactone (TAL) can be microbially produced and further chemically upgraded to several high-value chemicals. In this work, several acidic and basic ion-exchange resins and activated charcoal were evaluated for their ability to adsorb microbially produced TAL. Activated charcoal and a weak base resin, Dowex 66, showed similar TAL adsorption capacity of 0.18 ± 0.002 g/g. At 15% w/v activated charcoal, about 98% of TAL present in fermentation broth could be adsorbed. Further, ethanol washing allowed recovery of 72% of adsorbed TAL. A biorefinery producing TAL from sucrose was designed, simulated, and evaluated (through technoeconomic analysis) under uncertainty, for an estimated TAL minimum product selling price (MPSP) of $4.27/kg [$3.71−4.94/kg; 5th-95th percentiles] for the current state of technology and $2.83/kg [$2.46–3.29/kg] following potential near-term improvements to fermentation. Thus, this work provides an adsorptive process to recover microbially produced TAL that can be chemically upgraded to several industrial products.
keywords:
Bioproducts; Feedstock Bioprocessing
published:
2024-10-07
Kole Aspray, Elise; Ainsworth, Elizabeth; McGrath, Jesse; McGrath, Justin; Montes, Christopher; Whetten, Andrew; Ort, Donald; Long, Stephen; Puthuval, Kannan; Mies, Timothy; Bernacchi, Carl; DeLucia, Evan; Dalsing, Bradley; Leakey, Andrew; Li, Shuai; Herriott, Jelena; Miglietta, Franco
(2024)
This data set is related to the SoyFACE experiments, which are open-air agricultural climate change experiments that have been conducted since 2001. The fumigation experiments take place at the SoyFACE farm and facility in Champaign County, Illinois during the growing season of each year, typically between June and October.
This V4 contains new experimental data files, hourly fumigation files, and weather/ambient files for 2022 and 2023, since the original dataset only included files for 2001-2021. The MATLAB code has also been updated for efficiency, and explanatory files have been updated accordingly. Below are new changes in V4:
- The "SoyFACE Plot Information 2001 to 2021" file is renamed to “SoyFACE ring information 2001 to 2023.xlsx”. Data for 2022 and 2023 were added. File contains information about each year of the SoyFACE experiments, including the fumigation treatment type (CO2, O3, or a combination treatment), the crop species, the plots (also referred to as 'rings' and labeled with numbers between 2 and 31) used in each experiment, important experiment dates, and the target concentration levels or 'setpoints' for CO2 and O3 in each experiment.
- The "SoyFACE 1-Minute Fumigation Data Files" were updated to contain sub-folders for each year of the experiments (2001-2023), each of which contains sub-folders for each ring used in that year's experiments. This data set also includes hourly data files for the fumigation experiments ("SoyFACE Hourly Fumigation Data Files" folder) created from the 1-minute files, and hourly ambient/weather data files for each year of the experiments ("Hourly Weather and Ambient Data Files" folder which has also been updated to include 2022 and 2023 data). The ambient CO2 and O3 data are collected at SoyFACE, and the weather data are collected from the SURFRAD and WARM weather stations located near the SoyFACE farm.
- “Rings.xlsx” is new in this version. This file lists the rings and treatments used in each year of the SoyFACE experiments between 2001 and 2023 and is used in several of the MATLAB codes.
- “CMI Weather Data Explanation.docx” is newly added. This file contains specific information about the processing of raw weather data, which is used in the hourly weather and ambient data files.
- Files that were in RAR format in V3 are now updated and saved as ZIP format, including: Hourly Weather and Ambient Data Files.zip , SoyFACE 1-Minute Fumigation Data Files.zip , SoyFACE Hourly Fumigation Data Files.zip, and Matlab Files.zip.
- The "Fumigation Target Percentages" file was updated to add data for 2022 and 2023. This file shows how much of the time the CO2 and O3 fumigation levels are within a 10 or 20 percent margin of the target levels when the fumigation system is turned on.
- The "Matlab Files" folder contains custom code (Aspray, E.K.) that was used to clean the "SoyFACE 1-Minute Fumigation Data" files and to generate the "SoyFACE Hourly Fumigation Data" and "Fumigation Target Percentages" files. Code information can be found in the various "Explanation" files. The Matlab code changes are as follows:
1. “Data_Issues_Finder.m” code was changed to use the “Ring.xlsx” file to gather ring and treatment information based on the contents of the file rather than being hardcoded in the Matlab code itself.
2. “Data_Issues_Finder_all.m” code is new. This code is the same as the “Data_Issues_Finder.m” code except that it identifies all CO2 and O3 repeats. In contrast, the “Data_Issues_Finder.m” code only identifies CO2 and O3 repeats that occur when the fumigation system is turned on.
3. “Target_Yearly.m” code was changed to use the “Ring.xlsx” file to gather ring and treatment information based on the contents of the file rather than being hardcoded in the Matlab code itself.
4. “HourlyFumCode.m” code is new. This code uses the “Rings.xlsx” file to gather ring and treatment information based on the contents of the file instead of the user needing to define these values explicitly. This code also defines a list of all ring folders for the year selected and runs the hourly code for each ring, instead of the user having to run the hourly code for each ring individually. Finally, the code generates two dialog boxes for the user, one which allows user to specify whether they want the hourly code to be run for 1-minute fumigation files or 1-minute ambient files, and another which allows user to specify whether they would like the hourly fumigation averages to be replaced with hourly ambient averages when the fumigation system is turned off.
5. “HourlyDataFun.m” code was changed to run either “HourlyData.m” code or “HourlyDataAmb.m” code, depending on user input in the first dialog box.
6. “HourlyData.m” code was changed to replace hourly fumigation averages with hourly ambient averages when the fumigation system is turned off, depending on user input in the second dialog box.
7. “HourlyDataAmb.m” code is new. This code is similar to “HourlyData.m” code but is used to calculate hourly averages for 1-minute ambient files instead 1-minute fumigation files.
8. “batch.m” code was changed to account for new function input variables in “HourlyDataFun.m” code, along with adding header columns for “FumOutput.xlsx” and “AmbOutput.xlsx” output files generated by “HourlyData.m” and “HourlyDataAmb.m” code.
- Finally, the " * Explanation" files contain information about the column names, units of measurement, steps needed to use Matlab code, and other pertinent information for each data file. Some of them have been updated to reflect the current change of data.
keywords:
SoyFACE; agriculture; agricultural; climate; climate change; atmosphere; atmospheric change; CO2; carbon dioxide; O3; ozone; soybean; fumigation; treatment
published:
2019-03-06
Makhnenko, Roman; Tarokh, Ali
(2019)
This dataset is provided to support the statements in Tarokh, A., and R.Y. Makhnenko. 2019. Remarks on the solid and bulk responses of fluid-filled porous rock, Geophysics.
The unjacketed bulk modulus is a poroelastic parameter that can be directly measured in a laboratory test under a loading that preserves the difference between the mean stress and pore pressure constant. For a monomineralic rock, the measurement of the unjacketed bulk modulus is ignored because it is assumed to be equal to the bulk modulus of the solid phase. To examine this assumption, we tested porous sandstones (Berea and Dunnville) and limestones (Apulian and Indiana) mainly composed of quartz and calcite, respectively, under the unjacketed condition. The presence of microscale inhomogeneities, in the form of non-connected (occluded) pores, was shown to cause a considerable difference between the unjacketed bulk modulus and the bulk modulus of the solid phase. Furthermore, we found the unjacketed bulk modulus to be independent of the unjacketed pressure and Terzaghi effective pressure and therefore a constant.
keywords:
Poroelasticity; anisotropic solid skeleton; unjacketed bulk modulus; non-connected porosity