Illinois Data Bank Dataset Search Results
Results
published:
2025-11-24
Nguyen, Viviana; Xue, Pu; Li, Yifei; Zhao, Huimin; Lu, Ting
(2025)
Microbial growth emerges from coordinated synthesis of various cellular components from limited resources. In Saccharomyces cerevisiae, cyclic AMP (cAMP)-mediated signaling is shown to orchestrate cellular metabolism; however, it remains unclear quantitatively how the controlling circuit drives resource partition and subsequently shapes biomass growth. Here we combined experiment with mathematical modeling to dissect the signaling-mediated growth optimization of S. cerevisiae. We showed that, through cAMP-mediated control, the organism achieves maximal or nearly maximal steady-state growth during the utilization of multiple tested substrates as well as under perturbations impairing glucose uptake. However, the optimal cAMP concentration varies across cases, suggesting that different modes of resource allocation are adopted for varied conditions. Under settings with nutrient alterations, S. cerevisiae tunes its cAMP level to dynamically reprogram itself to realize rapid adaptation. Moreover, to achieve growth maximization, cells employ additional regulatory systems such as the GCN2-mediated amino acid control. This study establishes a systematic understanding of global resource allocation in S. cerevisiae, providing insights into quantitative yeast physiology as well as metabolic strain engineering for biotechnological applications.
keywords:
Conversion;Metabolomics;Modeling
published:
2018-07-29
Molloy, Erin K.; Warnow, Tandy
(2018)
This repository includes scripts, datasets, and supplementary materials for the study, "NJMerge: A generic technique for scaling phylogeny estimation methods and its application to species trees", presented at RECOMB-CG 2018. The supplementary figures and tables referenced in the main paper can be found in njmerge-supplementary-materials.pdf. The latest version of NJMerge can be downloaded from Github: https://github.com/ekmolloy/njmerge.
***When downloading datasets, please note that the following errors.***
In README.txt, lines 37 and 38 should read:
+ fasttree-exon.tre contains lines 1-25, 1-100, or 1-1000 of fasttree-total.tre
+ fasttree-intron.tre contains lines 26-50, 101-200, or 1001-2000 of fasttree-total.tre
Note that the file names (fasttree-exon.tre and fasttree-intron.tre) are swapped.
In tools.zip, the compare_trees.py and the compare_tree_lists.py scripts incorrectly refer to the "symmetric difference error rate" as the "Robinson-Foulds error rate". Because the normalized symmetric difference and the normalized Robinson-Foulds distance are equal for binary trees, this does not impact the species tree error rates reported in the study. This could impact the gene tree error rates reported in the study (see data-gene-trees.csv in data.zip), as FastTree-2 returns trees with polytomies whenever 3 or more sequences in the input alignment are identical. Note that the normalized symmetric difference is always greater than or equal to the normalized Robinson-Foulds distance, so the gene tree error rates reported in the study are more conservative.
In njmerge-supplementary-materials.pdf, the alpha parameter shown in Supplementary Table S2 is actually the divisor D, which is used to compute alpha for each gene as follows.
1. For each gene, a random value X between 0 and 1 is drawn from a uniform distribution.
2. Alpha is computed as -log(X) / D, where D is 4.2 for exons, 1.0 for UCEs, and 0.4 for introns (as stated in Table S2).
Note that because the mean of the uniform distribution (between 0 and 1) is 0.5, the mean alpha value is -log(0.5) / 4.2 = 0.16 for exons, -log(0.5) / 1.0 = 0.69 for UCEs, and -log(0.5) / 0.4 = 1.73 for introns.
keywords:
phylogenomics; species trees; incomplete lineage sorting; divide-and-conquer
published:
2025-11-19
Banerjee, Shivali; Beraja, Galit; Eilts, Kristen; Singh, Vijay
(2025)
:Bioenergy crops have been known for their ability to produce biofuels and bioproducts. In this study, the product portfolio of recently developed transgenic sugarcane (oilcane) bagasse has been redefined for recovering natural pigments (anthocyanins), sugars, and vegetative lipids. The total anthocyanin content in oilcane bagasse has been estimated as 92.9 ± 18.9 µg/g of dried bagasse with cyanidin-3-glucoside (13.5 ± 18.9 µg per g of dried bagasse) as the most prominent anthocyanin present. More than 85 % (w/w) of the total anthocyanins were recovered from oilcane bagasse at a pretreatment temperature of 150 °C for 15 min. These conditions for the hydrothermal pretreatment also led to a 2-fold increase in the glucose yield upon the enzymatic saccharification of the pretreated bagasse. Further, a 1.5-fold enrichment of the vegetative lipids was demonstrated in the pretreated residue. Re-defining green biorefineries with multiple high-value products in a zero-waste approach is the need of the hour for attaining sustainability.
keywords:
Conversion;Biomass Analytics;Bioproducts;Biorefinery;Oilcane
published:
2025-09-30
Kurambhatti, Chinmay V.; Kumar, Deepak; Singh, Vijay
(2025)
The coproduction of high-value anthocyanin extract in the cellulosic ethanol process would diversify the co-product market, increase revenue, and potentially improve the economics of the process. The high anthocyanin concentration in the cob and structural carbohydrates in residual stover make purple corn stover an attractive source for anthocyanin and ethanol coproduction. This study aimed to develop simulation models for processes integrating ethanol production and anthocyanin extraction using purple corn stover, to evaluate their techno-economic feasibility, and to compare their performance with the conventional ethanol production process using corn stover. The annual ethanol production for plants processing 2000 MT dry feedstock / day was 148.6 million L/year for the integrated processes compared with 222.6 million L/year for the conventional process. Anthocyanin production in the modified processes using dilute acid-based and water-based anthocyanin extraction processes was 1779 and 1099 MT/year, respectively. Capital investments for the integrated processes ($448.1 to $443.8 million) were higher than the conventional process ($371.9 million). Due to high revenue from anthocyanin extract, the ethanol production cost for the integrated process using acid-based anthocyanin extraction ($0.36/L) was 34.5% lower than conventional ethanol production ($0.55/L). The ethanol production cost for the integrated process using water-based anthocyanin extraction ($0.68/L) was higher than conventional ethanol production due to low ethanol and anthocyanin yields. The minimum ethanol selling price for the integrated process using acid-based anthocyanin extraction ($0.65/L) was also lower than the conventional process ($0.72/L), indicating an improvement in economic performance.
keywords:
Conversion;Economics;Feedstock Bioprocessing;Modeling
published:
2022-02-11
Lu, Yiyang; Bohn-Wippert, Kathrin; Pazerunas, Patrick J.; Moy, Jennifer M.; Singh, Harpal; Dar, Roy D.
(2022)
Upon treatment removal, spontaneous and random reactivation of latently infected T cells remains a major barrier toward curing HIV. Due to its stochastic nature, fluctuations in gene expression (or “noise”) can bias HIV reactivation from latency, and conventional drug screens for mean gene expression neglect compounds that modulate noise. Here we present a time-lapse fluorescence microscopy image set obtained from a Jurkat T-cell line, infected with a minimal HIV gene circuit, treated with 1,806 small molecule compounds, and imaged for 48 hours. In addition, the single-cell time-dependent reporter dynamics (single-cell gene expression intensity and noise trajectories) extracted from the image dataset are included. Based on this dataset, a total of 5 latency promoting agents of HIV was found through further experimentation in Lu et al., PNAS 2021 (doi: 10.1073/pnas.2012191118).
For a detailed description of the dataset, please refer to the readme file.
keywords:
HIV; latency; drug screen; fluorescence microscopy; time-lapse; microscopy; single-cell data; noise; gene expression fluctuation;
published:
2025-11-12
BAYSAL, CAN; Kausch, Albert P.; Cody, Jon P.; Altpeter, Fredy; Voytas, Daniel
(2025)
The requirement of in vitro tissue culture for the delivery of gene editing reagents limits the application of gene editing to commercially relevant varieties of many crop species. To overcome this bottleneck, plant RNA viruses have been deployed as versatile tools for in planta delivery of recombinant RNA. Viral delivery of single-guide RNAs (sgRNAs) to transgenic plants that stably express CRISPR-associated (Cas) endonuclease has been successfully used for targeted mutagenesis in several dicotyledonous and few monocotyledonous plants. Progress with this approach in monocotyledonous plants is limited so far by the availability of effective viral vectors. We engineered a set of foxtail mosaic virus (FoMV) and barley stripe mosaic virus (BSMV) vectors to deliver the fluorescent protein AmCyan to track viral infection and movement in Sorghum bicolor. We further used these viruses to deliver and express sgRNAs to Cas9 and Green Fluorescent Protein (GFP) expressing transgenic sorghum lines, targeting Phytoene desaturase (PDS), Magnesium-chelatase subunit I (MgCh), 4-hydroxy-3-methylbut-2-enyl diphosphate reductase, orthologs of maize Lemon white1 (Lw1) or GFP. The recombinant BSMV did neither infect sorghum nor deliver or express AmCyan and sgRNAs. In contrast, the recombinant FoMV systemically spread throughout sorghum plants and induced somatic mutations with frequencies reaching up to 60%. This mutagenesis led to visible phenotypic changes, demonstrating the potential of FoMV for in planta gene editing and functional genomics studies in sorghum.
keywords:
Feedstock Production;Genome Engineering;Genomics
published:
2019-12-10
Yang, Pan; Zhao, Qiankun; Cai, Ximing
(2019)
The dataset consists of two types of data: the estimate of land productivity (the maximum productivity, MP) and the estimate of land that has low productivity for any major crops planted in the Contiguous United States and then may be available for growing bioenergy crops (the marginal land, ML). All data items are in GeoTiff format, under the World Geodetic System (WGS) 84 project, and with a resolution of 0.0020810045 degree (~250 m).
The MP values are calculated based on machine learning model estimated yields of major crops in the CONUS, and its expected value (MP_mean.tif), and associated uncertainty (MP_IDP.tif). The ML availability data have two versions: a deterministic version and a version with uncertainty. The deterministic MLs are determined as the land pixels with expected MP values falling in the range defined in the following criteria, and the MLs with uncertainty are determined as the probability that the MP value of a land pixel falls in the range defined in the following criteria:
Criteria_____Description
S1________ Current crop and pasture land with MP <= P50
S2________ Current crop and pasture land with MP <= P25
S3________ S1 + current grass and shrub land with P25 < MP < P50
S4________ S2 + current grass and shrub land with P10 < MP < P25
Economic__ Current crop and pasture land with potential profitability < 0
Here P10, P25 and P50 are the 10th, 25th and 50th percentile of crop MP values
keywords:
Land productivity;marginal land;land use
published:
2021-10-24
Tillman, Francis E.; Bakken, George S.; O'Keefe, Joy M.
(2021)
This dataset contains daily and hourly temperature measurements in twenty different bat box designs deployed in central Indiana, USA from May to September 2018. Daily and hourly environmental data (temperature, solar radiation, wind speed and direction) are also included for days and hours sampled. Bat box temperature data were reclassified to cool (</= 30°C), permissive (30.1–39.9°C), and stressful (>/= 40°C) categories according to known temperature tolerances of temperate-zone bats.
keywords:
bat box; design; environmental variables; microclimate; temperature
published:
2024-04-11
Margenot, Andrew; Zhou, Shengnan; Xu, Suwei; Condron, Leo; Metson, Geneviève; Haygarth, Philip; Wade, Jordon; Agyeman, Price Chapman
(2024)
A defining feature of the Anthropocene is the distortion of the biosphere phosphorus (P) cycle. A relatively sudden acceleration of input fluxes without a concomitant increase in output fluxes has led to net accumulation of P in the terrestrial-aquatic continuum. Over the past century, P has been mined from geological deposits to produce crop fertilizers. When P inputs are not fully removed with harvest of crop biomass, the remaining P accumulates in soils. This residual P is a uniquely anthropogenic pool of P, and its management is critical for agronomic and environmental sustainability. This dataset includes data for us to quantify residual P from different long-term managed systems.
The following is the desccription of the dataset. There are 7 sheets in total.
1. P_balance: From Morrow Plots maize-maize rotaiton (1888-2021), L: Low estimation; M: medium estimation; H: high estimation;
2. M3P: From Morrow Plots selected plots (selected years), M3P_sur: Mehlich III P concentration in surface 17cm soils; M3P_sub: Mehlich III P concentration in 17-34cm subsoils; P_balance: the difference between P inputs and P outputs; TP_sur: total P stocks in surface 17cm soils; TP_sub: total P stocks in 17-34cm subsoils;
3. Morrow_Plot_P_pool_all: Group: a - labile P; b - Fe/Al-P; c - Ca-P; d - total organic P; e - non-extractable P; Fertilized: P stocks in the fertilized plot; Unfertilized: P stocks in the unfertilized plot; F-U: difference between P stocks in ther fertilized and unfertilized plots; dif%: percent difference in total P;
4. Rothamsted_P_pool_all: Treatment: Unfertilized: no fertilization; FYM: farmyard manure; PK: synthetic P and K fertilizer; Group: a - labile P; b - Fe/Al-P; c - Ca-P; d - total organic P; e - non-extractable P; P_change: differnce in P stocks over time; dif%: percent difference in total P;
5. L'Acadie_P_pool_all: Treatment: MP_LowP: moldboard plow with low rate of P fertilizer; MP_HighP: moldboard plow with high rate of P fertilizer; NT_LowP: no till with low rate of P fertilizer; NT_HighP: no till with high rate of P fertilizer; Group: a - labile P; b - Fe/Al-P; c - Ca-P; d - total organic P; e - non-extractable P; P_change: differnce in P stocks over time; dif%: percent difference in total P;
6. Rothamsted_P_pool_duration: Treatment: Unfertilized: no fertilization; FYM: farmyard manure; PK: synthetic P and K fertilizer; Duration: from a year to another year; Group: a - labile P; b - Fe/Al-P; c - Ca-P; d - total organic P; e - non-extractable P; P_change: differnce in P stocks over time; dif%: percent difference in total P;
7. L'Acadie_P_pool_duration: Treatment: MP_LowP: moldboard plow with low rate of P fertilizer; MP_HighP: moldboard plow with high rate of P fertilizer; NT_LowP: no till with low rate of P fertilizer; NT_HighP: no till with high rate of P fertilizer; Duration: from a year to another year; Group: a - labile P; b - Fe/Al-P; c - Ca-P; d - total organic P; e - non-extractable P; P_change: differnce in P stocks over time; dif%: percent difference in total P;
keywords:
phosphate rock; biosphere; balances; soil test P; long-term experiment
published:
2025-12-02
Cheng, Ming-Hsun; Maitra, Shraddha; Carr Clennon, Aidan N.; Appell, Michael; Dien, Bruce; Singh, Vijay
(2025)
The recalcitrance of lignocellulosic biomass necessitates an efficient pretreatment protocol for operating a successful cellulosic biorefinery. It is critical to improve cellulose accessibility for hydrolysis and fermentation by altering the plant cell wall’s physical structure and chemical composition. Sequential hydrothermal-mechanical refining pretreatment (HMR) allows efficient recovery of cellulosic sugars without utilizing any hazardous chemicals. HMR has been successfully applied to Liberty switchgrass, a bioenergy cultivar released by the USDA, and now it is being applied to oilcane, a recently developed transgenic sugarcane variety engineered to accumulate lipids in its vegetative tissues. Sugar yields of oilcane bagasse (OCB) and switchgrass (SG) treated with HMR are 96.4% and 75.4%, respectively. This study sought to correlate cellulosic sugar yields with structural changes within the cell wall caused by HMR on two distinct bioenergy crops. Simon’s staining technique for the specific surface area analysis showed that HMR increased the specific surface area of pretreated biomass residues by 80-112%. In addition, ATR-FTIR was performed to determine the effects of HMR on physical structures based on the total crystallinity index (TCI) and hydrogen bonding intensity (HBI). Irrespective of biomass type, HMR decreased the initial crystalline cellulose contents of untreated biomass residues by 3.5% and reduced TCI and HBI by 7-13%. The study found that sugar yields were negatively correlated to reducing values of hydrogen bonding intensity, crystalline cellulose content, and total crystallinity index.
keywords:
Conversion;Biomass Analytics;Economics;Hydrolysate
published:
2025-12-18
Boob, Aashutosh; Zhang, Changyi; Pan, Yuwei; Zaidi, Airah; Whitaker, Rachel; Zhao, Huimin
(2025)
Sulfolobus islandicus, an emerging archaeal model organism, offers unique advantages for metabolic engineering and synthetic biology applications owing to its ability to thrive in extreme environments. Although several genetic tools have been established for this organism, the lack of well-characterized chromosomal integration sites has limited its potential as a cellular factory. Here, we systematically identified and characterized 13 artificial CRISPR RNAs targeting eight integration sites in S. islandicus using the CRISPR-COPIES pipeline and a multi-omics-informed computational workflow. We leveraged the endogenous CRISPR-Cas system to integrate the reporter gene lacS and validated heterologous expression through a β-galactosidase assay, revealing significant positional effects. As a proof of concept, we utilized these sites to genetically manipulate lipid ether composition by overexpressing glycerol dibiphytanyl glycerol tetraether (GDGT) ring synthase B (GrsB). This study expands the genetic toolbox for S. islandicus and advances its potential as a robust platform for archaeal synthetic biology and industrial biotechnology.
keywords:
AI/ML; gene editing; genome engineering; metabolic engineering
published:
2019-03-22
Jones, Todd M.; Benson, Thomas J.; Ward, Michael P.
(2019)
This data publication provides example video clips related to research on association among flight ability of juvenile songbirds at fledging and juvenile morphological traits (wing emergence, wing length, body condition, mass, and tarsus length. File names reflect the species dropped in each video. These videos are supplemental material for scientific publications by the authors and reflect an example subset of all videos collected form 2017-2018 as part of a larger study on the post-fledging ecology of grassland and shrubland birds in east-Central Illinois, USA. No birds were harmed/injured in the production of these videos and procedures were approved by the Illinois Institutional Animal Care and Use Committee (IACUC), protocol no. 18221. Individuals depicted in the videos have given consent for the videos to be shared (talent/model release form; <a href="https://publicaffairs.illinois.edu/resources/release/">https://publicaffairs.illinois.edu/resources/release/</a>)
keywords:
songbirds; flight ability; wing development; wing length; wing emergence; nestling development; post-fledging
published:
2025-08-20
Arshad, Muhammad Umer; Archer, David ; Wasonga, Daniel ; Namoi, Nictor; Boe, Arvid ; Rob , Mitchell; Heaton, Emily; Khanna, Madhu; Lee, DoKyoung
(2025)
The compiled datasets include detailed costs for switchgrass production, categorized into establishment, maintenance, and harvesting expenses, along with revenue calculations. Costs were gathered from multiple sources and adjusted for inflation, focusing on farm-gate profitability, excluding fixed costs and transportation. All financial data is provided per hectare. The dataset was used to evaluate the economic performance of forage- and bioenergy-type switchgrass cultivars and their response to nitrogen fertilization across diverse marginal environments in the U.S. Midwest. Data Envelopment Analysis (DEA) and cost-benefit analysis were employed to assess the efficiency and profitability of 23 different cultivar and fertilization rate combinations over five years.
published:
2025-11-25
The diel activity of study animals while feeding at their kills in the Santa Cruz Mountains of California
keywords:
Santa Cruz
published:
2026-01-08
Dibaeinia, Payam; Sinha, Saurabh
(2026)
CoNSEPT is a tool to predict gene expression in various cis and trans contexts. Inputs to CoNSEPT are enhancer sequence, transcription factor levels in one or many trans conditions, TF motifs (PWMs), and any prior knowledge of TF-TF interactions.
keywords:
software; gene expression
published:
2025-08-27
Jang, Chunhwa; Namoi, Nictor; Lee, Jung Woo; Becker, Talon; Rooney, William; Lee, DoKyoung
(2025)
Data were collected from agronomy fields in Urbana and Ewing, IL, during the 2022 and 2023 growing seasons. The dataset includes dry biomass yield, nitrogen, phosphorus, and potassium concentrations and removals, and chemical composition elements (cellulose, hemicellulose, lignin, and soluble fractions) for 13 high-biomass sorghum hybrids.
data_sharing.xlsx contains 20 columns and 104 rows. Below is the explanation of all variables in the file:
Year: 2022; 2023
Location: Urbana, IL; Ewing, IL
N rate (kg-N/ha): 0; 112
Hybrid #: H1-H13
Pedigree: Pedigree for 13 hybrids
Dry biomass yield (Mg/ha): Aboveground dry biomass yield
N (g/kg): Nitrogen concentration in plant tissue
P (g/kg): Phosphorus concentration in plant tissue
K (g/kg): Potassium concentration in plant tissue
N (kg/ha): Nitrogen removal by aboveground biomass
P (kg/ha): Phosphorus removal by aboveground biomass
K (kg/ha): Potassium removal by aboveground biomass
Cellulose (g/kg): Cellulose concentration in plant tissue
Hemicellulose (g/kg): Hemicellulose concentration in plant tissue
Lignin (g/kg): Lignin concentration in plant tissue
Soluble (g/kg): Soluble concentration in plant tissue
Cellulose (Mg/ha): Cellulose content in aboveground biomass
Hemicellulose (Mg/ha): Hemicellulose content in aboveground biomass
Lignin (Mg/ha): Lignin content in aboveground biomass
Soluble (Mg/ha): Soluble content in aboveground biomass
keywords:
high-biomass sorghum hybrids; yield potential; environmental adaptability; feedstock quality; nutrient removal; N fertilization
published:
2025-09-26
Arora, Amit; Singh, Vijay
(2025)
In this study, different process schemes were designed and evaluated for biodiesel production from engineered cane lipids with uncertain fatty acid compositions. Four different process schemes were compared under (i) thermal glycerolysis and (ii) enzymatic glycerolysis approaches. These schemes were based on the biodiesel yield and economic indicators such as the net present value (NPV) and the minimum selling price (MSP) of biodiesel. A scheme with polar lipid separation under thermal glycerolysis resulted in the maximum NPV ($96.5 million) and minimum MSP ($1107/ton biodiesel), respectively. Through local sensitivity analysis, it was concluded that the cane lipid percentage is the most significant factor influencing process economics. A conjoint analysis of the lipid procurement price and cane lipid percent suggested that 15% cane lipids with a low lipid procurement price ($0.536/kg) results in a positive NPV. When the cane lipid price is higher (>$0.80/kg), a 20% lipid content should be considered to achieve a positive NPV. At 20% cane lipids, the worst-case and best-case scenarios were evaluated by analyzing the interplay of the three most important parameters, The best-case scenario revealed that the minimum NPV under any process scheme could yield more than $100 million (or MSP: $0.80/L), and the worst-case analysis showed that losses incurred by the plant could be as high as $80 million (MSP: $1.36/L). A Monte Carlo simulation indicated that there is a 70% chance of the plant being profitable (NPV > 0).
keywords:
Conversion;Economics;Feedstock Bioprocessing;Modeling
published:
2025-10-21
Trieu, Anthony; Belaffif, Mohammad B.; Hirannaiah, Pradeepa; Manjunatha, Shilpa; Wood, Rebekah; Bathula, Yokshitha; Billingsley, Rebecca L.; Arpan, Anjali; Sacks, Erik; Clemente, Tom; Moose, Stephen; Reichert, Nancy A.; swaminathan, kankshita
(2025)
Miscanthus, a C4 member of the family Poaceae, is a promising perennial crop for bioenergy, renewable bioproducts, and carbon sequestration. Species of interest include nothospecies Miscanthus x giganteus and its parental species M. sacchariflorus and M. sinensis. Use of biotechnology-based procedures to genetically improve miscanthus, to date, have only included plant transformation procedures for introduction of exogenous genes into the host genome at random, non-targeted sites.
keywords:
Feedstock Production;Biomass Analytics;Genomics
published:
2023-07-14
Punyasena, Surangi W.; Urban, Michael A.; Adaime, Marc-Elie; Romero, Ingrid; Jaramillo, Carlos
(2023)
This dataset includes a total of 300 images of 45 extant species of Podocarpus (Podocarpaceae) and nine images of fossil specimens of the morphogenus Podocarpidites. The goal of this dataset is to capture the diversity of morphology within the genus and create an image database for training machine learning models.
The images were taken using Airyscan confocal superresolution microscopy at 630x magnification (63x/NA 1.4 oil DIC). The images are in the CZI file format. They can be opened using Zeiss propriety software (Zen, Zen lite) or open microscopy software, such as ImageJ. More information on how to open CZI files can be found here: [https://www.zeiss.com/microscopy/us/products/software/zeiss-zen/czi-image-file-format.html]
Please cite this dataset and listed publications when using these images.
keywords:
optical superresolution microscopy; Zeiss Airyscan; CZI images; conifer; saccate pollen; Podocarpus; Podocarpidites; Smithsonian Tropical Research Institute
published:
2022-07-19
Parmar, Dharmeshkumar; Jia, Jin; Shrout, Joshua; Sweedler, Jonathan; Bohn, Paul
(2022)
#### Details of Pseudomonas aeruginosa biofilm dataset ####
----------------*Folder Structure*-------------------------------------
This dataset contains peak intensity tables extracted from mass spectrometry imaging (MSI) data using tools, SCiLS and MSI reader. There are 2 folders in "MSI-Data-Paeruginosa-biofilms-UIUC-DP-JVS-July2022.zip", each folder contains 3 sub-folders as listed below.
1. PellicleBiofilms-and-Supernatant [Pellicle biofilms collected from air-liquid interface and spend supernatant medium after 96 h incubation period]:
(1) Full-Scan-Data-96h; (2) MSMS-data-from-C7-Quinolones-96h; and (3) MSMS-data-from-C9-Quinolones-96h
2. StaticBiofilms [Static biofilms grown on mucin surface]:
(1) Full-Scan-Data; (2) MSMS-data-from-C7-Quinolones; and (3) MSMS-data-from-C9-Quinolones
----------------*File name*----------------------------------------------
Sample information is included in the file names for easy identification and processing. Attributes covered in file names are explained in the example below.
*Example file name "Rep1-Stat-FRD1-mPat-48-FS"*
~ Each unit of information is separated by "-"
~Unit 1 - "Rep1" - Biological replicate ( Rep1, Rep2, and Rep3)
~Unit 2 - "Stat" - Sample type (Stat = Static Biofilm, Pel = Pellicle biofilm, Sup = Supernatant)
~Unit 3 - "FRD1" - Strain (FRD1 = Mucoid strain, PAO1C = Non-mucoid strain)
~Unit 4 - "mPat" - Type of mucin surface used (mPat = patterned mucin surface, mUni = uniform mucin surface)
~Unit 5 - "48" - Sample time point (hours = 48, 72, 96)
~Unit 6 - "FS" - Scan type used in MSI (FS = high resolution full-scan, 260 = targeted MS/MS of C7 quinolones (m/z 260), 288 = targeted MS/MS of C9 quinolones (m/z 288))
----------------*File structure*------------------------------------------
All MSI data has been exported to CSV format. Each CSV files contains information about scan number, Coordinates (x,y,z), m/z values, extraction window (absolute), and corresponding intensities in the form of a matrix.
----------------*End of Information*--------------------------------------
keywords:
mass spectrometry imaging (MSI); biofilm; antibiotic resistance; Pseudomonas aeruginosa; quorum sensing; rhamnolipids
published:
2024-08-19
Ward, Michael; Stewart, Sarah; Benson, Thomas
(2024)
Data on the nesting success and post-fledgling survival of Eastern Whip-poor-wills in central Illinois. Data was part of Sarah Stewart's MS project at the University of Illinois.
keywords:
bird nesting success; post-fledgling survival; eastern whip-poor-will
published:
2025-10-10
Field, John L.; Richard, Tom; Smithwick, Erica A. H.; Cai, Hao; Laser, Mark; LeBauer, David; Long, Stephen; Paustian, Keith; Qin, Zhangcai; Sheehan, John; Smith, Pete; Wang, Michael Q.; Lynd, Lee
(2025)
This zip file contains a UNIX-format DayCent model executable, input files, automation code, and associated directory structure necessary to re-produce the DayCent analysis underlying the manuscript. The main script “autodaycent.py” (written for Python 2.7) opens an interactive command line routine that facilitates: Calibrating the DayCent pine growth model; Initializing DayCent for a set of case studies sites; Executing an ensemble of model runs representing case study site reforestation, grassland restoration, or conversion to switchgrass cultivation; and Results analysis & generation of manuscript Fig. 3. Note that the interactive analysis code requires that all input files to be contained in the directory structure as uploaded, without modification. Executable versions of the DayCent model compatible with other operating systems are available upon request.
keywords:
Feedstock Production;Modeling
published:
2018-01-03
Sweet, Andrew; Bush, Sarah; Gustafsson, Daniel; Allen, Julie; DiBlasi, Emily; Skeen, Heather; Weckstein, Jason; Johnson, Kevin
(2018)
Concatenated sequence alignment, phylogenetic analysis files, and relevant software parameter files from a cophylogenetic study of Brueelia-complex lice and their avian hosts. The sequence alignment file includes a list of character blocks for each gene alignment and the parameters used for the MrBayes phylogenetic analysis.
1) Files from the MrBayes analyses:
a) a file with 100 random post-burnin trees (50% burnin) used in the cophylogenetic analysis - analysisrandom100_trees_brueelia.tre
b) a majority rule consensus tree - treeconsensus_tree_brueelia.tre
c) a maximum clade credibility tree - mcc_tree_brueelia.tre
The tree tips are labeled with louse voucher names, and can be referenced in Supplementary Table 1 of the associated publication.
2) Files related to a BEAST analysis with COI data:
a) the XML file used as input for the BEAST run, including model parameters, MCMC chain length, and priors - beast_parameters_coi_brueelia.xml
b) a file with 100 random post-burnin trees (10% burnin) from the BEAST posterior distribution of trees; used in OTU analysis - beast_100random_trees_brueelia.tre
c) an ultrametric maximum clade credibility tree - mcc_tree_beast_brueelia.tre
3) A maximum clade credibility tree of Brueelia-complex host species generated from a distribution of trees downloaded from https://birdtree.org/subsets/ - mcc_tree_brueelia_hosts.tre
4) Concatenated sequence alignment - concatenated_alignment_brueelia.nex
keywords:
bird lice; Brueelia-complex; passerines; multiple sequence alignment; phylogenetic tree; Bayesian phylogenetic analysis; MrBayes; BEAST
published:
2025-11-24
Dubinkina, Veronika; Bhogale, Shounak; Hsieh, Ping-Hung; Dibaeinia, Payam; Nambiar, Ananthan; Maslov, Sergei; Yoshikuni, Yasuo; Sinha, Saurabh
(2025)
Because of its natural stress tolerance to low pH, Issatchenkia orientalis (a.k.a. Pichia kudriavzevii) is a promising non-model yeast for bio-based production of organic acids. Yet, this organism is relatively unstudied, and specific mechanisms of its tolerance to low pH are poorly understood, limiting commercial use. In this study, we selected 12 I. orientalis strains with varying acid stress tolerance (six tolerant and six susceptible) and profiled their transcriptomes in different pH conditions to study potential mechanisms of pH tolerance in this species. We identified hundreds of genes whose expression response is shared by tolerant strains but not by susceptible strains, or vice versa, as well as genes whose responses are reversed between tolerant and susceptible strains. We mapped regulatory mechanisms of transcriptomic responses via motif analysis as well as differential network reconstruction, identifying several transcription factors, including Stb5, Mac1, and Rtg1/Rtg3, some of which are known for their roles in acid response in Saccharomyces cerevisiae. Functional genomics analysis of short-listed genes and transcription factors suggested significant roles for energy metabolism and translation-related processes, as well as the cell wall integrity pathway and RTG-dependent retrograde signaling pathway. Finally, we conducted additional experiments for two organic acids, 3-hydroxypropionate and citramalate, to eliminate acid-specific effects and found potential roles for glycolysis and trehalose biosynthesis specifically for response to low pH. In summary, our approach of comparative transcriptomics and phenotypic contrasting, along with a multi-pronged bioinformatics analysis, suggests specific mechanisms of tolerance to low pH in I. orientalis that merit further validation through experimental perturbation and engineering.
keywords:
Conversion;Transcriptomics
published:
2024-02-21
Hartman, Jordan H; Corush, Joel B; Larson, Eric R; Tiemann, Jeremy S; Willink, Philip; Davis, Mark A
(2024)
Data associated with the manuscript "Niche conservatism and spread explain hybridization and introgression between native and invasive fish" by Jordan H. Hartman, Joel B. Corush, Eric R. Larson, Jeremy S. Tiemann, Philip Willink, and Mark A. Davis. For this project, we combined results of ecological niche models (ENMs) and next-generation restriction site-associated DNA sequencing (RADseq) to test theories of niche conservatism and biotic resistance on the success of invasion, hybridization, and extent of introgression between native Western Banded Killifish and non-native Eastern Banded Killifish. This dataset provides the sampling locations and number of Banded Killifish in each population, accession numbers for RADseq from the National Center for Biotechnology Information Sequence Read Archive and the assignment of each Banded Killifish, the habitat associations of each population from the ENMs, and the occurrence points used to build the ENMs.
keywords:
Banded Killifish; ecological niche model; Fundulus diaphanus; hybrid swarm; invasive species; Laurentian Great Lakes