Illinois Data Bank Dataset Search Results
Results
published:
2025-10-17
Deewan, Anshu; Liu, Jing-Jing; Jagtap, Sujit Sadashiv; Yun, Eun Ju; Walukiewicz, Hanna E.; Jin, Yong-Su; Rao, Christopher V.
(2025)
Oleaginous yeasts have received significant attention due to their substantial lipid storage capability. The accumulated lipids can be utilized directly or processed into various bioproducts and biofuels. Lipomyces starkeyi is an oleaginous yeast capable of using multiple plant-based sugars, such as glucose, xylose, and cellobiose. It is, however, a relatively unexplored yeast due to limited knowledge about its physiology. In this study, we have evaluated the growth of L. starkeyi on different sugars and performed transcriptomic and metabolomic analyses to understand the underlying mechanisms of sugar metabolism. Principal component analysis showed clear differences resulting from growth on different sugars. We have further reported various metabolic pathways activated during growth on these sugars. We also observed non-specific regulation in L. starkeyi and have updated the gene annotations for the NRRL Y-11557 strain. This analysis provides a foundation for understanding the metabolism of these plant-based sugars and potentially valuable information to guide the metabolic engineering of L. starkeyi to produce bioproducts and biofuels.
keywords:
Conversion;Metabolomics;Transcriptomics
published:
2021-03-31
This archive contains the datasets used in the paper "Recursive MAGUS: scalable and accurate multiple sequence alignment".
- 16S.3, 16S.T, 16S.B.ALL
- HomFam
- RNASim
These can also be found at https://sites.google.com/eng.ucsd.edu/datasets/alignment/pastaupp
published:
2025-09-18
Saifuddin, Mustafa; Bhatnagar, Jennifer; Segrè, Daniel; Finzi, Adrien C.
(2025)
Respiration by soil bacteria and fungi is one of the largest fluxes of carbon (C) from the land surface. Although this flux is a direct product of microbial metabolism, controls over metabolism and their responses to global change are a major uncertainty in the global C cycle. Here, we explore an in silico approach to predict bacterial C-use efficiency (CUE) for over 200 species using genome-specific constraint-based metabolic modeling. We find that potential CUE averages 0.62 ± 0.17 with a range of 0.22 to 0.98 across taxa and phylogenetic structuring at the subphylum levels. Potential CUE is negatively correlated with genome size, while taxa with larger genomes are able to access a wider variety of C substrates. Incorporating the range of CUE values reported here into a next-generation model of soil biogeochemistry suggests that these differences in physiology across microbial taxa can feed back on soil-C cycling.
keywords:
Sustainability;Metabolomics;Modeling
published:
2021-08-28
Southey, Bruce; Rodriguez-Zas, Sandra
(2021)
Metabolite identifications and profiles of liver samples from 22 day old male and female pigs from gilt that exposed to porcine reproductive and respiratory syndrome virus (P) or not (C) that were weaned at 21 days of age (W) or not (N). Profiles were obtained by University of Illinois Carver Metabolomics Center. Spectrum for each sample was acquired using a gas chromatography mass spectrometry system consisting of an Agilent 7890 gas chromatograph, an Agilent 5975 MSD, and an HP 7683B auto sampler.
keywords:
gas chromatography; mass spectrometry; maternal immune activation; weaning; liver
published:
2025-01-29
Quiroz, Edwin; Ashley, Mary V.; Zaya, David N.
(2025)
These data records weekly aphid and monarch butterfly (Danaus plexippus) neonate counts on individual milkweed plants in multiple raised garden beds in Chicago during the summers of 2023 and 2024. Relationships between aphid infestation and monarch neonates can be investigated along with weekly trends of monarch oviposition and aphid abundances. All gardens included in this study were on the University of Illinois Chicago campus, and within 100 meters of proximity. Data are provided on three milkweed species in 2023, and one milkweed species in 2024.
keywords:
Aphis; Myzocallis; Danaus plexippus; urban gardens; Asclepias syriaca; milkweeds
published:
2021-04-22
All code in Matlab .m scripts or functions (version R2019b)
Affiliated with article “Temperate and chronic virus competition leads to low lysogen frequency” published in the Journal of Theoretical Biology (2021)
Codes simulate and plot the solutions of an Ordinary Differential Equations model and generate bifurcation diagrams.
published:
2024-12-01
Bishop, Rebecca C.; Kemper, Ann M.; Clark, Lindsay V.; Wilkins, Pamela A.; McCoy, Annette M.
(2024)
Healthy mares were kept at pasture for 3 weeks, stabled for 5 weeks, returned to pasture and an final sample collected 6 weeks later. Samples were collected weekly: gastric fluid by double-tube nasogastric intubation and aspiration, feces by rectal palpation. Microbial DNA was isolated using the QIAamp PowerFecal Pro DNA kit. Full length 16S, ITS and partial 23S rRNA gene libraries were created using the Shoreline Complete ID kit.
published:
2024-10-11
Zinnen, Jack; Barak, Rebecca; Matthews, Jeffrey
(2024)
This is the core data for Influence of ecological characteristics and phylogeny on native plant species’ commercial availability, a manuscript pending publication in Ecological Applications. The data regard ecological characteristics, phenology, and phylogeny of plant species native to the Midwestern United States and how those factors relate to commercial availability.
keywords:
biodiversity; native plant nursery; plant trade; plant vendors; restoration
published:
2021-05-09
Zuckermann, Federico
(2021)
Raw data and its analysis collected from a trial designed to test the impact of providing a Bacillus-based direct-fed microbial (DFM) on the syndrome resulting from orally infecting pigs with either Salmonella enterica serotype Choleraesuis (S. Choleraesuis) alone, or in combination with an intranasal challenge, three days later, with porcine reproductive and respiratory syndrome virus (PRRSV).
keywords:
excel file
published:
2025-11-14
Asadian, Marisa; Croslow, Seth; Trinklein, Timothy; Rubakhin, Stanislav; Lam, Fan; Sweedler, Jonathan
(2025)
We developed a sequential single-cell matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS) workflow that enables endogenous lipid profiling in the first step, followed by cell-type classification of the same cells via immunocytochemistry in the second step. This stepwise approach integrates high-throughput single-cell analysis enabled by microMS with multiplex immunolabeling using photocleavable mass tags (PCMTs), which are antibodies conjugated to peptide mass reporters that are photoreleased and then detected by MALDI-MS. This platform combines the strengths of untargeted chemical profiling with targeted marker-based cell identification, allowing characterization of the cells’ endogenous metabolic activity, followed by cell classification using well-established immunomarkers. Here, we provide the raw data, mzML-converted files, and LC-MS/MS data from rodent hippocampal cells as described in the manuscript.
keywords:
Single Cell Mass Spectrometry; MALDI; Hippocampal Cells; Lipidomics; Photocleavable Mass-tags
published:
2021-11-16
Prada, Cecilia M.; Turner, Benjamin L.; Dalling, James W.
(2021)
Data from an a field experiment at El Velo, Chiriqui, Republic of Panama. Data contain information about functional traits of seedlings growing in different treatments including type of forest, nitrogen addition and organic matter.
keywords:
Mycorrhiza; nitrogen; oak forest; Panama; plant-soil feedbacks, seedling growth
published:
2020-10-28
Curtis, Amanda; Tiemann, Jeremy; Douglass, Sarah; Davis, Mark; Larson, Eric
(2020)
We studied we examined the role of stream flow on environmental DNA (eDNA) concentrations and detectability of an invasive clam (Corbicula fluminea), while also accounting for other abiotic and biotic variables. This data includes the eDNA concentrations, quadrat estimates of clam density, and abiotic variables.
keywords:
Corbicula; detection probability; eDNA; invasive species; lotic; occupancy modeling
published:
2025-06-03
Han, Jaeyeong; Ficca, Alyson; Lanzatella, Marissa; Leang, Kanika; Barnum, Matthew; Boudreaux, Jonathan; Schroeder, Nathan
(2025)
This data comprises image files used in the analysis of Analysis of Nematode Ventral Nerve Cords Suggests Multiple Instances of Evolutionary Addition and Loss of Neurons by Han et al. (bioRxiv, 2025: doi: https://doi.org/10.1101/2025.03.20.644414). It is separated into two folders. The first comprise data using DAPI staining to quantify the number of VNC nuclei in diverse nematodes. The second includes dye-filling data of Mononchus aquaticus.
keywords:
C. elegans; Mononchus; neuroanatomy; nematode nervous system; ventral nerve cord; secondary simplification
published:
2025-08-21
Lu, Yi; Sweedler, Jonathan; Zhou, Shuaizhen; Zhou, Yu
(2025)
Engineering efficient biocatalysts is essential for metabolic engineering to produce valuable bioproducts from renewable resources. However, due to the complexity of cellular metabolic networks, it is challenging to translate success in vitro into high performance in cells. To meet such a challenge, an accurate and efficient quantification method is necessary to screen a large set of mutants from complex cell culture and a careful correlation between the catalysis parameters in vitro and performance in cells is required. In this study, we employed a mass-spectrometry based high-throughput quantitative method to screen new mutants of 2-pyrone synthase (2PS) for triacetic acid lactone (TAL) biosynthesis through directed evolution in E. coli. From the process, we discovered two mutants with the highest improvement (46 fold) in titer and the fastest kcat (44 fold) over the wild type 2PS, respectively, among those reported in the literature. A careful examination of the correlation between intracellular substrate concentration, Michaelis-Menten parameters and TAL titer for these two mutants reveals that a fast reaction rate under limiting intracellular substrate concentrations is important for in-cell biocatalysis. Such properties can be tuned by protein engineering and synthetic biology to adopt these engineered proteins for the maximum activities in different intracellular environments.
keywords:
catalysis; mass spectrometry; metabolic engineering
published:
2020-04-06
McCoy, Annette; Lopp, Christine; Kooy, Sarah; Migliorisi, Alessandro; Austin, Scott; Wilkins, Pamela
(2020)
Raw measurement data for umbilical remnants (umbilical vein, umbilical arteries and urachus) in support of Equine Veterinary Journal publication "Normal Regression of the Internal Umbilical Remnant Structures in Standardbred Foals."
keywords:
equine; umbilicus; ultrasound
published:
2025-04-29
Bose, Anish; Schuster, Keaton; Sonam, Surabhi; Kodali, Chandril; Smith-Bolton, Rachel
(2025)
This page contains the data for the publication "The pioneer transcription factor Zelda controls the exit from regeneration and restoration of patterning in Drosophila" published in the journal Science Advances.
keywords:
Drosophila; regeneration; wing imaginal disc; Zelda
published:
2022-04-19
Nowak, Romana; Yang, Shuhong; Li, Kailiang; Bi, Jiajia; Drnevich, Jenny
(2022)
List of differentially expressed genes in human endometrial stromal cells with knockdown of Basigin (BSG) gene expression during decidualization.
The BSG siRNA or negative scrambled control siRNA were transfected into human endometrial stromal cells (HESCs) following the protocol of siLentFect™ Lipid (Bio-Rad, Hercules, CA. Following complete knock down of BSG in HESCs (72 hours after adding siRNA), HESCs were treated with medium containing estrogen, progesterone and cAMP to induce decidualization. BSG siRNA and negative control scrambled siRNA were added to the cells every four days (day 0, 4) over the course of the decidualization protocol. Total RNA was harvested at day 6 of the decidualization protocol for microarray analysis. Microarray analysis was performed at the University of Illinois at Urbana-Champaign Roy J. Carver Biotechnology Center. Briefly, 0.2 micrograms of total RNA were labeled using the Agilent two color QuickAmp labeling kit (Agilent Technologies, Santa Clara, CA) according to the manufacturer’s protocol. The optional spike-in controls were not used. Samples were hybridized to Human Gene Expression 4x44K v2 Microarray (Agilent Technologies, Santa Clara, CA) in an Agilent Hybridization Cassette according to standard protocols. The arrays were then scanned on an Axon GenePix 4000B scanner and the images were quantified using Axon GenePix 6.1.
Microarray data pre-processing and statistical analyses were done in R (v3.6.2) using the limma package (3.42.0 (Ritchie et al., 2015). Median foreground and median background values from the 4 arrays were read into R and any spots that had been manually flagged (-100 values) were given a weight of zero. The background values were ignored because investigations showed that trying to use them to adjust for background fluorescence added more noise to the data; background was low and even for all arrays, therefore no background correction was done.
The individual Cy5 and Cy3 fluorescence for each array were normalized together using the quantile method 3 (Yang and Thorne, 2003). Agilent's Human Gene Expression 4x44K v2 Microarray has a total of 45,220 probes: 1224 probes for positive controls, 153 negative control, 823 labeled “ignore” and 43,118 labeled “cDNA”. The pos+neg+ignore probes were used to ascertain the background level of fluorescence (6, on the log2 scale) then discarded. The cDNA probes comprise 34,127 unique 60mer probes, of which 999 probes are spotted 10 times each and the rest one time each. We averaged the replicate probes for those spotted 10 times and then fit a mixed model that had treatment and dye as fixed effects and array pairing as a random effect (Phipson et al., 2016; Smyth et al., 2005). After fitting the model but before False Discovery Rate (FDR) correction (Benjamini and Hochberg, 1995), probes were filtered out by the following criteria: 1) did not have at least 4/8 samples with expression values > 6 (14,105 probes removed), 2) no longer had an assigned Entrez Gene ID in Bioconductor’s HsAgilentDesign026652.db annotation package (v3.2.3; 2,152 probes removed) (Huber et al., 2015), 3) mapped to the same Entrez Gene ID as another probe but had a larger p-value for treatment effect (4,141 probes removed). This left 13,729 probes representing 13,729 unique genes.
<b>*Please note: that there is a discrepancy between the file and the readme as this plain text is the actual data file of this dataset.</b>
keywords:
Basigin; endometrium; decidualization; human
published:
2025-04-26
Alvarez, Jennifer; Fraterrigo, Jennifer; Dalling, James; Edgington, John
(2025)
Historical census data collected at Trelease Woods from 1986 to 2004 with information on tree species, diameter at breast height (DBH), and plot location.
keywords:
old-growth; temperate forest; species composition; forest dynamics; historical data
published:
2025-10-13
Schultz, J. Carl; Mishra, Shekhar; Gaither, Emily; Mejia, Andrea; Dinh, Hoang V.; Maranas, Costas D.; Zhao, Huimin
(2025)
The oleaginous, carotenogenic yeast Rhodotorula toruloides has been increasingly explored as a platform organism for the production of terpenoids and fatty acid derivatives. Fatty alcohols, a fatty acid derivative widely used in the production of detergents and surfactants, can be produced microbially with the expression of a heterologous fatty acyl-CoA reductase. Due to its high lipid production, R. toruloides has high potential for fatty alcohol production, and in this study several metabolic engineering approaches were investigated to improve the titer of this product. Fatty acyl-CoA reductase from Marinobacter aqueolei was co-expressed with SpCas9 in R. toruloides IFO0880 and a panel of gene overexpressions and Cas9-mediated gene deletions were explored to increase the fatty alcohol production. Two overexpression targets (ACL1 and ACC1, improving cytosolic acetyl-CoA and malonyl-CoA production, respectively) and two deletion targets (the acyltransferases DGA1 and LRO1) resulted in significant (1.8 to 4.4-fold) increases to the fatty alcohol titer in culture tubes. Combinatorial exploration of these modifications in bioreactor fermentation culminated in a 3.7 g/L fatty alcohol titer in the LRO1Δ mutant. As LRO1 deletion was not found to be beneficial for fatty alcohol production in other yeasts, a lipidomic comparison of the DGA1 and LRO1 knockout mutants was performed, finding that DGA1 is the primary acyltransferase responsible for triacylglyceride production in R. toruloides, while LRO1 disruption simultaneously improved fatty alcohol production, increased diacylglyceride and triacylglyceride production, and increased glucose consumption. The fatty alcohol titer of fatty acyl-CoA reductase-expressing R. toruloides was significantly improved through the deletion of LRO1, or the deletion of DGA1 combined with overexpression of ACC1 and ACL1. Disruption of LRO1 surprisingly increased both lipid and fatty alcohol production, creating a possible avenue for future study of the lipid metabolism of this yeast.
keywords:
Conversion;Genome Engineering;Genomics
published:
2016-08-16
Nguyen, Nam-phuong; Nute, Mike; Mirarab, Siavash; Warnow, Tandy
(2016)
This archive contains all the alignments and trees used in the HIPPI paper [1]. The pfam.tar archive contains the PFAM families
used to build the HMMs and BLAST databases. The file structure is:
./X/Y/initial.fasttree
./X/Y/initial.fasta
where X is a Pfam family, Y is the cross-fold set (0, 1, 2, or 3). Inside the folder
are two files, initial.fasta which is the Pfam reference alignment with 1/4 of the
seed alignment removed and initial.fasttree, the FastTree-2 ML tree estimated on
the initial.fasta.
The query.tar archive contains the query sequences for each cross-fold set.
The associated query sequences for a cross-fold Y is labeled as query.Y.Z.fas,
where Z is the fragment length (1, 0.5, or 0.25). The query files are found
in the splits directory.
[1] Nguyen, Nam-Phuong D, Mike Nute, Siavash Mirarab, and Tandy Warnow. (2016) HIPPI: Highly Accurate Protein Family Classification with Ensembles of HMMs. To appear in BMC Genomics.
keywords:
HIPPI dataset; ensembles of profile Hidden Markov models; Pfam
published:
2025-12-09
Chase, Marissa H.; Fraterrigo, Jennifer M.; Charles, Brian; Harmon-Threatt, Alexandra
(2025)
The dataset includes bee community data from a study conducted down in southern Illinois across three forested public land sites. Bee diversity and abundance data, as well as environmental variables, are included for each plot. Each plot was visited a total of four times.
keywords:
wild bees; forest management; resource availability
published:
2025-04-26
Alvarez, Jennifer; Fraterrigo, Jennifer; Dalling, James
(2025)
Census data collected at Trelease Woods in 1936 with information on tree species, stem count, diameter at breast height (DBH), and basal area. The plot boundaries from the 1936 census were georeferenced to subset 2018 census data for a direct comparison between the two census years.
keywords:
old-growth; temperate forest; species composition; forest dynamics; historical data
published:
2025-11-07
Lee, Ye-Gi; Kang, Nam Kyu; Kim, Chanwoo; Tran, Vinh; Cao, Mingfeng; Yoshikuni, Yasuo; Zhao, Huimin; Jin, Yong-Su
(2025)
This study presents a cost-effective strategy for producing organic acids from glucose and xylose using the acid-tolerant yeast, Issatchenkia orientalis. I. orientalis was engineered to produce lactic acid from xylose, and the resulting strain, SD108XL, successfully converted sorghum hydrolysates into lactic acid. In order to enable low-pH fermentation, a self-buffering strategy, where the lactic acid generated by the SD108XL strain during fermentation served as a buffer, was developed. As a result, the SD108 strain produced 67 g/L of lactic acid from 73 g/L of glucose and 40 g/L of xylose, simulating a sugar composition of sorghum biomass hydrolysates. Moreover, techno-economic analysis underscored the efficiency of the self-buffering strategy in streamlining the downstream process, thereby reducing production costs. These results demonstrate the potential of I. orientalis as a platform strain for the cost-effective production of organic acids from cellulosic hydrolysates.
keywords:
Conversion;Gene Editing;Hydrolysate;Metabolic Engineering
published:
2025-11-15
Hassell Jr., James E.; Arellano Perez, Angel David; Vasudevan, Krithika; Ressler, Reed; Garcia, Gabriela; Parr, Madison; Vierkant, Valerie; Bayer, Hugo; Maren, Stephen
(2025)
This dataset contains all the raw data, figures, and Prism files corresponding to each experiment performed for the paper “Hippocampal ensembles regulate circuit-induced relapse of extinguished fear.”
keywords:
Hippocampal ensembles; circuit-induced relapse, fear extinction.
published:
2021-05-14
Gramig, Benjamin; Khanna, Madhu; Jain, Atul
(2021)
This document contains the Supplemental Materials for Chapter 4: Climate Change Impacts on Agriculture from the report "An Assessment of the Impacts of Climate Change in Illinois" published in 2021.
keywords:
Illinois; climate change; agriculture; impacts; adaptation; crop yield; ISAM; econometrics; days suitable for fieldwork