Illinois Data Bank Dataset Search Results
Results
published:
2025-11-10
Raj, Tirath; Dien, Bruce; Singh, Vijay
(2025)
Sugarcane is being enhanced as a bioenergy crop by engineering it to accumulate and store lipids along with polymeric sugars in vegetative tissues. However, there is no existing process that allows for processing this new crop to recover both lipid and cellulosic sugars from the oilcane bagasse. Therefore, a comprehensive investigation of two pretreatment methods—natural deep eutectic solvents (NADES) and chemical-free hydrothermal pretreatment (HT) was conducted to judge their suitability for recovering fermentable sugars, lipids, and lignin from bagasse. Two NADES, i.e., choline chloride: lactic acid (ChCl:LA) and betaine: lactic acid (BT:LA) were prepared using a 1:2 M ratio and were evaluated for pretreatment of oilcane bagasse at 10, 20, and 50 % (w/w) solids, followed by enzymatic hydrolysis at 10 % (w/w) solids. Notably, ChCl:LA NADES treatment at 10 % (w/w) solids at 140 °C for 2 h, solubilized 78.8 % of lignin and 80.4 % of hemicellulose and allowed 82.7 % enzymatic conversion of glucans to glucose. In contrast, HT pretreatment removed approximately 87.6 % of the hemicellulose and provided an enzymatic glucose yield of 69.7 %. Furthermore, ChCl:LA operated at 50 % solids loading the enriched lipids 2.6-fold (9.2 wt%) in recovered solids compared to HT (6.4 %) and BT:LA (5.1 %) pretreatment processes. NMR-HSQC and GPC analysis showed that ChCl:LA also cleaved the most lignin β–O–4 linkages and demonstrated lower molecular weight compared to HT. This study demonstrates that NADES pretreatment is an effective green processing method for recovering lipids, sugars, and lignin from bioenergy crops at high solid loading (50 % w/w) within the context of an integrated biorefinery.
keywords:
Conversion;Hydrolysate;Lipidomics
published:
2022-08-06
Carson, Dawn; Kopsco, Heather; Gronemeyer, Peg; Mateus-Pinilla, Nohra; Smith, Genee; Sandstrom, Emma; Smith, Rebecca
(2022)
An online knowledge, attitudes, and practices survey on ticks and tick-borne diseases was distributed to medical professionals in Illinois during summer 2020 to fall 2021. These are the raw data associated with that survey and the survey questions used. Age, gender, and county of practice have been removed for identifiability. We have added calculated values (columns 165 to end), including: the tick knowledge score, TBD knowledge score, and total knowledge score, which are the sum of the total number of correct answers in each category, and score percent, which are the proportion of correct answers in each category; region, which is determined from the county of practice; TBD relevant practice, which separates the practice variable into TBD primary, secondary, and non-responders; and several variables which group categories.
keywords:
ticks; medicine; tick-borne disease; survey
published:
2022-11-01
Beilke, Elizabeth; Haulton, Scott; O'Keefe, Joy
(2022)
Datasets that accompany Beilke, Haulton, and O'Keefe 2022 publication (Title: Foliage-roosting eastern red bats select for features associated with management in a central hardwood forest; Journal: Forest Ecology and Management).
published:
2022-09-14
Beilke, Elizabeth; O'Keefe, Joy
(2022)
Datasets that accompany Beilke and O'Keefe 2022 publication (Title: Bats reduce insect density and defoliation in temperate forests: an exclusion experiment; Journal: Ecology).
keywords:
bats; defoliation; ecosystem services; forests, insectivory; insects; trophic cascades
published:
2024-12-12
Varela, Sebastian; Leakey, Andrew
(2024)
This dataset supports the implementation described in the manuscript "Breaking the Barrier of Human-Annotated Training Data for Machine-Learning-Aided Biological Research Using Aerial Imagery." It comprises UAV aerial imagery used to execute the code available at https://github.com/pixelvar79/GAN-Flowering-Detection-paper. For detailed information on dataset usage and instructions for implementing the code to reproduce the study, please refer to the GitHub repository.
keywords:
Plant phenotyping; generative and adversarial learning; phenotyping; UAV; UAS, drone
published:
2022-11-02
This dataset contains the behavioral, metabolic, and capture data which is reported within the manuscript Data for Capture is predicted by behavior and size, not metabolism, in Muskellunge
published:
2024-09-24
Sawyer, Elle; Kreps, Timothy; Lodge, David; Larson, Eric
(2024)
Data at the lake summary and individual crayfish level that supports the manuscript Sawyer, E.K., Kreps, T. A., Lodge, D. M. and E.R. Larson. “Long-term declines in body size of the invasive rusty crayfish (Faxonius rusticus) in temperate lakes." Includes size measurements of 69,303 individual rusty crayfish (Faxonius rusticus) for 17 lakes of Vilas County, Wisconsin, United States collected between 1980 and 2020.
keywords:
body size; Faxonius rusticus; invasive species; non-native species; rusty crayfish; Wisconsin; Vilas County
published:
2026-01-12
Dinh, Hoang; Sarkar, Debolina; Maranas, Costas
(2026)
In the repository are example scripts that perform uncertainty injection and propagation to flux balance analysis with outputs for a small sample size (for demonstration purpose only). For proper analysis, user should download the scripts and run for a large sample size (e.g., 10,000 samples).
If you use the scripts, please cite the following Metabolic Engineering article: “Quantifying the propagation of parametric uncertainty on flux balance analysis” (https://doi.org/10.1016/j.ymben.2021.10.012)
There are two subdirectories:
/uncFBA/uncBiom: injection of normally distributed noise to biomass precursor coeffcients and ATP maintenance (growth-associated ATP maintenance (GAM) and non-growth associated ATP maintenance (NGAM))
/uncFBA/uncRHS: departure from steady-state by adding noise drawn from normal distribution to the RHS terms of mass balance constraints
keywords:
Metabolomics; Modeling
published:
2025-04-27
Alvarez, Jennifer; Fraterrigo, Jennifer; Dalling, James
(2025)
Downed woody debris census data for Trelease Woods collected in the summer of 2022. Dataset contains volume, biomass, decay class, and GPS coordinates for each downed woody debris piece.
keywords:
Old-growth; temperate forest; downed woody debris; coarse woody debris; census data
published:
2025-04-28
Alvarez, Jennifer; Fraterrigo, Jennifer; Dalling, James
(2025)
Dataset of the standing dead trees at Trelease Woods in 2022. Dataset contains volume, biomass, decay class, and GPS coordinates for each standing dead tree.
keywords:
old-growth; temperate forest; standing deadwood; census data
published:
2025-04-27
Alvarez, Jennifer; Fraterrigo, Jennifer; Dalling, James
(2025)
Soil data for ten soil cores collected at Trelease Woods in 2022. Soil samples were analyzed with an elemental analyzer via combustion to obtain total carbon (C) and nitrogen. A subset of these samples were analyzed using the Walkley-Black method to obtain organic C. A calibration curve relating organic C and total C was created using these data.
keywords:
old-growth; temperate forest; soil carbon; soil nitrogen; nutrient cycling
published:
2021-08-27
The dataset shows all poison frogs (superfamily Dendrobatoidea) in private U.S. collections during 1990–2020. For each species and color morph, there is a date of arrival, the way it arrived in U.S. collections, and detailed notes related to its presence in the pet trade.
keywords:
pet trade; amphibians; Dendrobatidae
published:
2025-11-07
Ahmed, Md Wadud; Esquerre, Carlos A.; Eilts, Kristen; Allen, Dylan P.; McCoy, Scott M.; Varela, Sebastian; Singh, Vijay; Leakey, Andrew; Kamruzzaman, Mohammad
(2025)
Compositional characterization of biomass is vital for the biofuel industry. Traditional wet chemistry-based methods for analyzing biomass composition are laborious, time-consuming, and require extensive use of chemical reagents as well as highly skilled personnel. In this study, near-infrared (NIR) spectroscopy was used to quickly assess the composition of above-ground vegetative biomass from 113 diverse, photoperiod-sensitive, biomass-type sorghum (Sorghum bicolor) accessions cultivated under field conditions in Central Illinois. Biomass samples were analyzed using NIR spectra collected in the spectral range of 867–2536 nm, with their chemical compositions determined following the National Renewable Energy Laboratory (NREL) protocol. Advanced spectral pre-treatment and band selection techniques were utilized to develop calibration models using partial least squares regression (PLSR). The models’ effectiveness was assessed through cross-validation and independent data tests. The predictions for moisture, ash, extractives, glucan, xylan, acid-soluble lignin (ASL), acid-insoluble lignin (AIL), and total lignin were accurate and reliable, demonstrating the capability of NIR spectroscopy to provide rapid and precise characterization of sorghum biomass. The results demonstrated that NIR spectroscopy is an efficient tool for rapidly characterizing sorghum biomass, making it a sustainable option for screening desirable feedstock for biofuel or bioproduct production.
keywords:
Conversion;Feedstock Production;Biomass Analytics;Modeling
published:
2025-11-10
Banerjee, Shivali; Eilts, Kristen; Singh, Vijay
(2025)
Oilcane is an engineered sugarcane with the ability to hyper-accumulate vegetative lipids. It is processed to obtain juice and bagasse as a potential substrate for the production of biofuels and biochemicals. The juice comprises solid particles that are separated as waste mud before the fermentation of the juice. In this study, the oilcane waste mud (OWM) generated from 1000 liters of oilcane juice was quantified and evaluated as a potential resource for recovering biobased waxes. Hexane and ethyl acetate were evaluated as two different solvents for extracting waxes from OWM followed by its purification using acetone. The extracted biobased wax samples were characterized for their chemical and thermal profiles which were then compared with commercial natural waxes. Detailed mass balance shows that 53.6 ± 2.6 kg (dry basis) of solid OWM gets generated upon processing 1000 L (~1068 kg) of oilcane juice. Hexane and ethyl acetate led to a crude wax yield of 25.6 ± 0.2% and 16.6 ± 0.4% (wt/wt, dry basis) respectively from OWM at the end of 8 h. The relative purification of the wax samples was reported in the range of 58%–65% (wt/wt). The purified OWM wax has a melting point of 74.7°C. The waste mud was valorized as a source of biobased waxes with characteristic chemical and thermal profiles comparable to commercial natural waxes (carnauba and beeswax). Considering the decline in the supply of petroleum wax in the future coupled with the switch to “greener” alternative products by consumers, OWM could be a valuable source of natural wax in the industrial sector reducing the dependence on petroleum waxes. Eventually, recovering biobased wax as a co-product from OWM would bring in an additional stream of revenue leading to the development of a zero-waste biorefinery based on bioenergy crops.
keywords:
Conversion;Biomass Analytics;Feedstock Bioprocessing;Hydrolysate
published:
2021-12-28
Xia, Yushu; Wander, Michelle
(2021)
*Updates for this V3: added a few more records and rearranged the sequence of the tables in order to support our new paper "Evaluation of Indirect and Direct Scoring Methods to Relate Biochemical Soil Quality Indicators to Ecosystem Services" accepted by the Soil Science Society of America Journal.
We summarize peer reviewed literature reporting associations between for three soil quality indicators (SQIs) (β-glucosidase (BG), fluorescein diacetate (FDA) hydrolysis, and permanganate oxidizable carbon (POXC)) and crop yield and greenhouse gas emissions. Peer-reviewed articles published between January of 1990 and May 2018 were searched using the Thomas Reuters Web of Science database (Thomas Reuters, Philadelphia, Pennsylvania) and Google Scholar to identify studies reporting results for: “β-glucosidase”, “permanganate oxidizable carbon”, “active carbon”, “readily oxidizable carbon”, or “fluorescein diacetate hydrolysis”, together with one or more of the following: “crop yield”, “productivity”, “greenhouse gas’, “CO2”, “CH4”, or “N2O”.
Meta-data for records include the following descriptor variables and covariates useful for scoring function development: 1) identifying factors for the study site (location, duration of the experiment), 2) soil textural class, pH, and SOC, 3) depth of soil sampling, 4) units used in published works (i.e.: equivalent mass, concentration), 5) SQI abundances and measured ecosystem functions, and 6) summary statistics for correlation between SQIs and functions (yield and greenhouse gas emissions).
*Note: Blank values in tables are considered unreported data.
keywords:
Soil health promoting practices; Soil quality indicators; β-glucosidase; fluorescein diacetate hydrolysis; Permanganate oxidizable carbon; Greenhouse gas emissions; Scoring curves; Soil Management Assessment Framework
published:
2025-01-06
Shilikbay, Temirlan; Nawaz, Aatiqa; Doon, Megan; Ceman, Stephanie
(2025)
The complete data for the publication "RNA helicase MOV10 suppresses fear memory and dendritic arborization and regulates microtubule dynamics in hippocampal neurons," excluding sequencing data deposited in GEO, is provided here.
keywords:
MOV10; NUMA1; hippocampal neurons; behavior; cytoskeleton; tiff; czi; dv; mp4; mpg; ndpi; csv; xlsx; R
published:
2025-09-08
Hudson, Matthew; Zhao, Huimin; Sweedler, Jonathan; Shanklin, John; Cahoon, Edgar; Root, Mike; Burgess, Steven; Park, Kiyoul; Zhou, Shuaizhen; Blanford, Jantana; Lane, Stephan; Croslow, Seth; Dong, Jia
(2025)
Plant bioengineering is a time-consuming and labor-intensive process with no guarantee of achieving desired traits. Here, we present a fast, automated, scalable, high-throughput pipeline for plant bioengineering (FAST-PB) in maize (Zea mays) and Nicotiana benthamiana. FAST-PB enables genome editing and product characterization by integrating automated biofoundry engineering of callus and protoplast cells with single-cell matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS). We first demonstrated that FAST-PB could streamline Golden Gate cloning, with the capacity to construct 96 vectors in parallel. Using FAST-PB in protoplasts, we found that PEG2050 increased transfection efficiency by over 45%. For proof-of-concept, we established a reporter-gene-free method for CRISPR editing and phenotyping via mutation of high chlorophyll fluorescence 136. We show that diverse lipids were enhanced up to 6-fold using CRISPR activation of lipid controlling genes. In callus cells, an automated transformation platform was employed to regenerate plants with enhanced lipid traits through introducing multigene cassettes. Lastly, FAST-PB enabled high-throughput single-cell lipid profiling by integrating MALDI-MS with the biofoundry, protoplast, and callus cells, differentiating engineered and unengineered cells using single-cell lipidomics. These innovations massively increase the throughput of synthetic biology, genome editing, and metabolic engineering and change what is possible using single-cell metabolomics in plants.
keywords:
AI/ML; genome engineering; metabolic engineering; phenotyping
published:
2021-06-25
Szydlowski, Daniel; Daniels, Melissa; Larson, Eric
(2021)
Data associated with the manuscript "Do rusty crayfish invasions affect water clarity in north temperate lakes?" by Daniel K. Szydlowski, Melissa K. Daniels, and Eric R. lARSON
keywords:
chlorophyll a; crayfish; Faxonius rusticus; invasive species; lakes; LandSat; remote sening; rusty crayfish; Secchi disc; water clarity
published:
2025-10-10
Tran, Vinh; Cao, Mingfeng; Fatma, Zia; Song, Xiaofei; Zhao, Huimin
(2025)
The nonconventional yeast Issatchenkia orientalis has emerged as a potential platform microorganism for production of organic acids due to its ability to grow robustly under highly acidic conditions. However, lack of efficient genetic tools remains a major bottleneck in metabolic engineering of this organism. Here we report that the autonomously replicating sequence (ARS) from Saccharomyces cerevisiae (ScARS) was functional for plasmid replication in I. orientalis, and the resulting episomal plasmid enabled efficient genome editing by the CRISPR/Cas9 system. The optimized CRISPR/Cas9-based system employed a fusion RPR1′-tRNA promoter for single guide RNA (sgRNA) expression and could attain greater than 97% gene disruption efficiency for various gene targets. Additionally, we demonstrated multiplexed gene deletion with disruption efficiencies of 90% and 47% for double gene and triple gene knockouts, respectively. This genome editing tool can be used for rapid strain development and metabolic engineering of this organism for production of biofuels and chemicals.
keywords:
Conversion;Genomics;Genome Engineering;Transcriptomics
published:
2022-07-08
Rahlin, Anastasia; Saunders, Sarah; Beilke, Stephanie
(2022)
Dataset for "Spatial drivers of wetland bird occupancy within an urbanized matrix in the Upper Midwestern United States" manuscript contains occupancy data for ten wetland bird species used in single-species occupancy models at four spatial scales and four wetland habitat types. Data were collected from 2017-2019 in NE Illinois and NW Indiana. Dataset includes wetland bird occupancy data, habitat parameter values for each survey location, and R code used to run analyses.
keywords:
wetland birds; occupancy; emergent wetland; urbanization; Great Lakes region
published:
2025-11-19
Xu, Hao; Shi, Longyuan; Boob, Aashutosh; Park, Wooyoung; Tan, Shih-I; Tran, Vinh; Schultz, J. Carl; Zhao, Huimin
(2025)
Rhodotorula toruloides is a non-model, oleaginous yeast uniquely suited to produce acetyl-CoA-derived chemicals. However, the lack of well-characterized genomic integration sites has impeded the metabolic engineering of this organism. Here we report a set of computationally predicted and experimentally validated chromosomal integration sites in R. toruloides. We first implemented an in silico platform by integrating essential gene information and transcriptomic data to identify candidate sites that meet stringent criteria. We then conducted a full experimental characterization of these sites, assessing integration efficiency, gene expression levels, impact on cell growth, and long-term expression stability. Among the identified sites, 12 exhibited integration efficiencies of 50% or higher, making them sufficient for most metabolic engineering applications. Using selected high-efficiency sites, we achieved simultaneous double and triple integrations and efficiently integrated long functional pathways (up to 14.7 kb). Additionally, we developed a new inducible marker recycling system that allows multiple rounds of integration at our characterized sites. We validated this system by performing five sequential rounds of GFP integration and three sequential rounds of MaFAR integration for fatty alcohol production, demonstrating, for the first time, precise gene copy number tuning in R. toruloides. These characterized integration sites should significantly advance metabolic engineering efforts and future genetic tool development in R. toruloides.
keywords:
Conversion;Metabolic Engineering;Software;Transcriptomics
published:
2021-05-14
Supplemental Forest Data for Chapter 6: Climate Change Impacts on Ecosystems in "An Assessment of the Impacts of Climate Change in Illinois"
published:
2021-05-14
Cattai de Godoy, Maria
(2021)
- The aim of this research was to evaluate the novel dietary fiber source, miscanthus grass, in comparison to traditional fiber sources, and their effects on the microbiota of healthy adult cats. Four dietary treatments, cellulose (CO), miscanthus grass fiber (MF), a blend of miscanthus fiber and tomato pomace (MF+TP), or beet pulp (BP) were evaluated.<br /><br />- The study was conducted using a completely randomized design with twenty-eight neutered adult, domesticated shorthair cats (19 females and 9 males, mean age 2.2 ± 0.03 yr; mean body weight 4.6 ± 0.7 kg, mean body condition score 5.6 ± 0.6). Total DNA from fresh fecal samples was extracted using Mo-Bio PowerSoil kits (MO BIO Laboratories, Inc., Carlsbad, CA). Amplification of the 292 bp-fragment of V4 region from the 16S rRNA gene was completed using a Fluidigm Access Array (Fluidigm Corporation, South San Francisco, CA). Paired-end Illumina sequencing was performed on a MiSeq using v3 reagents (Illumina Inc., San Diego, CA) at the Roy J. Carver Biotechnology Center at the University of Illinois.
<br />- Filenames are composed of animal name identifier, diet (BP= beet pulp; CO= cellulose; MF= miscanthus grass fiber; TP= blend of miscanthus fiber and tomato pomace).
keywords:
cats; dietary fiber; fecal microbiota; miscanthus grass; nutrient digestibility; postbiotics
published:
2023-03-06
Zhou, Shuaizhen; Sweedler, Jonathan V.
(2023)
This dataset includes mass spectrometry, library screening, and gas chromatography data used for creating a high-throughput screening in metabolic engineering.
keywords:
mass spectrometry; gas chromatography
published:
2025-06-06
Smith, Rebecca; Kopsco, Heather; Ceniceros, Ashley; Carson, Dawn
(2025)
The materials used to provide Continuing Medical Education on ticks and tick-borne diseases in Illinois on February 1, 2023 at Carle Hospital, along with the pre- and post-quiz and deidentified data of the quiz takers.
Files:
"Ticks and Tick-borne Diseases of Illinois_Final_w_speaker_notes.pptx": Presentation slides used for CME course, with notes to indicate verbal commentary
"CME assessment_final.docx": Pre- and post-CME quiz questions and answers, annotated to indicate correct answers and reasoning for incorrect answers
"CME_prequiz_data_for_sharing.csv": De-identified data from pre-CME quiz
"CME_postquiz_data_for_sharing.csv": De-identified data from post-CME quiz, including demographics
"DataCleaning_forSharing.R": R file used to clean the raw data and calculate the scores
"ReadMe.txt":
keywords:
tick-borne disease; CME