Illinois Data Bank
Displaying 76 - 100 of 516 in total
Subject Area
Funder
Publication Year
License
Illinois Data Bank Dataset Search Results

Dataset Search Results

published: 2025-06-23
 
This repository contains data and model weights associated with the publication "ESMDynamic: Fast and Accurate Prediction of Protein Dynamic Contact Maps from Single Sequences". It includes the datasets used for training and evaluating a dynamic contact prediction model, ESMDynamic, as well as a script for conversion and usage.
keywords: Computational biology; Structural biology; Molecular dynamics; Machine learning; Protein modeling; Bioinformatics; Biophysics; Artificial intelligence
published: 2025-08-21
 
Engineering efficient biocatalysts is essential for metabolic engineering to produce valuable bioproducts from renewable resources. However, due to the complexity of cellular metabolic networks, it is challenging to translate success in vitro into high performance in cells. To meet such a challenge, an accurate and efficient quantification method is necessary to screen a large set of mutants from complex cell culture and a careful correlation between the catalysis parameters in vitro and performance in cells is required. In this study, we employed a mass-spectrometry based high-throughput quantitative method to screen new mutants of 2-pyrone synthase (2PS) for triacetic acid lactone (TAL) biosynthesis through directed evolution in E. coli. From the process, we discovered two mutants with the highest improvement (46 fold) in titer and the fastest kcat (44 fold) over the wild type 2PS, respectively, among those reported in the literature. A careful examination of the correlation between intracellular substrate concentration, Michaelis-Menten parameters and TAL titer for these two mutants reveals that a fast reaction rate under limiting intracellular substrate concentrations is important for in-cell biocatalysis. Such properties can be tuned by protein engineering and synthetic biology to adopt these engineered proteins for the maximum activities in different intracellular environments.
keywords: catalysis; mass spectrometry; metabolic engineering
published: 2025-08-21
 
Viral vectors provide an increasingly versatile platform for transformation-free reagent delivery to plants. RNA viral vectors can be used to induce gene silencing, overexpress proteins, or introduce gene editing reagents; however, they are often constrained by carrying capacity or restricted tropism in germline cells. Site-specific recombinases that catalyze precise genetic rearrangements are powerful tools for genome engineering that vary in size and, potentially, efficacy in plants. In this work, we show that viral vectors based on tobacco rattle virus (TRV) deliver and stably express four recombinases ranging in size from ∼0.6 to ∼1.5 kb and achieve simultaneous marker removal and reporter activation through targeted excision in transgenic Nicotiana benthamiana lines. TRV vectors with Cre, FLP, CinH, and Integrase13 efficiently mediated recombination in infected somatic tissue and led to heritable modifications at high frequency. An excision-activated Ruby reporter enabled simple and high-resolution tracing of infected cell lineages without the need for molecular genotyping. Together, our experiments broaden the scope of viral recombinase delivery and offer insights into infection dynamics that may be useful in developing future viral vectors.
keywords: gene editing; genome engineering; plant transformation
published: 2025-08-20
 
The compiled datasets include detailed costs for switchgrass production, categorized into establishment, maintenance, and harvesting expenses, along with revenue calculations. Costs were gathered from multiple sources and adjusted for inflation, focusing on farm-gate profitability, excluding fixed costs and transportation. All financial data is provided per hectare. The dataset was used to evaluate the economic performance of forage- and bioenergy-type switchgrass cultivars and their response to nitrogen fertilization across diverse marginal environments in the U.S. Midwest. Data Envelopment Analysis (DEA) and cost-benefit analysis were employed to assess the efficiency and profitability of 23 different cultivar and fertilization rate combinations over five years.
published: 2025-08-08
 
Rubisco activase is an ATP-dependent chaperone that facilitates dissociation of inhibitory sugar phosphates from the catalytic sites of Rubisco during photosynthesis. In Arabidopsis, Rubisco activase is negatively regulated by dark-dependent phosphorylation of Thr78. The prevalence of Thr78 in Rubisco activase was investigated across sequences from 91 plant species, finding that 29 (∼32%) species shared a threonine in the same position. Analysis of seven C3 species with an antibody raised against a Thr78 phospho-peptide demonstrated that this position is phosphorylated in multiple genera. However, light-dependent dephosphorylation of Thr78 was observed only in Arabidopsis. Further, phosphorylation of Thr78 could not be detected in any of the four C4 grass species examined. The results suggest that despite conservation of Thr78 in Rubisco activase from a wide range of species, a regulatory role for phosphorylation at this site is more limited. This provides a case study for how variation in post-translational regulation can amplify functional divergence across the phylogeny of plants beyond what is explained by sequence variation in a metabolically important protein.
keywords: photosynthesis; sorghum
published: 2025-08-07
 
Bioenergy and bioproduct markets are expanding to meet demand for climate friendly goods and services. Perennial biomass crops are particularly well suited for this goal because of their high yields, low input requirements, and potential to increase soil carbon (C). However, it is unclear how much C is allocated into belowground pools by perennial bioenergy crops and whether the belowground benefits vary with nitrogen (N) fertilizer inputs. Using in situ 13C pulse-chase labeling, we tested whether the sterile perennial grass Miscanthus × giganteus (miscanthus) or annual maize transfers more photosynthetic C to belowground pools. The experiment took place at two sites in Central and Northwest (NW) Iowa with different management histories and two nitrogen (N) fertilizer rates (0 and 224 kg N ha-1 yr-1) to determine if the fate of plant-derived soil C depends on soil fertility and crop type (perennial or annual). Maize allocated a greater percentage of total new 13C to roots than miscanthus, but miscanthus had greater new 13C in total and belowground plant biomass. We found strong interactions between site and most soil measurements – including new 13C in mineral and particulate soil organic matter (SOM) pools –which appear to be driven by differences in historical fertilizer management. The NW Iowa site, with a history of manure inputs, had greater plant-available nutrients (phosphorus, potassium, and ammonium) in soils, and resulted in less 13C from miscanthus in SOM pools compared to maize (approximately 64% less in POM and 70% less in MAOM). In more nutrient-limited soils (Central site), miscanthus transferred 4.5 times more 13C than maize to the more stable mineral-associated SOM pool. Our results suggest that past management, including historical manure inputs that affect a site’s soil fertility, can influence the net C benefits of bioenergy crops. Dataset includes tables/figures from article and supplementary info. Dryad contains raw data.
keywords: land management; carbon; miscanthus; maize
published: 2025-08-05
 
This dataset includes all data used in the manuscript by Carrica and Gulley titled, "Ontogeny of catechol-o-methyltransferase expression in the rat prefrontal cortex: effects of methamphetamine exposure"
keywords: dopamine clearance; adolescence; drug exposure; prefrontal cortex
published: 2025-07-25
 
This dataset contains the pregnancy status of wild, white-tailed deer (Odocoileus virginianus) from northern Illinois culled as part of the Illinois Department of Natural Resources' chronic wasting disease (CWD) surveillance program. Fiscal years 2005 through 2024 are included. A fiscal year is the time between July 1st of one calendar year and June 30th of the next. Variables in this dataset include the pregnancy status, CWD infection status, age, weight, and day of mortality for each female deer, as well as the deer land cover utility (LCU) score for the TRS, township, or county from which the deer was culled. The deer population density of the county is also included. Data have been anonymized for landowner privacy reasons so that the location and year are not identifiable, but will give the same modeling results by maintaining how the data are grouped. The R code used to conduct the regression modeling is also included.
keywords: cervid; Cervidae, chronic wasting disease; CWD; reproduction; white-tailed deer; Odocoileus virginianus; pregnancy; regression
published: 2025-07-21
 
This dataset includes image stacks, annotated counts, and ground-truth masks from two high-resolution sediment cores extracted from Laguna Pallcacocha, in El Cajas National Park, Ecuadorian Andes by Moy et al. (2002) and Hagemans et al. (2021). The first core (PAL 1999, from Moy et al. (2002)) extends through the Holocene (11,600 cal. yr. BP - present). There are a total of 900 annotated image stacks and masks in the PAL 1999 domain. The second core (PAL IV, from Hagemans et al. (2021)) captures the 20th century. There are 2986 annotated image stacks and masks in the PAL IV domain. Different microscopes and annotations tools were used to image and annotate each core and there are corresponding differences in naming conventions and file formats. Thus, we organized our data separately for the PAL 1999 and the PAL IV domains. The three letter codes used to label our pollen annotations are in the file: “Pollen_Identification_Codes.xlsx”. Both domain directories contain: • Image stacks organized by subdirectory • Annotations within each image stack directory, containing specimen identifications using a three letter code and coordinates defining bounding boxes or circles • Ground-truth distance-transform masks for each image stack The zip file "bestValModel_encoder.paramOnly.zip" is the trained pollen detection model produced from the images and annotations in this dataset. Please cite this dataset as: Feng, Jennifer T.; van den Berg, Thya; Donders, Timme H.; Kong, Shu; Puthanveetil Satheesan, Sandeep; Punyasena, Surangi W. (2025): Slide scans, annotated pollen counts, and trained pollen detection models for fossil pollen samples from Laguna Pallcacocha, El Cajas National Park, Ecuador . University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-4207757_V1 Please also include citations of the original publications from which these data are taken: Feng, Jennifer T., Sandeep Puthanveetil Satheesan, Shu Kong, Timme H. Donders, and Surangi W. Punyasena. “Addressing the ‘Open World’: Detecting and Segmenting Pollen on Palynological Slides with Deep Learning.” bioRxiv, January 1, 2025. https://doi.org/10.1101/2025.01.05.631390. Feng, Jennifer T., Sandeep Puthanveetil Satheesan, Shu Kong, Timme H. Donders, and Surangi W. Punyasena. “Addressing the ‘Open World’: Detecting and Segmenting Pollen on Palynological Slides with Deep Learning.” Paleobiology, 2025 [in press]. Feng, J. T. (2023). Open-world deep learning applied to pollen detection (MS thesis, University of Illinois at Urbana-Champaign). https://hdl.handle.net/2142/120168
keywords: continual learning; deep learning; domain gaps; open-world; palynology; pollen grain detection; taxonomic bias
published: 2024-11-15
 
This page contains the data for the manuscript "Vacuolating cytotoxin A interactions with the host cell surface". This manuscript is currently in prep.
keywords: Steven R Blanke; Vacuolating cytotoxin A; VacA; Helicobacter pylori; protein binding; sphingomyelin; cell surface
published: 2025-06-24
 
This supporting information file contains codes related to pending publication Ge et al. Proc. Nat. Acad. Sci. USA, (revisions in review). The contents include a Mathematica code that solves the Laplace transformed equations and generates figures from the paper. A python code is included for generation of Figure 5 in the main text.
keywords: Population balance model; Covalent organic framework; Nucleation; Growth;
published: 2025-05-21
 
This dataset includes a total of 16 images of 2 extant species of Podocarpus (Podocarpaceae) and 23 images of fossil specimens of the morphogenus Podocarpidites. The images were taken using a Zeiss LSM 880 microscope with Airyscan confocal superresolution at 630x magnification (63x/NA 1.4 oil DIC). The images are in the original CZI file format. They can be opened using Zeiss propriety software (Zen, Zen lite) or open microscopy software, such as ImageJ. More information on how to open CZI files can be found here: [https://www.zeiss.com/microscopy/us/products/software/zeiss-zen/czi-image-file-format.html] For Podocarpus (modern specimens): Each folder is labelled by genus and contain all images corresponding to that genus. Detailed information about the folders, files, and specimens can be found in the Excel file "METADATA_Podocarpus_extant.csv". This file includes metadata on: species, slide ID, collection, folder name file name and notes. Images are of pollen grains from slides in the Florida Museum of Natural History collections. For Podocarpidites (fossil specimens): Each image is named after the sample from which it was derived. Detailed information about the specimens can be found in the Excel file "METADATA_ Podocarpidites_fossil.csv". This file includes metadata: the fossil type (Taxon), the slide and sample name (Slide Info), the location of the sample locality (Country, Latitude, Longitude), the age of the sample (Min age, Max age), the location of the specimen on the sample slide (England Finder coordinates), and the image file name. Images are of fossil pollen from slides in Smithsonian Tropical Research Institute collections. Please cite this dataset and listed publications when using these images.
keywords: optical superresolution microscopy; Zeiss Airyscan; CZI images; conifer; saccate pollen; Podocarpus; Podocarpidites
published: 2025-04-02
 
This dataset contains Raman spectra, each acquired from an individual, living, cell entrapped within a soft or stiff gelatin methacrylate hydrogel or from a cell-free region of the hydrogel sample. Spectra were acquired from the following cell types: Madin-Darby Canine Kidney cell (MDCK); Chinese hamster ovary cell (CHO-K1); transfected CHO-K1 cell that expressed the SNAP-tag and HaloTag reporter proteins fused to an organelle-specific protein (CHO-T); human monocyte-like cell (THP-1); inactive macrophage-like (M0-like); active anti-inflammatory macrophage-like (M2-like), pro/anti-inflammatory macrophage-like (M1/M2-like). These spectra are useful for identifying whether the hydrogel matrix obscures the Raman spectral signatures that are characteristic of each of these cell types.
keywords: Raman spectroscopy; 3D cell culture; single-cell spectrum; hydrogel scaffold; collagen scaffold; macrophage spectra; macrophage differentiation; THP-1 line; noninvasive phenotype identification; vibrational spectroscopy
published: 2025-04-30
 
This dataset represents the results of targeted eDNA assays via quantitative PCR for two imperiled freshwater species.
keywords: Environmental DNA, Freshwater Mussel, Salamander, Conventional Surveys, Endangered Species, Habitat Use, Artificial Structures
published: 2025-06-16
 
Biometric, and ground-based and eddy covariance flux data to investigate the impact of sugarcane expansion across subtropical Florida on the carbon (C) budget over a three-year rotation. Dataset includes: three-year record of daily fluxes, NPP and SOC input measurements, and estimates of carbon use efficiency and net ecosystem carbon balance in sugarcane and improved and semi-native pastures following pasture conversion to sugarcane.
keywords: land use change; sugarcane expansion; bioenergy; carbon budget; CUE; NECB
published: 2024-08-29
 
Over the past 15 years, soybean seed yield response to season-long elevated O3 concentrations [O3] and to year-to-year weather conditions was studied using free-air O3 concentration enrichment (O3-FACE) in the field at the SoyFACE facility in Central Illinois. Elevated [O3] significantly reduced seed yield across cultivars and years. However, our results quantitatively demonstrate that weather conditions, including soil water availability and air temperature, did not alter yield sensitivity to elevated [O3] in soybean.
keywords: drought, elevated O3, heat, O3-FACE, soybean, yield
published: 2025-05-01
 
BEPAM, Biofuel and Environmental Policy Analysis Model, models the agricultural sector and determines economically optimal land-use and feedstock mix at the US scale by maximizing the sum of agricultural sector consumers’ and producers’ surplus subject to various resource balances, land availability, and technological constraints under a range of biomass prices, from zero to $140 Mg-1 over the 2016-2030 period. Here BEPAM is used to model SAF production using energy crops and crop residues. BEPAM uses the GAMS format and uses yield and GHG balance projections from the biogeochemical model, DayCent.
keywords: BEPAM; Energy crops; direct and indirect land use change; soil carbon sequestration; fossil fuel displacement; economic incentives
published: 2025-04-28
 
Dataset of the standing dead trees at Trelease Woods in 2022. Dataset contains volume, biomass, decay class, and GPS coordinates for each standing dead tree.
keywords: old-growth; temperate forest; standing deadwood; census data
published: 2025-04-27
 
Downed woody debris census data for Trelease Woods collected in the summer of 2022. Dataset contains volume, biomass, decay class, and GPS coordinates for each downed woody debris piece.
keywords: Old-growth; temperate forest; downed woody debris; coarse woody debris; census data
published: 2025-04-27
 
Soil data for ten soil cores collected at Trelease Woods in 2022. Soil samples were analyzed with an elemental analyzer via combustion to obtain total carbon (C) and nitrogen. A subset of these samples were analyzed using the Walkley-Black method to obtain organic C. A calibration curve relating organic C and total C was created using these data.
keywords: old-growth; temperate forest; soil carbon; soil nitrogen; nutrient cycling
published: 2025-01-27
 
This is the core data for RELIX, a dataset of vascular plant species presence for 353 prairie remnants in the Midwestern United States and associated dataset of prairie remnant metadata. The primary data file contains a list of the vascular plant species observed in the prairie remnants, as well as a metadata table with more information about the prairie remnant in question and the species list itself. The data was compiled from a variety of written sources, private and published, chronicling observations made between the mid-twentieth century and 2021. It also contains a supplementary data table of vascular plant species observed in at least 8 of the prairie remnants in RELIX, as well as a list of acknowledgements for the associated manuscript.
keywords: prairie peninsula; prairie relict; prairie soil; species inventories; tallgrass prairie
published: 2024-08-15
 
This study acquired publicly available Shell annual reports. Reports were selected for the years since the UN investigation in 2011, resulting in documents from 2012 to 2023.
keywords: environmental justice; ethics of care; indigenous communities; Niger River Delta; oil spills
published: 2025-01-29
 
These data records weekly aphid and monarch butterfly (Danaus plexippus) neonate counts on individual milkweed plants in multiple raised garden beds in Chicago during the summers of 2023 and 2024. Relationships between aphid infestation and monarch neonates can be investigated along with weekly trends of monarch oviposition and aphid abundances. All gardens included in this study were on the University of Illinois Chicago campus, and within 100 meters of proximity. Data are provided on three milkweed species in 2023, and one milkweed species in 2024.
keywords: Aphis; Myzocallis; Danaus plexippus; urban gardens; Asclepias syriaca; milkweeds
Research Data Service Illinois Data Bank
Access and Use Policies Web Privacy Notice Contact Us