Dataset Search

Displaying 201 - 225 of 625 in total

Filters

Subject Area

Life Sciences (625)

Social Sciences (0)

Physical Sciences (0)

Technology and Engineering (0)

Uncategorized

Arts and Humanities (0)

Funder

U.S. Department of Energy (DOE) (202)

Other (171)

U.S. National Science Foundation (NSF) (104)

U.S. Department of Agriculture (USDA) (61)

U.S. National Institutes of Health (NIH) (47)

Illinois Department of Natural Resources (IDNR) (26)

U.S. Geological Survey (USGS) (6)

Illinois Department of Transportation (IDOT) (3)

U.S. National Aeronautics and Space Administration (NASA) (2)

U.S. Army (2)

Publication Year

2025 (218)

2024 (67)

2021 (66)

2020 (60)

2022 (54)

2019 (42)

2023 (40)

2018 (24)

2026 (23)

2017 (19)

2016 (12)

2009 (0)

2011 (0)

2012 (0)

2014 (0)

2015 (0)

License

CC BY (328)

CC0 (281)

custom (16)

Illinois Data Bank Dataset Search Results

Results

published: 2021-07-10

Optical topometry and machine learning to rapidly phenotype stomatal patterning traits for maize QTL mapping

Xie, Jiayang; Fernandes, Samuel; Mayfield-Jones, Dustin; Erice, Gorka; Choi, Min; Lipka, Alexander; Leakey, Andrew (2021)

This dataset containes the images of B73xMS71 RIL population used in QTL linkage mapping for maize epidermal traits in year 2016 and 2017. 2016RIL_all_mns.rar and 2017RIL_all_mns.rar: contain raw images produced by Nanofocus lsurf Explorer Optical Topometer (Oberhausen, Germany) at 20X magnification with 0.6 numerical aperture. Files were processed in Nanofocus μsurf analysis extended software (Oberhausen,Germany). 2016RIL_all_TIF.rar and 2017RIL_all_TIF.rar: contain images processed from the Topology layer in each nms file to strengthen the edges of cell outlines, and used in downstream cell detection. 2016RIL_all_detection_result.rar and 2017RIL_all_detection_result.rar: contain images with epidermal cells predicted using the Mask R-CNN model. training data.rar: contain images used for Mask R-CNN model training and validation.

keywords: stomata; Mask R-CNN; cell segmentation; water use efficiency

published: 2025-05-21

Data for A multiplex of connectome trajectories enables several connectivity patterns in parallel

Mostame, Parham; Wirsich, Jonathan; Alderson, Thomas H.; Ridley, Ben; Giraud, Anne-Lise; Carmichael, David W.; Vulliemoz, Serge; Guye, Maxime; Lemieux, Louis; Sadaghiani, Sepideh (2025)

___________________________________SUMMARY This dataset contains derivative data from concurrent fMRI and scalp EEG recordings used in: Mostame Parham, Wirsich Jonathan, Alderson Thomas H, Ridley Ben, Giraud Anne-Lise, Carmichael David W, Vulliemoz Serge, Guye Maxime, Lemieux Louis, Sadaghiani Sepideh (2024) A multiplex of connectome trajectories enables several connectivity patterns in parallel eLife 13:RP98777. doi: https://doi.org/10.7554/eLife.98777.3 ___________________________________RAW DATA The data has been originally published and described as part of other studies (Morillon et al., 2010; Sadaghiani et al., 2012). Briefly, 10 minutes of eyes-closed resting state were analyzed from 26 healthy subjects (average age = 24.39 years; range: 18-31 years; 8 females) with no history of psychiatric or neurological disorders. Informed consent was given by each participant and the study was approved by the local Research Ethics Committee (CPP Ile de France III). FMRI was acquired using a 3T Siemens Tim Trio scanner with a GE-EPI pulse sequence (TR = 2 s; TE = 50 ms; 40 slices; 300 volumes; field of view: 192×192; voxel size: 3×3×3 mm3). Structural T1-weighted scan were acquired using the MPRAGE pulse sequence (176 slices; field of view: 256×256; voxel size: 1×1×1 mm3). 62-channel scalp EEG (Easycap, with an additional EOG and an ECG channel) was recorded using an MR-compatible amplifier (BrainAmp MR, Brain Products) at 5Hz sampling rate. ___________________________________PREPROCESSING fMRI and EEG data were preprocessed with standard preprocessing steps as explained in detail elsewhere (Wirsich et al., 2020). In brief, fMRI underwent standard slice-time correction, spatial realignment (SPM12, http://www.fil.ion.ucl.ac.uk/spm/software/spm12). Structural T1-weighted images were processed using Freesurfer (recon-all, v6.0.0, https://surfer.nmr.mgh.harvard.edu/) in order to perform non-uniformity and intensity correction, skull stripping and gray/white matter segmentation. The cortex was parcellated into 68 regions of the Desikan-Kiliany atlas (Desikan et al., 2006). This atlas was chosen because —as an anatomical parcellation— avoids biases towards one or the other functional data modality. The T1 images of each subject and the Desikan-Killiany were co-registered to the fMRI images (FSL-FLIRT 6.0.2, https://fsl.fmrib.ox.ac.uk/fsl/fslwiki). We extracted signals of no interest such as the average signals of cerebrospinal fluid (CSF) and white matter from manually defined regions of interest (ROI, 5 mm sphere, Marsbar Toolbox 0.44, http://marsbar.sourceforge.net) and regressed out of the BOLD timeseries along with 6 rotation, translation motion parameters and global gray matter signal (Wirsich et al., 2017a). Then we bandpass-filtered the timeseries at 0.009–0.08 Hz. Average timeseries of each region was then used to calculate connectivity. EEG underwent gradient and cardio-ballistic artifact removal using Brain Vision Analyzer software (Allen et al., 1998, 2000) and was down-sampled to 250 Hz. EEG was projected into source space using the Tikhonov-regularized minimum norm in Brainstorm software (Baillet et al., 2001; Tadel et al., 2011). Source activity was then averaged to the 68 regions of the Desikan-Killiany atlas. Band-limited EEG signals in each canonical frequency band and every atlas region were then used to calculate frequency-specific connectome dynamics. Note that the MEG-ROI-nets toolbox in the OHBA Software Library (OSL; https://ohba-analysis.github.io/osl-docs/) was used to minimize source leakage in the band-limited source-localized EEG data (Colclough et al., 2015). ___________________________________FOLDER STRUCTURE The dataset includes five separate folders as described below: 1) EEGfMRI_dFC folder: connectome dynamics of scalp data This folder contains 26 single MATLAB (.mat) files for each subject. Inside each `.mat` is a structure with fields `A`, `B`, and `C`, corresponding to fMRI, amplitude-coupling, and phase-coupling connectome dynamics, respectively. The fMRI data are 3-dimensional (ROI × ROI × timepoints). The EEG data are stored in a 1×5 cell array (Delta, Theta, Alpha, Beta, Gamma), each cell containing a 3-D ROI × ROI × timepoints matrix. 2) EEGfMRI_dFC_SourceOrtho foldeR: connectome dynamics of source-orthogonalized scalp data Same format as above, except that EEG connectome dynamics are derived from source-orthogonalized signals. The MEG-ROI-nets toolbox in the OHBA Software Library (OSL; https://ohba-analysis.github.io/osl-docs/) was used to minimize source leakage in the band-limited, source-localized EEG data (Colclough et al., 2015). 3-5) Cross-modal Recurrence Plot (CRP) data Each subject has an Excel file with five sheets (Delta through Gamma), corresponding to the five frequency bands. Each sheet contains a 2-D CRP matrix (rows = fMRI timepoints, columns = band-limited EEG timepoints). - Scalp EEG–fMRI CRPs (CRP_EEGfMRI and CRP_EEGfMRI_SourceOrtho folder): two versions (with and without source-orthogonalization), each has 52 Excel files, including amplitude- and phase-coupling CRPs. - Intracranial EEG–fMRI CRPs (CRP_iEEGfMRI folder): one version, 27 Excel files, containing three cases: amplitude coupling, HRF-convolved amplitude coupling, and phase coupling.

keywords: Connectome; fMRI-EEG; Intracranial; Multiplex

published: 2024-07-01

Data and code for estimating population sizes, annual survival, and inferring absence of the frog Mantella cowanii

Edmonds, Devin; Andriantsimanarilafy, Raphali; Crottini, Angelica; Dreslik, Michael; Newton-Youens, Jade; Andoniana, Ramahefason; Christian, Randrianantoandro; Andreone, Franco (2024)

This data and code accompany the manuscript "Small population size and possible extirpation of the threatened Malagasy poison frog Mantella cowanii". The data were collected using photograph capture-recapture at three sites in the central highlands of Madagascar. In Part 1, the script implements robust design capture-mark-recapture models in program MARK through the RMark interface to estimate population sizes and annual survival probabilities. In Part 2, it estimates the number of surveys needed to infer absence at sites where we did not detect the frog.

keywords: abundance; amphibian; capture-recapture

published: 2023-07-26

Data for Improved Net Carbon Budgets in the US Midwest through Direct Measured Impacts of Enhanced Weathering

Kantola, Ilsa B; Blanc-Betes, Elena; Masters, Michael; Chang, Elliot; Marklein, Alison; Moore, Caitlin; von Haden, Adam; Bernacchi, Carl; Wolf, Adam; Epihov, Dimitar; Beerling, David; DeLucia, Evan (2023)

This data set contains data used for “Improved Net Carbon Budgets in the US Midwest through Direct Measured Impacts of Enhanced Weathering.” Data include biomass, soil bulk densities, soil respiration measurements, soil lanthanide element analysis, plant tissue analysis for major cations, and eddy covariance fluxes.

keywords: agriculture; bioenergy crop; carbon budget; eddy covariance; net ecosystem carbon balance; net primary production; soil respiration; enhanced weathering; carbon dioxide removal; Illinois

published: 2025-04-25

Data for Structural and Functional Neuroimaging Investigation in patients with ZIKA virus (ZIKV) infection with neurological manifestations

Sadaghiani, Sepideh; Jun, Suhnyoung; Bido Medina, Richard (2025)

Zika virus (ZIKV) infection has been linked to neurological disorders such as microcephaly in children. Cases of Guillain-Barré Syndrome (GBS), a peripheral nervous system (PNS) disorder, have been reported in adults with ZIKV infection. These ZIKV-related GBS cases often exhibit atypical clinical features compared to classic GBS, including central nervous system (CNS) involvement. This dataset comprises two patient groups and a healthy control group. The first patient group includes adults with confirmed ZIKV infection, presenting both PNS-related GBS symptoms and CNS manifestations. The second group consists of adults with GBS but without ZIKV infection. The final group includes healthy, unaffected individuals.

keywords: Zika virus; Guillain-Barré Syndrome; adults; neuroimaging; central nervous system;

published: 2025-09-24

Data from High Solids Loading Biorefinery for the Production of Cellulosic Sugars from Bioenergy Sorghum

Cheng, Ming-Hsun; Kadhum, Haider Jawad; Murthy, Ganti S.; Dien, Bruce; Singh, Vijay (2025)

A novel process applying high solids loading in chemical-free pretreatment and enzymatic hydrolysis was developed to produce sugars from bioenergy sorghum. Hydrothermal pretreatment with 50% solids loading was performed in a pilot scale continuous reactor followed by disc refining. Sugars were extracted from the enzymatic hydrolysis at 10% to 50% solids content using fed-batch operations. Three surfactants (Tween 80, PEG 4000, and PEG 6000) were evaluated to increase sugar yields. Hydrolysis using 2% PEG 4000 had the highest sugar yields. Glucose concentrations of 105, 130, and 147 g/L were obtained from the reaction at 30%, 40%, and 50% solids content, respectively. The maximum sugar concentration of the hydrolysate, including glucose and xylose, obtained was 232 g/L. Additionally, the glucose recovery (73.14%) was increased compared to that of the batch reaction (52.74%) by using two-stage enzymatic hydrolysis combined with fed-batch operation at 50% w/v solids content.

keywords: Conversion;Feedstock Bioprocessing

published: 2019-06-22

Manipulating social information to promote frugivory by birds on a Hawaiian Island

MacDonald, Sean; Ward, Michael; Sperry, Jinelle (2019)

keywords: conspecific attraction; fruit-eating bird; Hawaiian flora; playback experiment; seed dispersal; social information; Zosterops japonicas

published: 2024-01-19

Soybean seed quality response to eCO2 data files

Digrado, Anthony; Montes, Christopher; Baxter, Ivan; Ainsworth, Elizabeth (2024)

This data set is related to a SoyFACE experiment conducted in 2004, 2006, 2007, and 2008 with the soybean cultivars Loda and HS93-4118. The experiment looked at how seed elements were affected by elevated CO2 and yield. In this V2, 2 new files were added per journal requirement. Total there are 5 data files in text format within the digrado_et_al_gcb_data_V2 and 1 readme file. The name of files are listed below. Details about headers are explained in the readme.txt file. 1. ionomic_data.txt file contains the ionomic data (mg/kg) for the two cultivars. The file contains all six technical replicates for each plot. The cultivar, year, treatment, and the plot from which the samples were collected are given for each entry. 2. yield_data.txt file contains the yield data for the two cultivars (seed yield in kg/ha, seed yield in bu/a, Protein (%), Oil (%)). The file contains yield data for every plot. The cultivar, year, treatment, and the plot from which the samples were collected are given for each entry. 3. mineral_pro_oil_yield.txt file contains the yield per hectare for each mineral (g/ha) along with the yield per hectare for protein and oil (t/ha). This was obtained by multiplying the seed content of each element (minerals, protein, and oil) by the total seed yield. The file contains yield data for every plots. The cultivar, year, treatment, and the plot from which the samples were collected are given for each entry. 4. economic_assessment.txt file contains data used to assess the financial impact of altered seed oil content on soybean oil production. 5. meteorological_data.txt file contains the meteorological data recorded by a weather station located ~ 3km from the experimental site (Willard Airport Champaign). Data covering the period between May 28 and September 24 were used for 2004; between May 25 and September 24 were used in 2006; between May 23 and September 17 in 2007; and between June 16 and October 24 in 2008.

keywords: protein; oil; mineral; SoyFACE; nutrient; Glycine max; soybean; yield; CO2; agriculture; climate change

published: 2025-05-21

Pollen of Podocarpus (Podocarpaceae) II: Airyscan confocal superresolution images

Punyasena, Surangi W.; Adaime, Marc-Elie; Jaramillo, Carlos (2025)

This dataset includes a total of 16 images of 2 extant species of Podocarpus (Podocarpaceae) and 23 images of fossil specimens of the morphogenus Podocarpidites. The images were taken using a Zeiss LSM 880 microscope with Airyscan confocal superresolution at 630x magnification (63x/NA 1.4 oil DIC). The images are in the original CZI file format. They can be opened using Zeiss propriety software (Zen, Zen lite) or open microscopy software, such as ImageJ. More information on how to open CZI files can be found here: [https://www.zeiss.com/microscopy/us/products/software/zeiss-zen/czi-image-file-format.html] For Podocarpus (modern specimens): Each folder is labelled by genus and contain all images corresponding to that genus. Detailed information about the folders, files, and specimens can be found in the Excel file "METADATA_Podocarpus_extant.csv". This file includes metadata on: species, slide ID, collection, folder name file name and notes. Images are of pollen grains from slides in the Florida Museum of Natural History collections. For Podocarpidites (fossil specimens): Each image is named after the sample from which it was derived. Detailed information about the specimens can be found in the Excel file "METADATA_ Podocarpidites_fossil.csv". This file includes metadata: the fossil type (Taxon), the slide and sample name (Slide Info), the location of the sample locality (Country, Latitude, Longitude), the age of the sample (Min age, Max age), the location of the specimen on the sample slide (England Finder coordinates), and the image file name. Images are of fossil pollen from slides in Smithsonian Tropical Research Institute collections. Please cite this dataset and listed publications when using these images.

keywords: optical superresolution microscopy; Zeiss Airyscan; CZI images; conifer; saccate pollen; Podocarpus; Podocarpidites

published: 2025-09-30

Data from Economic Perspective of Ethanol and Biodiesel Coproduction from Industrial Hemp

Viswanathan, Mothi Bharath; Cheng, Ming-Hsun; Clemente, Tom; Dweikat, Ismail; Singh, Vijay (2025)

In this study, the economics of producing biofuels from an industrial hemp (Cannabis sativa) genotype – 19m96136 was investigated. A lignocellulosic biofuel plant, hourly consuming 85 metric tons of hemp biomass was modeled in SuperPro Designer®. The integrated bioenergy plant produced hemp biodiesel and bioethanol from lipids and carbohydrates, respectively. The structural composition of the industrial hemp plant was analyzed in a previous study. The data obtained was used to simulate feedstock composition in SuperPro Designer®. The simulation results indicated that Hemp containing 2% lipids can yield up to 3.95 million gallons of biodiesel annually. On improving biomass lipid content to 5 and 10%, biodiesel production increased to 9.88 and 19.91 million gallons, respectively. The breakeven unit production cost of hemp biodiesel with 2, 5, and 10% lipid containing hemp was $18.49, $7.87, and $4.13/gallon, respectively. The biodiesel unit production cost when utilizing 10% lipid-containing hemp was comparable to soybean biodiesel at $4.13/gallon. Furthermore, sensitivity analysis revealed the possibility of a 7.80% reduction in unit production cost upon a 10% reduction in hemp feedstock cost. Furthermore, industrial hemp was capable of producing between 307.80 and 325.82 gallons of total biofuels per hectare of agricultural land than soybean.

keywords: Conversion;Feedstock Production;Economics;Modeling

published: 2024-03-25

Data for "Differing physiological performance of coexisting cool- and warmwater fish species under heatwaves in the Midwestern United States"

Suski, Cory; Dai, Qihong (2024)

This is the dataset for the manuscript titled, "Differing physiological performance of coexisting cool- and warmwater fish species under heatwaves in the Midwestern United States"

keywords: climate change; heat wave; metabolic rate; swimming; predator-prey interaction; thermal tolerance; Sander vitreus; walleye; largemouth bass; species distributions

published: 2024-06-24

Data for “Autophagy suppression in DNA damaged cells occurs through a newly identified p53-proteasome-LC3 axis”

Lieu, D'Feau J.; Crowder, Molly K.; Kryza, Jordan R.; Tamilselvam, Batcha; Kaminski, Paul J.; Kim, Ik-Jung; Li, Yingxing; Jeong, Eunji; Enkhbaatar, Michidmaa; Chen, Henry; Son, Sophia B.; Mok, Hanlin; Bradley, Kenneth A.; Phillips, Heidi; Blanke, Steven R. (2024)

This page contains the data for the manuscript "Autophagy suppression in DNA damaged cells occurs through a newly identified p53-proteasome-LC3 axis" currently available in preprint on bioRxiv

keywords: Steven R Blanke; Cytolethal Distending Toxin; CDT; Autophagy; Genotoxicity; p53; DNA damage; DNA damage response; LC3; proteasome; proteostasis; DDR; autophagosome

published: 2024-07-09

Data matrices for "Missing Data and Model Selection in Phylogenomics: A Re-Evaluation of Cicadomorpha (Hemiptera: Auchenorrhyncha) Superfamily Level Relationships Under Site-Heterogeneous Models"

Yan, Bin; Dietrich, Christopher; Yu, Xiaofei; Jiang, Yan; Dai, Renhuai; Du, Shiyu; Cai, Chenyang; Yang, Maofa; Zhang, Feng (2024)

The included files are the alignments of DNA or amino acid sequences used for phylogenetic analyses of Auchenorrhyncha (Insecta: Hemiptera) in the manuscript by Bin et al. submitted to the journal “Systematic Entomology.” The files are plain text in either FASTA (.fa or .fas suffix) or PHYLIP (.phy suffix) format. Matrix0 is the set of all loci after multiple sequence alignment and trimming (hereafter called). Matrix1 consists of loci having 75% average bootstrap support and 80% taxon completeness (hereafter called Matrix1). Matrix2 consists of loci having 75% average bootstrap support and 95% completeness. Matrix2_nt12 is the same as Matrix2 but with third codon positions excluded. More details on how the datasets were compiled is provided in the Methods section of the manuscript file, also included as a PDF. Supplemental figures for the submitted manuscript are also provided as a PDF for additional information.

keywords: Insecta; Phylogeny; DNA sequence; Evolution

published: 2025-06-23

Data for ESMDynamic: Fast and Accurate Prediction of Protein Dynamic Contact Maps from Single Sequences

Kleiman, Diego; Feng, Jiangyan; Xue, Zhengyuan; Shukla, Diwakar (2025)

This repository contains data and model weights associated with the publication "ESMDynamic: Fast and Accurate Prediction of Protein Dynamic Contact Maps from Single Sequences". It includes the datasets used for training and evaluating a dynamic contact prediction model, ESMDynamic, as well as a script for conversion and usage.

keywords: Computational biology; Structural biology; Molecular dynamics; Machine learning; Protein modeling; Bioinformatics; Biophysics; Artificial intelligence

published: 2016-12-13

Sequencing data for motility selection experiments

Fraebel, David T.; Kuehn, Seppe (2016)

BAM files for founding strain (MG1655-motile) as well as evolved strains from replicate motility selection experiments in low-viscosity agar plates containing either rich medium (LB) or minimal medium (M63+0.18mM galactose)

published: 2020-12-29

Fern functional traits

Viana, Jéssica; Turner, Benjamin; Dalling, James (2020)

Three datasets: species_abundance_data, species_traits, and environmental_data. The three datasets were collected in the Fortuna Forest Reserve (8°45′ N, 82°15′ W) and Palo Seco Protected Forest (8°45′ N, 82°13′ W) located in western Panama. The two reserves support humid to super-humid rainforests, according to Holdridge (1947). The species_abundance_data and species_traits datasets were collected across 15 subplots of 25 m2 in 12 one-hectare permanent plots distributed across the two reserves. The subplots were spaced 20 m apart along three 5 m wide transects, each 30 m apart. Please read Prada et al. (2017) for details on the environmental characteristics of the study area. Prada CM, Morris A, Andersen KM, et al (2017) Soils and rainfall drive landscape-scale changes in the diversity and functional composition of tree communities in a premontane tropical forest. J Veg Sci 28:859–870. https://doi.org/10.1111/jvs.12540

keywords: functional traits; plants; ferns; environmental data; Fortuna; species data; community ecology

published: 2021-09-03

Dataset for evaluating the Hind/He statistic in polyRAD

Clark, Lindsay V.; Mays, Wittney; Lipka, Alexander E.; Sacks, Erik J. (2021)

All of the files in this dataset pertain to the evaluation of a novel statistic, Hind/He, for distinguishing Mendelian loci from paralogs. They are derived from a RAD-seq genotyping dataset of diploid and tetraploid Miscanthus sacchariflorus.

published: 2021-08-20

Maize and Sorghum Establishment and Yield following Pre-Emergence Waterlogging

von Haden, Adam C.; DeLucia, Evan H.; Yang, Wendy; Burnham, Mark (2021)

In 2020, early-season extreme precipitation events occurred following the planting of Sorghum bicolor (L.) Moench and Zea mays L. in central Illinois that caused ponding. Following the first rainfall event 50m transects were established to assess the waterlogging effects on seedling emergence and crop yields. Soil moisture, emergence, stem and tiller count, LAI, and yield were measured at various points in the season along these transects.

keywords: Sorghum; Maize; Emergence; Yield; LAI

published: 2022-02-11

FASTA file of the final sequence alignment used in the haplotype analyses of Culex pipiens complex populations collected in south-eastern Illinois (2016-2017)

Trivellone, Valeria; Cao, Yanghui; Blackshear, Millon; Kim, Chang-Hyun; Stone, Christopher (2022)

The Culex_Trivellone_etal.fas fasta file contains the original final sequence alignment used in the haplotype analyses of Trivellone et al. (Frontiers in Public Health, under review). The 492 sequences (from specimens of Culex pipiens complex collected in different habitat types using a BG-sentinel traps) were aligned using PASTA v1.8.5 under default settings. The final dataset contains 686 positions of the cytochrome c oxidase subunit I (COI) mitochondrial gene. The data analyses are further described in the cited original paper.

keywords: Culex; Culicidae; COI; mosquito surveillance, species assemblages

published: 2025-01-31

Airyscan confocal superresolution images of extant Malvaceae pollen with a focus on Bombacoideae

Punyasena, Surangi W.; Romero, Ingrid; Urban, Michael A. (2025)

Title: Airyscan confocal superresolution images of extant Malvaceae pollen with a focus on Bombacoideae Authors: Surangi W. Punyasena, Ingrid Romero, Michael A. Urban Subject: Biological sciences Keywords: Malvaceae; superresolution microscopy; Zeiss; Bombacacidites; Neotropics; CZI Funder: NSF-DBI Advances in Bioinformatics (NSF-DBI-1262561) Corresponding Creator: Surangi W. Punyasena This dataset includes a total of 430 images of extant specimens of the Malvaceae, with a focus on species that are or have been included within the subfamily Bombacoideae. There are 27 genera included within 26 folders. Each folder is named by genus and contains all the images that correspond to that genus. Note that the genus _Matisia_ is included with _Quararibea_ as detailed in the metadata READ ME file. The specimens imaged are from the palynological collections of the Swedish Museum of Natural History and Smithsonian Tropical Research Institute, and herbarium specimens from the Smithsonian Herbarium National Museum. The optical superresolution microscopy images were taken using a Zeiss LSM 880 with Airyscan at 630X magnification (63x/NA 1.4 oil DIC). The images are in the original CZI file format. They can be opened using Zeiss propriety software (Zen, Zen lite) or in ImageJ/FIJI. More information on how to open CZI files can be found here: [https://www.zeiss.com/microscopy/en/products/software/zeiss-zen/czi-image-file-format.html] Image metadata and file organization are described in the CSV file "METADATA_Malvaceae_Bombacoideae_modern-species.csv". The column headings are: Folder The folder in which the image file is found Subfamily The current subfamily determination based on the literature. Note that _Pentaplaris_ and _Septotheca_ have not been assigned a subfamily. Genus Genus name Species Species name Accepted name Accepted species name, updated from the literature Slide name Species name as denoted on the herbarium slide Collection Source of the herbarium slide: Sweden National Museum of Natural History or the Smithsonian Tropical Research Institute File name File name using the species name denoted on the herbarium slide Slide ID/Herbarium ID Specimen collection number Please cite this dataset as: Punyasena, Surangi W.; Romero, Ingrid; Urban, Michael A. (2025): Airyscan confocal superresolution images of extant Malvaceae pollen with a focus on Bombacoideae. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-2968712_V1

keywords: Malvaceae; superresolution microscopy; Zeiss; Bombacoideae; Neotropics; CZI

published: 2025-10-30

Data for Construction of a Compact Array of Microplasma Jet Devices and Its Application for Random Mutagenesis of Rhodosporidium toruloides

Koh, Hyun Gi; Kim, Jinhong; Rao, Christopher V.; Park, Sung-Jin; Jin, Yong-Su (2025)

A small and efficient DNA mutation-inducing machine was constructed with an array of microplasma jet devices (7 × 1) that can be operated at atmospheric pressure for microbial mutagenesis. Using this machine, we report disruption of a plasmid DNA and generation of mutants of an oleaginous yeast Rhodosporidium toruloides. Specifically, a compact-sized microplasma channel (25 × 20 × 2 mm3) capable of generating an electron density of greater than 1013 cm–3 was constructed to produce reactive species (N2*, N2+, O, OH, and Hα) under helium atmospheric conditions to induce DNA mutagenesis. The length of microplasma channels in the device played a critical role in augmenting both the volume of plasma and the concentration of reactive species. First, we confirmed that microplasma treatment can linearize a plasmid by creating nicks in vitro. Second, we treated R. toruloides cells with a jet device containing 7 microchannels for 5 min; 94.8% of the treated cells were killed, and 0.44% of surviving cells showed different colony colors as compared to their parental colony. Microplasma-based DNA mutation is energy-efficient and can be a safe alternative for inducing mutations compared to conventional methods using toxic mutagens. This compact and scalable device is amenable for industrial strain improvement involving large-scale mutagenesis.

keywords: Conversion;Genome Engineering

published: 2020-05-31

Simulated multi-sample tumor bulk sequencing data

Zhang, Chuanyi; El-Kebir, Mohammed; Ochoa, Idoia (2020)

This repository includes a simulated dataset and related scripts used for the paper "Moss: Accurate Single-Nucleotide Variant Calling from Multiple Bulk DNA Tumor Samples".

keywords: Somatic Mutations; Bulk DNA Sequencing; Cancer Genomics

published: 2025-06-22

Microclimate Species Distribution Models and Mechanistic Models of Potential Surface Activity for Plethodontid Salamanders in Great Smoky Mountains National Park

Stickley, Samuel; Crawford, John; Peterman, William; Fraterrigo, Jennifer (2025)

keywords: terrestrial salamanders, microhabitat, physiology, mechanistic models, ecological niche models, climate change, Great Smoky Mountains National Park

published: 2025-11-24

Data for Sustainable Co-Production of Plant Lipids and Cellulosic Sugars from Transgenic Energycane at an Industrially Relevant Scale: A Proof of Concept for Alternative Feedstocks

Maitra, Shraddha; Cheng, Ming-Hsun; Liu, Hui; Cao, Viet Dang; Kannan, Baskaran; Long, Stephen; Shanklin, John; Altpeter, Fredy; Singh, Vijay (2025)

Development of sustainable and scalable technologies to convert lignocellulosic biomass to biofuels is critical to achieving carbon neutrality. The potential of transgenic bioenergy crops as a renewable source of sugars and lipids has been demonstrated at bench-scale. However, scaling up these processes is important for holistic analysis. Here proof-of-concept for chemical-free hydrothermal pretreatment of transgenic energycane-oilcane line L13 at an industrially relevant scale to recover vegetative lipids along with cellulosic sugars is presented. Pilot-scale processing of 97 kg of transgenic energycane-oilcane L13 stems and high solids pretreatment of bagasse enhanced the recovery of cellulosic glucose and xylose by 5-fold as compared to untreated bagasse and helped in the enrichment of vegetative lipids in the biomass residues which allowed its recovery at the end of the bioprocess. Palmitic and oleic acids were the predominant fatty acids (FAs) extracted from stems and leaves. The processing did not affect lipid composition. The efficiency of lipid recovery from untreated biomass was 75.9% which improved to 88.7% upon pretreatment. The vegetative tissues of transgenic energycane-oilcane L13 contained 0.42 metric tons/hectare of lipids. Processing vegetative tissues yielded 0.38 metric tons/hectare of lipids. This approaches an oil yield similar to soybean (global average 0.44 metric tons/hectare) and is almost twice as high as the oil yield from sugarcane engineered to hyperaccumulate lipids (0.20 metric tons/hectare). The study suggests that further optimization by state-of-the-art metabolic engineering and biomass processing can establish transgenic bioenergy crops for commercial drop-in fuel production.

keywords: Conversion;Feedstock Production;Biomass Analytics;Energycane;Lipidomics

published: 2020-06-06

Reproduction and hybridization in Celastrus scandens and C. orbiculatus at the Indiana Dunes National Park

Zaya, David N.; Leicht-Young, Stacey A.; Pavlovic, Noel B.; Ashley, Mary V. (2020)

These data are from an observational study and small experiment investigating reproductive biology and hybridization between two plants, Celastrus scandens L. and Celastrus orbiculatus Thunb. (Celastraceae). These data were collected during the 2008 growing season from the Indiana Dunes National Park (formerly Indiana Dunes National Lakeshore), just east of the municipality of Ogden Dunes, Indiana, USA. The five data files provide information on floral output of the two species, fertilization rate, fruit set rate, hybridization rate at two scales (individual flowers in both species, individual maternal plants in C. scandens), and the results of a hand-pollination experiment that exchanged pollen between the two species. There are six data files associated with this submission, five data files in comma-separated values format and one text file (‘readme.txt’) that includes detailed explanations of the data files.

keywords: Celastrus; invasive species; hybridization; heterospecific pollen; hand pollination