Dataset Search

Displaying 301 - 325 of 782 in total

Filters

Subject Area

Life Sciences (481)

Social Sciences (118)

Physical Sciences (114)

Technology and Engineering (61)

Uncategorized

Funder

U.S. Department of Energy (DOE) (233)

Other (206)

U.S. National Science Foundation (NSF) (177)

U.S. National Institutes of Health (NIH) (67)

U.S. Department of Agriculture (USDA) (46)

Illinois Department of Natural Resources (IDNR) (20)

U.S. Geological Survey (USGS) (6)

Illinois Department of Transportation (IDOT) (3)

U.S. National Aeronautics and Space Administration (NASA) (3)

U.S. Army (3)

Publication Year

2025 (242)

2022 (87)

2021 (82)

2024 (79)

2020 (65)

2023 (51)

2019 (49)

2018 (48)

2026 (48)

2017 (21)

2016 (10)

License

CC BY (431)

CC0 (328)

custom (23)

Illinois Data Bank Dataset Search Results

Results

published: 2024-07-01

Data and code for estimating population sizes, annual survival, and inferring absence of the frog Mantella cowanii

Edmonds, Devin; Andriantsimanarilafy, Raphali; Crottini, Angelica; Dreslik, Michael; Newton-Youens, Jade; Andoniana, Ramahefason; Christian, Randrianantoandro; Andreone, Franco (2024)

This data and code accompany the manuscript "Small population size and possible extirpation of the threatened Malagasy poison frog Mantella cowanii". The data were collected using photograph capture-recapture at three sites in the central highlands of Madagascar. In Part 1, the script implements robust design capture-mark-recapture models in program MARK through the RMark interface to estimate population sizes and annual survival probabilities. In Part 2, it estimates the number of surveys needed to infer absence at sites where we did not detect the frog.

keywords: abundance; amphibian; capture-recapture

published: 2025-03-18

Global News Index and Extracted Features Repository (v.1.3.0)

Cline Center for Advanced Social Research (2025)

The Cline Center Global News Index is a searchable database of textual features extracted from millions of news stories, specifically designed to provide comprehensive coverage of events around the world. In addition to searching documents for keywords, users can query metadata and features such as named entities extracted using Natural Language Processing (NLP) methods and variables that measure sentiment and emotional valence. Archer is a web application purpose-built by the Cline Center to enable researchers to access data from the Global News Index. Archer provides a user-friendly interface for querying the Global News Index (with the back-end indexing still handled by Solr). By default, queries are built using icons and drop-down menus. More technically-savvy users can use Lucene/Solr query syntax via a ‘raw query’ option. Archer allows users to save and iterate on their queries, and to visualize faceted query results, which can be helpful for users as they refine their queries. Additional Resources: - Access to Archer and the Global News Index is limited to account-holders. If you are interested in signing up for an account, please fill out the <a href="https://docs.google.com/forms/d/e/1FAIpQLSf-J937V6I4sMSxQt7gR3SIbUASR26KXxqSurrkBvlF-CIQnQ/viewform?usp=pp_url">Archer Access Request Form</a> so we can determine if you are eligible for access or not. - Current users who would like to provide feedback, such as reporting a bug or requesting a feature, can fill out the <a href="https://forms.gle/6eA2yJUGFMtj5swY7">Archer User Feedback Form</a>. - The Cline Center sends out periodic email newsletters to the Archer Users Group. Please fill out this <a href="https://groups.webservices.illinois.edu/subscribe/154221">form</a> to subscribe to it. Citation Guidelines: 1) To cite the GNI codebook (or any other documentation associated with the Global News Index and Archer) please use the following citation: Cline Center for Advanced Social Research. 2025. Global News Index and Extracted Features Repository [codebook], v1.3.0. Champaign, IL: University of Illinois. June. XX. doi:10.13012/B2IDB-5649852_V6 2) To cite data from the Global News Index (accessed via Archer or otherwise) please use the following citation (filling in the correct date of access): Cline Center for Advanced Social Research. 2025. Global News Index and Extracted Features Repository [database], v1.3.0. Champaign, IL: University of Illinois. Jun. XX. Accessed Month, DD, YYYY. doi:10.13012/B2IDB-5649852_V6 *NOTE: V6 is replacing V5 with updated ‘Archer’ documents to reflect changes made to the Archer system.

published: 2025-08-01

Data for Reductions in soybean photosynthesis and yield by elevated ozone are not mitigated by soil drying

Martin, Duncan G; Aspray, Elise K; Li, Shuai; Leakey, Andrew DB; Ainsworth, Elizabeth A (2025)

Physiological and yield data from a three year field experiment of soybean exposed to elevated ozone stress and reduced soil moisture at the SoyFACE experiment.

keywords: soybean; ozone; drought; photosynthesis; yield

published: 2025-04-25

Data for Structural and Functional Neuroimaging Investigation in patients with ZIKA virus (ZIKV) infection with neurological manifestations

Sadaghiani, Sepideh; Jun, Suhnyoung; Bido Medina, Richard (2025)

Zika virus (ZIKV) infection has been linked to neurological disorders such as microcephaly in children. Cases of Guillain-Barré Syndrome (GBS), a peripheral nervous system (PNS) disorder, have been reported in adults with ZIKV infection. These ZIKV-related GBS cases often exhibit atypical clinical features compared to classic GBS, including central nervous system (CNS) involvement. This dataset comprises two patient groups and a healthy control group. The first patient group includes adults with confirmed ZIKV infection, presenting both PNS-related GBS symptoms and CNS manifestations. The second group consists of adults with GBS but without ZIKV infection. The final group includes healthy, unaffected individuals.

keywords: Zika virus; Guillain-Barré Syndrome; adults; neuroimaging; central nervous system;

published: 2025-09-24

Data from High Solids Loading Biorefinery for the Production of Cellulosic Sugars from Bioenergy Sorghum

Cheng, Ming-Hsun; Kadhum, Haider Jawad; Murthy, Ganti S.; Dien, Bruce; Singh, Vijay (2025)

A novel process applying high solids loading in chemical-free pretreatment and enzymatic hydrolysis was developed to produce sugars from bioenergy sorghum. Hydrothermal pretreatment with 50% solids loading was performed in a pilot scale continuous reactor followed by disc refining. Sugars were extracted from the enzymatic hydrolysis at 10% to 50% solids content using fed-batch operations. Three surfactants (Tween 80, PEG 4000, and PEG 6000) were evaluated to increase sugar yields. Hydrolysis using 2% PEG 4000 had the highest sugar yields. Glucose concentrations of 105, 130, and 147 g/L were obtained from the reaction at 30%, 40%, and 50% solids content, respectively. The maximum sugar concentration of the hydrolysate, including glucose and xylose, obtained was 232 g/L. Additionally, the glucose recovery (73.14%) was increased compared to that of the batch reaction (52.74%) by using two-stage enzymatic hydrolysis combined with fed-batch operation at 50% w/v solids content.

keywords: Conversion;Feedstock Bioprocessing

published: 2018-01-11

DT-BASE - Training Quality Causal Model

Pence, Justin; Mohaghegh, Zahra (2018)

Dataset includes structure and values of a causal model for Training Quality in nuclear power plants. Each entry refers to a piece of evidence supporting causality of the Training Quality causal model. Includes bibliographic information, context-specific text from the reference, and three weighted values; (M1) credibility of reference, (2) causality determined by the author, and (3) analysts confidence level. (M1, M2, and M3) Weight metadata are based on probability language from <a href="https://www.ipcc.ch/ipccreports/tar/vol4/english/index.htm" style="text-decoration: none" >Intergovernmental Panel on Climate Change (IPCC), Climate Change 2001: Synthesis Report</a>. The language can be found in the “Summary for Policymakers” section, in the PDF format. Weight Metadata: LowerBound_Probability, UpperBound_Probability, Qualitative Language 0.99, 1, Virtually Certain 0.9, 0.99, Very Likely 0.66, 0.9, Likely 0.33, 0.66, Medium Likelihood 0.1, 0.33, Unlikely 0.01, 0.1, Very Unlikely 0, 0.01, Extremely Unlikely

keywords: Data-Theoretic; Training; Organization; Probabilistic Risk Assessment; Training Quality; Causal Model; DT-BASE; Bayesian Belief Network; Bayesian Network; Theory-Building

published: 2019-06-22

Manipulating social information to promote frugivory by birds on a Hawaiian Island

MacDonald, Sean; Ward, Michael; Sperry, Jinelle (2019)

keywords: conspecific attraction; fruit-eating bird; Hawaiian flora; playback experiment; seed dispersal; social information; Zosterops japonicas

published: 2024-01-19

Soybean seed quality response to eCO2 data files

Digrado, Anthony; Montes, Christopher; Baxter, Ivan; Ainsworth, Elizabeth (2024)

This data set is related to a SoyFACE experiment conducted in 2004, 2006, 2007, and 2008 with the soybean cultivars Loda and HS93-4118. The experiment looked at how seed elements were affected by elevated CO2 and yield. In this V2, 2 new files were added per journal requirement. Total there are 5 data files in text format within the digrado_et_al_gcb_data_V2 and 1 readme file. The name of files are listed below. Details about headers are explained in the readme.txt file. 1. ionomic_data.txt file contains the ionomic data (mg/kg) for the two cultivars. The file contains all six technical replicates for each plot. The cultivar, year, treatment, and the plot from which the samples were collected are given for each entry. 2. yield_data.txt file contains the yield data for the two cultivars (seed yield in kg/ha, seed yield in bu/a, Protein (%), Oil (%)). The file contains yield data for every plot. The cultivar, year, treatment, and the plot from which the samples were collected are given for each entry. 3. mineral_pro_oil_yield.txt file contains the yield per hectare for each mineral (g/ha) along with the yield per hectare for protein and oil (t/ha). This was obtained by multiplying the seed content of each element (minerals, protein, and oil) by the total seed yield. The file contains yield data for every plots. The cultivar, year, treatment, and the plot from which the samples were collected are given for each entry. 4. economic_assessment.txt file contains data used to assess the financial impact of altered seed oil content on soybean oil production. 5. meteorological_data.txt file contains the meteorological data recorded by a weather station located ~ 3km from the experimental site (Willard Airport Champaign). Data covering the period between May 28 and September 24 were used for 2004; between May 25 and September 24 were used in 2006; between May 23 and September 17 in 2007; and between June 16 and October 24 in 2008.

keywords: protein; oil; mineral; SoyFACE; nutrient; Glycine max; soybean; yield; CO2; agriculture; climate change

published: 2025-05-21

Pollen of Podocarpus (Podocarpaceae) II: Airyscan confocal superresolution images

Punyasena, Surangi W.; Adaime, Marc-Elie; Jaramillo, Carlos (2025)

This dataset includes a total of 16 images of 2 extant species of Podocarpus (Podocarpaceae) and 23 images of fossil specimens of the morphogenus Podocarpidites. The images were taken using a Zeiss LSM 880 microscope with Airyscan confocal superresolution at 630x magnification (63x/NA 1.4 oil DIC). The images are in the original CZI file format. They can be opened using Zeiss propriety software (Zen, Zen lite) or open microscopy software, such as ImageJ. More information on how to open CZI files can be found here: [https://www.zeiss.com/microscopy/us/products/software/zeiss-zen/czi-image-file-format.html] For Podocarpus (modern specimens): Each folder is labelled by genus and contain all images corresponding to that genus. Detailed information about the folders, files, and specimens can be found in the Excel file "METADATA_Podocarpus_extant.csv". This file includes metadata on: species, slide ID, collection, folder name file name and notes. Images are of pollen grains from slides in the Florida Museum of Natural History collections. For Podocarpidites (fossil specimens): Each image is named after the sample from which it was derived. Detailed information about the specimens can be found in the Excel file "METADATA_ Podocarpidites_fossil.csv". This file includes metadata: the fossil type (Taxon), the slide and sample name (Slide Info), the location of the sample locality (Country, Latitude, Longitude), the age of the sample (Min age, Max age), the location of the specimen on the sample slide (England Finder coordinates), and the image file name. Images are of fossil pollen from slides in Smithsonian Tropical Research Institute collections. Please cite this dataset and listed publications when using these images.

keywords: optical superresolution microscopy; Zeiss Airyscan; CZI images; conifer; saccate pollen; Podocarpus; Podocarpidites

published: 2018-05-21

Geometric analysis of magnetic dimensionality

Karigerasi, Manohar H.; Wagner, Lucas K.; Shoemaker, Daniel P. (2018)

This dataset contains bonding networks and tolerance ranges for geometric magnetic dimensionality. The data can be searched in the html frontend above, code obtained at the GitHub repository, or the raw data can be downloaded as csv below. The csv data contains the results of 42520 compounds (unique icsd_code) from ICSD FindIt v3.5.0. The csv is semicolon-delimited since some fields contain multiple comma-separated values.

keywords: materials science; physics; magnetism; crystallography

published: 2018-09-06

XSEDE: Allocations Awards for the NSF Cyberinfrastructure Portfolio, 2004-2017

XSEDE-Extreme Science and Engineering Discovery Environment (2018)

The XSEDE program manages the database of allocation awards for the portfolio of advanced research computing resources funded by the National Science Foundation (NSF). The database holds data for allocation awards dating to the start of the TeraGrid program in 2004 to present, with awards continuing through the end of the second XSEDE award in 2021. The project data include lead researcher and affiliation, title and abstract, field of science, and the start and end dates. Along with the project information, the data set includes resource allocation and usage data for each award associated with the project. The data show the transition of resources over a fifteen year span along with the evolution of researchers, fields of science, and institutional representation.

keywords: allocations; cyberinfrastructure; XSEDE

published: 2024-05-23

Data for: Learned 1-D passive scalar advection to accelerate chemical transport modeling: a case study with GEOS-FP horizontal wind fields

Park, Manho; Zheng, Zhonghua; Riemer, Nicole; Tessum, Christopher (2024)

This dataset contains the training results (model parameters, outputs), datasets for generalization testing, and 2-D implementation used in the article "Learned 1-D passive scalar advection to accelerate chemical transport modeling: a case study with GEOS-FP horizontal wind fields." The article will be submitted to Artificial Intelligence for Earth Systems. The datasets are saved as CSV for 1-D time-series data and *netCDF for 2-D time series dataset. The model parameters are saved in every training epoch tested in the study.

keywords: Air quality modeling; Coarse-graining; GEOS-Chem; Numerical advection; Physics-informed machine learning; Transport operator

published: 2025-09-30

Data from Economic Perspective of Ethanol and Biodiesel Coproduction from Industrial Hemp

Viswanathan, Mothi Bharath; Cheng, Ming-Hsun; Clemente, Tom; Dweikat, Ismail; Singh, Vijay (2025)

In this study, the economics of producing biofuels from an industrial hemp (Cannabis sativa) genotype – 19m96136 was investigated. A lignocellulosic biofuel plant, hourly consuming 85 metric tons of hemp biomass was modeled in SuperPro Designer®. The integrated bioenergy plant produced hemp biodiesel and bioethanol from lipids and carbohydrates, respectively. The structural composition of the industrial hemp plant was analyzed in a previous study. The data obtained was used to simulate feedstock composition in SuperPro Designer®. The simulation results indicated that Hemp containing 2% lipids can yield up to 3.95 million gallons of biodiesel annually. On improving biomass lipid content to 5 and 10%, biodiesel production increased to 9.88 and 19.91 million gallons, respectively. The breakeven unit production cost of hemp biodiesel with 2, 5, and 10% lipid containing hemp was $18.49, $7.87, and $4.13/gallon, respectively. The biodiesel unit production cost when utilizing 10% lipid-containing hemp was comparable to soybean biodiesel at $4.13/gallon. Furthermore, sensitivity analysis revealed the possibility of a 7.80% reduction in unit production cost upon a 10% reduction in hemp feedstock cost. Furthermore, industrial hemp was capable of producing between 307.80 and 325.82 gallons of total biofuels per hectare of agricultural land than soybean.

keywords: Conversion;Feedstock Production;Economics;Modeling

published: 2024-07-09

Data matrices for "Missing Data and Model Selection in Phylogenomics: A Re-Evaluation of Cicadomorpha (Hemiptera: Auchenorrhyncha) Superfamily Level Relationships Under Site-Heterogeneous Models"

Yan, Bin; Dietrich, Christopher; Yu, Xiaofei; Jiang, Yan; Dai, Renhuai; Du, Shiyu; Cai, Chenyang; Yang, Maofa; Zhang, Feng (2024)

The included files are the alignments of DNA or amino acid sequences used for phylogenetic analyses of Auchenorrhyncha (Insecta: Hemiptera) in the manuscript by Bin et al. submitted to the journal “Systematic Entomology.” The files are plain text in either FASTA (.fa or .fas suffix) or PHYLIP (.phy suffix) format. Matrix0 is the set of all loci after multiple sequence alignment and trimming (hereafter called). Matrix1 consists of loci having 75% average bootstrap support and 80% taxon completeness (hereafter called Matrix1). Matrix2 consists of loci having 75% average bootstrap support and 95% completeness. Matrix2_nt12 is the same as Matrix2 but with third codon positions excluded. More details on how the datasets were compiled is provided in the Methods section of the manuscript file, also included as a PDF. Supplemental figures for the submitted manuscript are also provided as a PDF for additional information.

keywords: Insecta; Phylogeny; DNA sequence; Evolution

published: 2018-03-08

Molecular Biology Databases Published in Nucleic Acids Research between 1991-2016

Imker, Heidi (2018)

This dataset was developed to create a census of sufficiently documented molecular biology databases to answer several preliminary research questions. Articles published in the annual Nucleic Acids Research (NAR) “Database Issues” were used to identify a population of databases for study. Namely, the questions addressed herein include: 1) what is the historical rate of database proliferation versus rate of database attrition?, 2) to what extent do citations indicate persistence?, and 3) are databases under active maintenance and does evidence of maintenance likewise correlate to citation? An overarching goal of this study is to provide the ability to identify subsets of databases for further analysis, both as presented within this study and through subsequent use of this openly released dataset.

keywords: databases; research infrastructure; sustainability; data sharing; molecular biology; bioinformatics; bibliometrics

published: 2025-05-05

Data for article about perceived trustworthiness of social media AI-Generated content

Benson, Sara; Cheng, Siyao; Ton, Mary; Graves, Celenia; Owens, Dawn (2025)

The dataset includes responses from approximately 550 participants to survey questions about trust in images labeled with AI-related tags, compared to other images found online. The questions also explore how the type of label influences their trust.

keywords: Artificial intelligence (AI); Trust in AI; Al labeling; AI ethics

published: 2016-06-06

Datasets for modeling collaborative formation and collaborative "success"

Fegley, Brent D. (2016)

These datasets represent first-time collaborations between first and last authors (with mutually exclusive publication histories) on papers with 2 to 5 authors in years [1988,2009] in PubMed. Each record of each dataset captures aspects of the similarity, nearness, and complementarity between two authors about the paper marking the formation of their collaboration.

published: 2019-05-22

Isolated artificial spin ice kinetics

Lao, Yuyang; Schiffer, Peter (2019)

This is the experimental data of isolated nanomagnet islands with or without the presence of large nanomagnet islands. The small islands are made of Permalloy materials with size of 170 nm by 470 nm by 2.5 nm. The systems are measured at a temperature where the small islands are fluctuating around room temperature. The data is recorded as photoemission electron microscopy intensity. More details about the data can be found in the note.txt and Spe_2016.xlsx file. Note: The raw data folders are stored in five volumes during the compression. All five volumes are needed in order to recover the original folder.

keywords: artificial spin ice; magnetism

published: 2020-12-29

Fern functional traits

Viana, Jéssica; Turner, Benjamin; Dalling, James (2020)

Three datasets: species_abundance_data, species_traits, and environmental_data. The three datasets were collected in the Fortuna Forest Reserve (8°45′ N, 82°15′ W) and Palo Seco Protected Forest (8°45′ N, 82°13′ W) located in western Panama. The two reserves support humid to super-humid rainforests, according to Holdridge (1947). The species_abundance_data and species_traits datasets were collected across 15 subplots of 25 m2 in 12 one-hectare permanent plots distributed across the two reserves. The subplots were spaced 20 m apart along three 5 m wide transects, each 30 m apart. Please read Prada et al. (2017) for details on the environmental characteristics of the study area. Prada CM, Morris A, Andersen KM, et al (2017) Soils and rainfall drive landscape-scale changes in the diversity and functional composition of tree communities in a premontane tropical forest. J Veg Sci 28:859–870. https://doi.org/10.1111/jvs.12540

keywords: functional traits; plants; ferns; environmental data; Fortuna; species data; community ecology

published: 2021-09-03

Dataset for evaluating the Hind/He statistic in polyRAD

Clark, Lindsay V.; Mays, Wittney; Lipka, Alexander E.; Sacks, Erik J. (2021)

All of the files in this dataset pertain to the evaluation of a novel statistic, Hind/He, for distinguishing Mendelian loci from paralogs. They are derived from a RAD-seq genotyping dataset of diploid and tetraploid Miscanthus sacchariflorus.

published: 2021-08-20

Maize and Sorghum Establishment and Yield following Pre-Emergence Waterlogging

von Haden, Adam C.; DeLucia, Evan H.; Yang, Wendy; Burnham, Mark (2021)

In 2020, early-season extreme precipitation events occurred following the planting of Sorghum bicolor (L.) Moench and Zea mays L. in central Illinois that caused ponding. Following the first rainfall event 50m transects were established to assess the waterlogging effects on seedling emergence and crop yields. Soil moisture, emergence, stem and tiller count, LAI, and yield were measured at various points in the season along these transects.

keywords: Sorghum; Maize; Emergence; Yield; LAI

published: 2024-03-27

Dataset for "Arguing about Controversial Science in the News: Does Epistemic Uncertainty Contribute to Information Disorder?"

Zheng, Heng; Schneider, Jodi (2024)

To gather news articles from the web that discuss the Cochrane Review, we used Altmetric Explorer from Altmetric.com and retrieved articles on August 1, 2023. We selected all articles that were written in English, published in the United States, and had a publication date prior to March 10, 2023 (according to the “Mention Date” on Altmetric.com). This date is significant as it is when Cochrane issued a statement about the "misleading interpretation" of the Cochrane Review. The collection of news articles is presented in the Altmetric_data.csv file. The dataset contains the following data that we exported from Altmetric Explorer: - Publication date of the news article - Title of the news article - Source/publication venue of the news article - URL - Country We manually checked and added the following information: - Whether the article still exists - Whether the article is accessible - Whether the article is from the original source We assigned MAXQDA IDs to the news articles. News articles were assigned the same ID when they were (a) identical or (b) in the case of Article 207, closely paraphrased, paragraph by paragraph. Inaccessible items were assigned a MAXQDA ID based on their "Mention Title". For each article from Altmetric.com, we first tried to use the Web Collector for MAXQDA to download the article from the website and imported it into MAXQDA (version 22.7.0). If an article could not be retrieved using the Web Collector, we either downloaded the .html file or in the case of Article 128, retrieved it from the NewsBank database through the University of Illinois Library. We then manually extracted direct quotations from the articles using MAXQDA. We included surrounding words and sentences, and in one case, a news agency’s commentary, around direct quotations for context where needed. The quotations (with context) are the positions in our analysis. We also identified who was quoted. We excluded quotations when we could not identify who or what was being quoted. We annotated quotations with codes representing groups (government agencies, other organizations, and research publications) and individuals (authors of the Cochrane Review, government agency representatives, journalists, and other experts such as epidemiologists). The MAXQDA_data.csv file contains excerpts from the news articles that contain the direct quotations we identified. For each excerpt, we included the following information: - MAXQDA ID of the document from which the excerpt originates; - The collection date and source of the document; - The code with which the excerpt is annotated; - The code category; - The excerpt itself.

keywords: altmetrics; MAXQDA; polylogue analysis; masks for COVID-19; scientific controversies; news articles

published: 2021-11-18

Rewritable Two-Dimensional DNA-Based Data Storage System (2DDNA) Sequencing Dataset

Pan, Chao; Tabatabaei, S Kasra; Tabatabaei Yazdi, S. M. Hossein; Hernandez, Alvaro; Schroeder, Charles; Milenkovic, Olgica (2021)

This dataset contains sequencing data obtained from Illumina MiSeq device to prove the concept of the proposed 2DDNA framework. Please refer to README.txt for detailed description of each file.

keywords: machine learning;image processing;computer vision;rewritable storage system;2D DNA-based data storage

published: 2025-01-31

Airyscan confocal superresolution images of extant Malvaceae pollen with a focus on Bombacoideae

Punyasena, Surangi W.; Romero, Ingrid; Urban, Michael A. (2025)

Title: Airyscan confocal superresolution images of extant Malvaceae pollen with a focus on Bombacoideae Authors: Surangi W. Punyasena, Ingrid Romero, Michael A. Urban Subject: Biological sciences Keywords: Malvaceae; superresolution microscopy; Zeiss; Bombacacidites; Neotropics; CZI Funder: NSF-DBI Advances in Bioinformatics (NSF-DBI-1262561) Corresponding Creator: Surangi W. Punyasena This dataset includes a total of 430 images of extant specimens of the Malvaceae, with a focus on species that are or have been included within the subfamily Bombacoideae. There are 27 genera included within 26 folders. Each folder is named by genus and contains all the images that correspond to that genus. Note that the genus _Matisia_ is included with _Quararibea_ as detailed in the metadata READ ME file. The specimens imaged are from the palynological collections of the Swedish Museum of Natural History and Smithsonian Tropical Research Institute, and herbarium specimens from the Smithsonian Herbarium National Museum. The optical superresolution microscopy images were taken using a Zeiss LSM 880 with Airyscan at 630X magnification (63x/NA 1.4 oil DIC). The images are in the original CZI file format. They can be opened using Zeiss propriety software (Zen, Zen lite) or in ImageJ/FIJI. More information on how to open CZI files can be found here: [https://www.zeiss.com/microscopy/en/products/software/zeiss-zen/czi-image-file-format.html] Image metadata and file organization are described in the CSV file "METADATA_Malvaceae_Bombacoideae_modern-species.csv". The column headings are: Folder The folder in which the image file is found Subfamily The current subfamily determination based on the literature. Note that _Pentaplaris_ and _Septotheca_ have not been assigned a subfamily. Genus Genus name Species Species name Accepted name Accepted species name, updated from the literature Slide name Species name as denoted on the herbarium slide Collection Source of the herbarium slide: Sweden National Museum of Natural History or the Smithsonian Tropical Research Institute File name File name using the species name denoted on the herbarium slide Slide ID/Herbarium ID Specimen collection number Please cite this dataset as: Punyasena, Surangi W.; Romero, Ingrid; Urban, Michael A. (2025): Airyscan confocal superresolution images of extant Malvaceae pollen with a focus on Bombacoideae. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-2968712_V1

keywords: Malvaceae; superresolution microscopy; Zeiss; Bombacoideae; Neotropics; CZI

published: 2025-10-30

Data for Construction of a Compact Array of Microplasma Jet Devices and Its Application for Random Mutagenesis of Rhodosporidium toruloides

Koh, Hyun Gi; Kim, Jinhong; Rao, Christopher V.; Park, Sung-Jin; Jin, Yong-Su (2025)

A small and efficient DNA mutation-inducing machine was constructed with an array of microplasma jet devices (7 × 1) that can be operated at atmospheric pressure for microbial mutagenesis. Using this machine, we report disruption of a plasmid DNA and generation of mutants of an oleaginous yeast Rhodosporidium toruloides. Specifically, a compact-sized microplasma channel (25 × 20 × 2 mm3) capable of generating an electron density of greater than 1013 cm–3 was constructed to produce reactive species (N2*, N2+, O, OH, and Hα) under helium atmospheric conditions to induce DNA mutagenesis. The length of microplasma channels in the device played a critical role in augmenting both the volume of plasma and the concentration of reactive species. First, we confirmed that microplasma treatment can linearize a plasmid by creating nicks in vitro. Second, we treated R. toruloides cells with a jet device containing 7 microchannels for 5 min; 94.8% of the treated cells were killed, and 0.44% of surviving cells showed different colony colors as compared to their parental colony. Microplasma-based DNA mutation is energy-efficient and can be a safe alternative for inducing mutations compared to conventional methods using toxic mutagens. This compact and scalable device is amenable for industrial strain improvement involving large-scale mutagenesis.

keywords: Conversion;Genome Engineering