Illinois Data Bank Dataset Search Results
Results
published:
2024-08-24
Jones, Todd; Llamas, Alfredo; Phillips, Jennifer
(2024)
Dataset associated with Jones et al. GCB-23-1273.R1 submission: Phenotypic signatures of urbanization? Resident, but not migratory, songbird eye size varies with urban-associated light pollution levels. Excel CSV file with all of the data used in analyses and file with descriptions of each column.
keywords:
body size; demographics; eye size; phenotypic divergence; songbirds; sensory pollution; urbanization
published:
2023-12-18
Edmonds, Devin; Adamovicz, Laura; Allender, Matthew; Colton, Andrea; Nyboer, Randy; Dreslik, Michael
(2023)
We conducted long-term capture-mark-recapture surveys on two isolated ornate box turtle (Terrapene ornata) populations in northern Illinois, USA. This dataset provides the capture history strings and additional demographic information used for estimating population vital rates with robust design capture-mark-recapture models. The vital rates were then used in a stage-based population projection matrix model for each population.
keywords:
demography; capture-mark-recapture; vital rates; conservation; wildlife ecology
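To illustrate how vital rates feed a stage-based population projection matrix model like the one this record describes, here is a minimal sketch in plain Python. The two-stage structure (juvenile, adult) and every numeric rate below are hypothetical placeholders, not values from the dataset, and the power-iteration estimate assumes a primitive matrix (one whose powers eventually become strictly positive).

```python
def project(matrix, population):
    """Advance a stage-structured population one time step: n(t+1) = A n(t)."""
    return [sum(a * n for a, n in zip(row, population)) for row in matrix]


def growth_rate(matrix, steps=500):
    """Estimate the asymptotic growth rate (dominant eigenvalue) by power iteration."""
    population = [1.0] * len(matrix)
    rate = 0.0
    for _ in range(steps):
        nxt = project(matrix, population)
        total = sum(nxt)
        rate = total / sum(population)
        # Rescale so the stage vector sums to 1 and never overflows.
        population = [n / total for n in nxt]
    return rate


# Hypothetical vital rates, for illustration only.
juvenile_stay = 0.60   # juveniles surviving and remaining juvenile
juvenile_grow = 0.20   # juveniles surviving and becoming adult
adult_survive = 0.90   # adult annual survival
fecundity = 1.20       # juveniles produced per adult per year

A = [
    [juvenile_stay, fecundity],
    [juvenile_grow, adult_survive],
]

next_year = project(A, [10.0, 5.0])
lam = growth_rate(A)
print(next_year, lam)
```

A growth rate above 1 indicates a growing population under the assumed rates and a value below 1 a declining one. A real analysis would use the estimated vital rates and stage definitions from the capture-mark-recapture models.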
published:
2022-03-19
McCoy, Annette; Secor, Erica; Roady, Patrick; Gray, Sarah; Klein, Julie; Gutierrez-Nibeyro, Santiago
(2022)
Raw arthroscopic scores, histologic scores, cytokine measurements, and performance data for the study cohort described in the accompanying publication.
keywords:
horse; metatarsophalangeal joint; arthroscopy; exercise; developmental orthopedic disease
published:
2025-07-21
Feng, Jennifer T.; van den Berg, Thya; Donders, Timme H.; Kong, Shu; Puthanveetil Satheesan, Sandeep; Punyasena, Surangi W.
(2025)
This dataset includes image stacks, annotated counts, and ground-truth masks from two high-resolution sediment cores extracted from Laguna Pallcacocha, in El Cajas National Park, Ecuadorian Andes by Moy et al. (2002) and Hagemans et al. (2021). The first core (PAL 1999, from Moy et al. (2002)) extends through the Holocene (11,600 cal. yr. BP - present). There are a total of 900 annotated image stacks and masks in the PAL 1999 domain. The second core (PAL IV, from Hagemans et al. (2021)) captures the 20th century. There are 2986 annotated image stacks and masks in the PAL IV domain.
Different microscopes and annotation tools were used to image and annotate each core, and there are corresponding differences in naming conventions and file formats. Thus, we organized our data separately for the PAL 1999 and the PAL IV domains. The three-letter codes used to label our pollen annotations are in the file: “Pollen_Identification_Codes.xlsx”.
Both domain directories contain:
• Image stacks organized by subdirectory
• Annotations within each image stack directory, containing specimen identifications using a three-letter code and coordinates defining bounding boxes or circles
• Ground-truth distance-transform masks for each image stack
The zip file "bestValModel_encoder.paramOnly.zip" is the trained pollen detection model produced from the images and annotations in this dataset.
Please cite this dataset as:
Feng, Jennifer T.; van den Berg, Thya; Donders, Timme H.; Kong, Shu; Puthanveetil Satheesan, Sandeep; Punyasena, Surangi W. (2025): Slide scans, annotated pollen counts, and trained pollen detection models for fossil pollen samples from Laguna Pallcacocha, El Cajas National Park, Ecuador. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-4207757_V1
Please also include citations of the original publications from which these data are taken:
Feng, Jennifer T., Sandeep Puthanveetil Satheesan, Shu Kong, Timme H. Donders, and Surangi W. Punyasena. “Addressing the ‘Open World’: Detecting and Segmenting Pollen on Palynological Slides with Deep Learning.” bioRxiv, January 1, 2025. https://doi.org/10.1101/2025.01.05.631390.
Feng, Jennifer T., Sandeep Puthanveetil Satheesan, Shu Kong, Timme H. Donders, and Surangi W. Punyasena. “Addressing the ‘Open World’: Detecting and Segmenting Pollen on Palynological Slides with Deep Learning.” Paleobiology, 2025 [in press].
Feng, J. T. (2023). Open-world deep learning applied to pollen detection (MS thesis, University of Illinois at Urbana-Champaign). https://hdl.handle.net/2142/120168
keywords:
continual learning; deep learning; domain gaps; open-world; palynology; pollen grain detection; taxonomic bias
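The pollen record above mentions circle annotations and ground-truth distance-transform masks. As a rough illustration of the general idea, the sketch below builds a small mask in which each pixel inside an annotated circle stores its distance to the circle edge. The function name, the `(cx, cy, radius)` tuple layout, and the choice to take the maximum where circles overlap are all assumptions made for this example. The dataset's actual mask format and generation procedure are not described in this listing.

```python
import math


def circle_distance_mask(width, height, circles):
    """Return a height x width grid: pixels inside a circle hold their
    distance to that circle's edge; background pixels hold 0.0."""
    mask = [[0.0] * width for _ in range(height)]
    for cx, cy, radius in circles:
        for y in range(height):
            for x in range(width):
                # Positive inside the circle, zero on the edge, negative outside.
                depth = radius - math.hypot(x - cx, y - cy)
                if depth > mask[y][x]:
                    mask[y][x] = depth
    return mask


# One hypothetical annotation: a circle centered at (2, 2) with radius 2.
mask = circle_distance_mask(5, 5, [(2, 2, 2)])
for row in mask:
    print(" ".join(f"{value:.2f}" for value in row))
```

Values peak at the center of each grain and fall to zero at its boundary, which is what lets a detector trained on such masks separate touching specimens. For full-size images, a library routine such as a Euclidean distance transform on a binary mask would be far faster than this brute-force loop.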
published:
2025-01-30
Raw data associated with PMID: 38925247
published:
2025-01-30
Zhang, Yufan; Bhattarai, Rabin
(2025)
Research data for the manuscript "A Framework of Simulating Structural Sediment Perimeter Barriers using VFSMOD."
keywords:
sediment control
published:
2022-06-01
Southey, Bruce; Rodriguez-Zas, Sandra L.
(2022)
This dataset contains information for the paper "Changes in neuropeptide prohormone genes among Cetartiodactyla livestock and wild species associated with evolution and domestication," Veterinary Sciences, MDPI. Protein sequences were predicted using GeneWise for 98 neuropeptide prohormone genes from publicly available genomes of 118 Cetartiodactyla species. All predictions (CetartiodactylaSequences2022.zip) were manually verified. Sequences were aligned within each prohormone using MAFFT (MDPImultalign2022.zip includes the multiple sequence alignment of all species available for each prohormone). Phylogenetic gene trees were constructed using PhyML and the species tree was constructed using ASTRAL (MDPItree2022.zip). The data are released under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International license (CC BY-NC-ND 4.0).
keywords:
prohormone; neuropeptide; Cetartiodactyla; phylogenetics; gene tree; species tree
published:
2025-09-23
Zhao, Huimin; Chen, Li-Qing; Martin, Teresa; Xue, Xueyi; Singh, Nilmani; Tan, Shi-I; Boob, Aashutosh
(2025)
Mitochondria play a key role in energy production and metabolism, making them a promising target for metabolic engineering and disease treatment. However, despite the known influence of passenger proteins on localization efficiency, only a few protein-localization tags have been characterized for mitochondrial targeting. To address this limitation, we leverage a Variational Autoencoder to design novel mitochondrial targeting sequences. In silico analysis reveals that a high fraction of the generated peptides (90.14%) are functional and possess features important for mitochondrial targeting. We characterize artificial peptides in four eukaryotic organisms and, as a proof-of-concept, demonstrate their utility in increasing 3-hydroxypropionic acid titers through pathway compartmentalization and improving 5-aminolevulinate synthase delivery by 1.62-fold and 4.76-fold, respectively. Moreover, we employ latent space interpolation to shed light on the evolutionary origins of dual-targeting sequences. Overall, our work demonstrates the potential of generative artificial intelligence for both fundamental research and practical applications in mitochondrial biology.
keywords:
AI/ML; metabolic engineering; modeling; software
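The abstract above refers to latent space interpolation in a Variational Autoencoder. A minimal sketch of the basic operation follows: walking in a straight line between two latent vectors. The function name and the example vectors are hypothetical, and the listing does not say whether the authors used linear or another (for example spherical) interpolation scheme. In practice each intermediate vector would be passed through the trained decoder to obtain a peptide sequence.

```python
def interpolate_latent(z_start, z_end, steps):
    """Return `steps` evenly spaced latent vectors from z_start to z_end (inclusive)."""
    if steps < 2:
        raise ValueError("steps must be at least 2")
    if len(z_start) != len(z_end):
        raise ValueError("latent vectors must have the same length")
    path = []
    for i in range(steps):
        t = i / (steps - 1)  # runs from 0.0 to 1.0
        path.append([(1 - t) * a + t * b for a, b in zip(z_start, z_end)])
    return path


# Two hypothetical 2-dimensional latent codes.
for z in interpolate_latent([0.0, 2.0], [1.0, 0.0], steps=3):
    print(z)
```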
published:
2017-06-16
Haselhorst, Derek S.; Tcheng, David K.; Moreno, J. Enrique; Punyasena, Surangi W.
(2017)
Table S2. Raw pollen counts and climatic data for each seasonal sampling period. Climatic data reflect the average daily conditions observed over the period during which samples were collected (°C/day, mm/day, MJ/m²/day). Lycopodium counts and counts for each pollen taxon reflect the aggregated pollen sum from four sampling heights.
keywords:
pollen; count; climate; data; BCI; PNSL; Panama
published:
2020-11-25
Barker, Louise; Gaulke, Sarah M.; Chace, Jordyn Z.; Davis, Mark A.; Niemiller, Matthew L.; Taylor, Steven J.; Schuett, Gordon W.
(2020)
Video recorded by Louise Barker using a Canon PowerShot camera documents late-season combat behavior in Agkistrodon contortrix. Recorded in Beaufort County, North Carolina, 11.1 km SE of downtown Washington on 21 October 2020.
keywords:
Agkistrodon contortrix; combat; mating; reproduction; copperhead; pit viper; Viperidae
published:
2017-06-16
Haselhorst, Derek S.; Tcheng, David K.; Moreno, J. Enrique; Punyasena, Surangi W.
(2017)
Table S3. Mean slope response for each predictive model used in the ecoinformatic analysis. Mean responses are provided for each seasonal and annual pollen data set analyzed from BCI and PNSL and are summarized by life form. Calculated p-values are provided for each model.
keywords:
pollen; response; climate; ecoinformatics; BCI; PNSL; Panama
published:
2020-12-15
Khanna, Madhu; Chen, Xiaoguang; Wang, Weiwei; Oliver, Anthony
(2020)
The dataset consists of results and various input data used in the GAMS model for the publication "Repeal of the Clean Power Plan: Social Cost and Distributional Implications". All the data are either Excel files or in the .inc format, which can be read within GAMS or Notepad. Main data sources include agriculture, transportation, and electricity data. Model details can be found in the paper and the GAMS model package.
keywords:
carbon abatement; welfare cost; electricity sector; partial equilibrium model
published:
2023-10-26
Louie, Allison Y.; Rund, Laurie A.; Komiyama-Kasai, Karin A.; Weisenberger, Kelsie E.; Stanke, Kayla L.; Larsen, Ryan J.; Leyshon, Brian J.; Kuchan, Matthew J.; Das, Tapas; Steelman, Andrew J.
(2023)
This dataset contains MRI data and Imaris modeling analysis of CLARITY-cleared, immunostained tissue associated with a study that assessed the effects of lipid blends containing various levels of a hydrolyzed fat system on myelin development in healthy neonatal piglets. Data are from thirty-two piglets of mixed sexes across four diet treatment groups and include a sow-fed reference group. MRI data (presented in Figure 2 of the associated article) consist of volumetric data from Voxel-Based Morphometry analysis in brain grey matter and white matter, as well as mean fractional anisotropy and mean orientation dispersion index data from Tract-Based Spatial Statistics analysis. Imaris data (presented in Figure 3 of the associated article) consist of twenty-one select output measures from 3D modeling analysis of PLP-stained prefrontal cortex tissue. All methods used for collection/generation/processing of data are described in the associated article: Louie AY, Rund LA, Komiyama-Kasai KA, Weisenberger KE, Stanke KL, Larsen RJ, Leyshon BJ, Kuchan MJ, Das T, Steelman AJ. A hydrolyzed lipid blend diet promotes myelination in neonatal piglets in a region and concentration-dependent manner. J Neurosci Res. 2023.
keywords:
myelin; dietary lipid; white matter; CLARITY; Imaris; voxel-based morphometry; diffusion tensor imaging
published:
2025-09-15
Zhao, Yang; Kim, Jae Y.; Karan, Ratna; Jung, Je Hyeong; Pathak, Bhuvan; Williamson, Bruce; Kannan, Baskaran; Wang, Duoduo; Fan, Chunyang; Yu, Wenjin; Dong, Shujie; Srivastava, Vibha; Altpeter, Fredy
(2025)
Sugarcane, a tropical C4 grass in the genus Saccharum (Poaceae), accounts for nearly 80% of sugar produced worldwide and is also an important feedstock for biofuel production. Generating transgenic sugarcane with predictable and stable transgene expression is critical for crop improvement. In this study, we generated a highly expressed single-copy locus as a landing pad for transgene stacking. In NPTII ELISA analysis, transgenic sugarcane lines with stable integration of a single-copy nptII expression cassette flanked by insulators showed higher transgene expression and reduced line-to-line variation compared with single-copy events without insulators. Subsequently, the nptII selectable marker gene was efficiently excised from the sugarcane genome by the FLPe/FRT site-specific recombination system to create selectable-marker-free plants. This study provides valuable resources for future gene stacking using site-specific recombination or genome editing tools.
keywords:
Feedstock Production; Biomass Analytics; Genomics
published:
2017-06-15
Christensen, Sarah; Molloy, Erin K.; Vachaspati, Pranjal; Warnow, Tandy
(2017)
Datasets used in the study, "Optimal completion of incomplete gene trees in polynomial time using OCTAL," presented at WABI 2017.
keywords:
phylogenomics; missing data; coalescent-based species tree estimation; gene trees
published:
2019-05-16
The associated data sets include information on stable isotopes from organic matter sources in high elevation lakes, the percentage of production assimilated from the different sources of organic matter, and the relationship between different metrics for trophic position and environmental variables.
keywords:
Stable isotopes; macroinvertebrate production; trophic position
published:
2022-09-28
Inagaki, Akino; Allen, Maximilian; Koike, Shinsuke
(2022)
Data from a field survey at Nikko National Park in central Japan. Data contain information about deer carcasses, site environments, and vertebrate scavenging.
keywords:
Carcass; Cervus nippon; Detection; Facultative scavenging; Obligate scavenger
published:
2017-02-23
GBS data from diverse sorghum lines. Project funded by DOE, ARPA-E, and startup funds to PJ Brown.
published:
2022-02-14
Dataset associated with Allen et al. (In Review):
Food caching by a solitary large carnivore supports optimal foraging theory
If using this dataset, please cite this manuscript.
published:
2017-03-08
Thapa, Sita; Schroeder, Nathan; Patel, Jayna; Reuter-Carlson, Ursula
(2017)
This dataset documents early embryogenesis and post-embryonic development of the soybean cyst nematode.
keywords:
Soybean cyst nematode; Embryogenesis; Post-embryonic development
published:
2023-12-01
Hohoff, Tara; Deppe, Jill
(2023)
Mist netting data for little brown bats (Myotis lucifugus) in McHenry County, Illinois and output of acoustic data processed using Kaleidoscope (Version 5.1.9, Bats of North America 5.1.0; Wildlife Acoustics) auto-identification software. Associated survey metadata and landcover metrics calculated using Fragstats included.
keywords:
little brown bats; mist netting; acoustics
published:
2017-02-21
GBS data from biparental sorghum populations provided by Dr. Bill Rooney, TAMU. Data produced and analyzed by Pradeep Hirannaiah to study recombination in sorghum. Funding for this study was provided by the Sorghum Checkoff.
published:
2017-03-07
Mickalide, Harry; Fraebel, David T.; Kuehn, Seppe
(2017)
This is a sample 5-minute video of an E. coli bacterium swimming in a microfluidic chamber, along with supplementary code files to be used with the MATLAB code available at https://github.com/dfraebel/CellTracking
published:
2017-05-22
Nelson, Kirsten; Collins, Scott; Sass, Greg; Wahl, David
(2017)
Data (fish growth, prey responses) from a series of experiments examining interspecific interactions between native and invasive juvenile fishes.