Illinois Data Bank Dataset Search Results

published: 2025-04-29
 
This page contains the data for the publication "The pioneer transcription factor Zelda controls the exit from regeneration and restoration of patterning in Drosophila" published in the journal Science Advances.
keywords: Drosophila; regeneration; wing imaginal disc; Zelda
published: 2016-05-19
 
This dataset contains records of four years of taxi operations in New York City and includes 697,622,444 trips. Each trip records the pickup and drop-off dates, times, and coordinates, as well as the metered distance reported by the taximeter. The trip data also includes fields such as the taxi medallion number, fare amount, and tip amount. The dataset was obtained through a Freedom of Information Law request from the New York City Taxi and Limousine Commission. The files in this dataset are optimized for use with the ‘decompress.py’ script included in this dataset. This file has additional documentation and contact information that may be of help if you run into trouble accessing the content of the zip files.
keywords: taxi;transportation;New York City;GPS
published: 2025-05-10
 
This dataset provides instructions for procedures to use heat transfer analyses to estimate thermal conditions in artificial roosts for bats. The dataset contains scripts to employ in the program GNU Octave, example meteorology data, and example text files specifying roost dimensions and material properties.
keywords: Bat box; design; heat storage; heat transfer analysis; insulation; temperature
published: 2025-06-16
 
Data for the publication of Magnetic Fields in the Pillars of Creation (Sarkar et al.). Contains the FITS files and Python scripts.
keywords: HAWC+; SOFIA; Pillars of Creation; M16; Eagle Nebula; Dust Polarization
published: 2019-10-27
 
This dataset accompanies the paper "STREETS: A Novel Camera Network Dataset for Traffic Flow" at Neural Information Processing Systems (NeurIPS) 2019. Included are: *Over four million still images from publicly accessible cameras in Lake County, IL. The images were collected across 2.5 months in 2018 and 2019. *Directed graphs describing the camera network structure in two communities in Lake County. *Documented non-recurring traffic incidents in Lake County coinciding with the 2018 data. *Traffic counts for each day of images in the dataset. These counts track the volume of traffic in each community. *Other annotations and files useful for computer vision systems. Refer to the accompanying "readme.txt" or "readme.pdf" for further details.
keywords: camera network; suburban vehicular traffic; roadways; computer vision
published: 2025-04-17
 
This dataset includes analysis code used to analyze the data involved with swapping photons between superconducting qubits in separate modules through a superconducting coaxial cable bus. The dataset includes Python code to model and plot the data, CAD designs of the modules that hold the superconducting qubits, and high-frequency simulation software files to model the electric fields of the superconducting circuits.
keywords: superconducting qubits; quantum information; modular architecture
published: 2025-06-30
 
This dataset contains measurements of water loss as white-tailed deer (Odocoileus virginianus) retropharyngeal lymph nodes air-dried in a refrigerator for 31 days. Lymph node weights were recorded every 24 hours, as were the variables "firmness" and "surface wetness". "Firmness" is a categorical variable measuring how much the tissue deforms to the touch (soft, medium, or hard). "Surface wetness" is the amount of visible moisture on the outside of the lymph node (all, some, or none). Lymph node weights were measured until their weights stabilized for 3 consecutive days at two decimal places (ex. 3.02, 3.02, 3.02) or until the weights fluctuated only by 0.01 (ex. 3.02, 3.03, 3.02). Lymph nodes were from northern Illinois white-tailed deer collected as part of the Illinois Department of Natural Resources' ongoing chronic wasting disease (CWD) management efforts.
keywords: cervid; lymph node; chronic wasting disease; cwd; diagnostic testing; desiccation; drying; tissue
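The stopping rule described for the lymph node weights can be sketched in a few lines of Python. This is only one reading of the description, not the authors' code: the function names, the three-day window parameter, and the choice to compare weights in whole hundredths are assumptions made for illustration.

```python
def has_stabilized(weights, window=3, tolerance_hundredths=1):
    """Return True if the last `window` daily weights, rounded to two
    decimal places, differ from each other by at most 0.01.

    Weights are compared as whole hundredths to avoid floating-point
    noise (an implementation choice, not something the dataset states).
    """
    if len(weights) < window:
        return False
    recent = [round(w * 100) for w in weights[-window:]]
    return max(recent) - min(recent) <= tolerance_hundredths


def first_stable_day(weights, window=3):
    """Return the 1-based day on which the rule is first met, or None."""
    for day in range(window, len(weights) + 1):
        if has_stabilized(weights[:day], window):
            return day
    return None


# Hypothetical daily weights; the last three match the pattern 3.02, 3.03, 3.02.
example = [3.40, 3.20, 3.10, 3.05, 3.02, 3.03, 3.02]
print(first_stable_day(example))  # → 7
```

Both examples given in the description (3.02, 3.02, 3.02 and 3.02, 3.03, 3.02) satisfy this check, since their spread is at most 0.01.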
published: 2025-06-30
 
This dataset is associated with the manuscript "Residual tau-fluvalinate, a beehive acaricide, disrupts growth and metabolism in the greater wax moth, Galleria mellonella". This dataset includes 2 Excel files: 1) raw_data_bioassay.xlsx: this file contains the raw data for the waxworm bioassay. There are 2 worksheets within this file: - LC50: raw data for measuring the LC50 of Galleria mellonella (greater wax moth) in laboratory and field strains exposed to tau-fluvalinate. - RGR: Relative Growth Rate, raw data for measuring body weight of the field strain of Galleria mellonella exposed to tau-fluvalinate. 2) raw-data_RT-qPCR.xlsx: this file contains raw data (Ct values) of RT-qPCR.
keywords: Apis mellifera; cytochrome P450; tau-fluvalinate; detoxification genes; waxworm
published: 2025-06-26
 
This dataset supports the analysis presented in the study on curbside electric vehicle (EV) charging infrastructure planning in San Francisco and the published paper titled "Urban electric vehicle infrastructure: Strategic planning for curbside charging." It includes spatial data layers and tabular data used to evaluate location suitability under multiple criteria, such as demand, accessibility, and environmental benefits. This dataset can be used to replicate the multi-criteria decision-making framework, perform additional spatial analyses, or inform policy decisions related to EV infrastructure siting in urban environments.
keywords: Electric Vehicles; Curbside Charging Stations; Multi-Criteria Decision-Making; Suitability Analysis; Urban Infrastructure
published: 2025-06-26
 
This dataset encompasses experimental results supporting the upcoming journal paper, "Laboratory-scale assessment of CO2 sealing potential for heterogeneous caprock", which investigates the sealing potential of heterogeneous caprock. The dataset includes the measurements and analyses conducted under controlled laboratory conditions, capturing sealing potential such as permeability and breakthrough pressure.
keywords: Heterogeneity; CO2 breakthrough pressure; Intrinsic permeability; Capillary pressure curve
published: 2025-06-24
 
This supporting information file contains code related to the pending publication Ge et al., Proc. Nat. Acad. Sci. USA (revisions in review). The contents include Mathematica code that solves the Laplace-transformed equations and generates figures from the paper. A Python script is included for generating Figure 5 in the main text.
keywords: Population balance model; Covalent organic framework; Nucleation; Growth
published: 2025-06-23
 
This repository contains data and model weights associated with the publication "Fast and Accurate Prediction of Protein Dynamic Contact Maps from Single Sequences". It includes the datasets used for training and evaluating a dynamic contact prediction model, ESMDynamic, as well as a script for conversion and usage.
keywords: Computational biology; Structural biology; Molecular dynamics; Machine learning; Protein modeling; Bioinformatics; Biophysics; Artificial intelligence
published: 2025-06-16
 
Biometric, ground-based, and eddy covariance flux data for investigating the impact of sugarcane expansion across subtropical Florida on the carbon (C) budget over a three-year rotation. The dataset includes a three-year record of daily fluxes, NPP and SOC input measurements, and estimates of carbon use efficiency and net ecosystem carbon balance in sugarcane, improved pasture, and semi-native pasture following pasture conversion to sugarcane.
keywords: land use change; sugarcane expansion; bioenergy; carbon budget; CUE; NECB
published: 2025-05-05
 
The dataset includes responses from approximately 550 participants to survey questions about trust in images labeled with AI-related tags, compared to other images found online. The questions also explore how the type of label influences their trust.
keywords: Artificial intelligence (AI); Trust in AI; AI labeling; AI ethics
published: 2025-06-05
 
There are two files in this dataset.
File 1: AffiNorm
AffiNorm contains 1,001 rows, including one header row, randomly sampled from MapAffil 2018 Dataset (https://doi.org/10.13012/B2IDB-2556310_V1; https://databank.illinois.edu/datasets/IDB-2556310). Each row in the file corresponds to a particular author on a particular PubMed record, and contains the following 26 columns, comma-delimited. All columns are ASCII, except city which contains Latin-1.
COLUMN DESCRIPTION
1. PMID: the PubMed identifier. int.
2. ORDER: the position of the author. int.
3. YEAR - The year of publication. int(4), eg: 1975.
4. affiliation - affiliation string of the author. eg: Department of Pathology, University of Chicago, Illinois 60637.
5. annotation_type: the number of institutions annotated, denoted by S, M, O, or Z, where "S" (single) indicates 1 institution was annotated; "M" (Multiple) indicates more than one institutions were annotated; "O" (Out of Vocabulary or None) indicates no institution was annotated, but an institution was apparently mentioned; "Z" indicates no institution was mentioned.
6. Institution: the standard name(s) of the annotated institution(s), according to ROR. if "S" (single institution), it is saved as a string, eg: University of Chicago; if "M", it is saved as a string that looks like a python list, eg: ['Public Health Laboratory Service'; 'Centre for Applied Microbiology and Research']; if "O" or "Z", then blank.
7. inst_type: the type of institution, according to ROR. the potential values are: education, funder, healthcare, company, archive, nonprofit, government, facility, other. An institution may have more than one type, eg: ['Education', 'Funder']
8. type_edu: TRUE if the inst_type contains "Education"; FALSE otherwise.
9. RORid: ROR identifier(s), eg: https://ror.org/05hs6h993. when multiple, the order corresponds to institution (column 6)
10. RORid_label: the standard name(s) of the annotated institution(s) according to ROR. same as institution (column 6)
11. GRIDid: GRID identifier(s). eg: grid.170205.1
12. GRIDid_label: the standard name(s) of the annotated institution(s) according to GRID. eg: University of Chicago.
13. WikiDataid: WikiData identifier(s). eg: Q131252
14. WikiDataid_label: the standard name(s) of the annotated institution(s) according to WikiData. eg: University of Chicago
15. synonyms: a comma separated list of variant names from InsVar (file 2). format of string. eg: University of Chicago, Chicago University, U of C, UChicago, uchicago.edu, U Chicago, ...
16. MapAffil-grid: GRID from the MapAffil 2018 Dataset.
17. MapAffil-grid_label: The standard name of institution from MapAffil 2018 Dataset.
18. judge_mapA: TRUE if GRIDid (column 11) contains MapAffil-grid (column 16); FALSE otherwise.
19. MapAffiltemporal-grid: GRID from the temporal version of MapAffil, http://abel.ischool.illinois.edu/data/MapAffilTempo2018.tsv.gz
20. MapAffiltemporal-grid_label: The standard name of institution from MapAffilTemporal 2018 Dataset.
21. judge_mapT: TRUE if GRIDid (column 11) contains MapAffiltemporal-grid (column 19); FALSE otherwise.
22. RORapi_query_id: ROR from ROR api tool (query endpoint)
23. RORapi_query_id_label: The standard name of institution from ROR api tool (query endpoint). format in string.
24. judge_rorapi_affiliation: TRUE if RORid (column 9) contains RORapi_query_id (column 22); FALSE otherwise.
25. rorapi_affiliation_id: ROR from ROR api tool (affiliation endpoint).
26. judge_rorapi_affiliation: TRUE if RORid (column 9) contains RORapi_affiliation (column 25); FALSE otherwise.
File 2: insVar.json
InsVar is a supplementary dataset for AffiNorm, which includes the institution ID and its redirected aliases from wikidata. The institution ID list is from GRID, the redirected aliases are from wiki api, for example: https://en.wikipedia.org/wiki/Special:WhatLinksHere?target=University+of+Illinois+Urbana-Champaign&namespace=&hidetrans=1&hidelinks=1&limit=100
In InsVar, the data is saved in a python dictionary format. the key is the GRID identifier, for example: "grid.1001.0" (Australian National University), and the value is a list of redirected aliases strings.
{"grid.1001.0": ["ANU", "ANU College", "ANU College of Arts and Social Sciences", "ANU College of Asia and the Pacific", "ANU Union", "ANUSA", "Asia Pacific Week", "Australia National University", "Australian Forestry School", "the Australian National University", ...], "grid.1002.3": ...}
keywords: PubMed; MEDLINE; Digital Libraries; Bibliographic Databases; Institution Names; Author Affiliations; Institution Name Ambiguity; Authority files
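As an illustration of how the two AffiNorm structures described above might be read, the following Python sketch parses the Institution column according to annotation_type and looks up aliases in an InsVar-style dictionary. The function names are hypothetical. The sketch assumes insVar.json is valid JSON, and that multi-institution values are quoted names inside brackets, as in the example given for column 6. The sample values are copied from the description rather than read from the files.

```python
import json
import re

# Matches a single- or double-quoted name inside a list-like string.
_QUOTED = re.compile(r"'([^']*)'|\"([^\"]*)\"")


def parse_institutions(annotation_type, institution):
    """Return the annotated institution names as a list.

    "S" holds one plain string, "M" holds a string that looks like a
    Python list (the example in the description separates items with
    ';'), and "O"/"Z" are blank.
    """
    kind = annotation_type.strip().upper()
    if kind == "S":
        return [institution.strip()]
    if kind == "M":
        return [single or double for single, double in _QUOTED.findall(institution)]
    return []


def aliases_for(ins_var, grid_id):
    """Return the redirected aliases for a GRID identifier, or an empty list."""
    return ins_var.get(grid_id, [])


# Sample values taken from the dataset description.
multi = "['Public Health Laboratory Service'; 'Centre for Applied Microbiology and Research']"
print(parse_institutions("M", multi))

ins_var = json.loads('{"grid.1001.0": ["ANU", "ANU College", "Australia National University"]}')
print(aliases_for(ins_var, "grid.1001.0"))
```

Extracting quoted names with a regular expression, rather than evaluating the string as a Python literal, sidesteps the semicolon separator shown in the column 6 example, which is not valid Python list syntax.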
published: 2025-06-10
 
This dataset contains all the raw and processed data used to generate the figures presented in the main text and the supplementary information of the paper "Operation of a high frequency, phase slip qubit." It also includes code for data analysis and code for generating the figures.
keywords: phase slip qubit; superconducting qubit; quantum information; disordered superconductors
published: 2021-08-27
 
The dataset shows all poison frogs (superfamily Dendrobatoidea) in private U.S. collections during 1990–2020. For each species and color morph, there is a date of arrival, the way it arrived in U.S. collections, and detailed notes related to its presence in the pet trade.
keywords: pet trade; amphibians; Dendrobatidae
published: 2025-06-06
 
The materials used to provide Continuing Medical Education on ticks and tick-borne diseases in Illinois on February 1, 2023 at Carle Hospital, along with the pre- and post-quiz and deidentified data of the quiz takers. Files: "Ticks and Tick-borne Diseases of Illinois_Final_w_speaker_notes.pptx": Presentation slides used for CME course, with notes to indicate verbal commentary "CME assessment_final.docx": Pre- and post-CME quiz questions and answers, annotated to indicate correct answers and reasoning for incorrect answers "CME_prequiz_data_for_sharing.csv": De-identified data from pre-CME quiz "CME_postquiz_data_for_sharing.csv": De-identified data from post-CME quiz, including demographics "DataCleaning_forSharing.R": R file used to clean the raw data and calculate the scores "ReadMe.txt":
keywords: tick-borne disease; CME
published: 2025-06-03
 
GIS data and geoprocessing tools associated with the White and Lambert (2025) modeling paper that assesses the potential impact of development on the archaeological resources of Illinois.
keywords: development; archaeology; climate change; GIS
published: 2025-06-04
 
These datasets contain the complete output from a Monte Carlo simulation of the number of wild cervids to test for chronic wasting disease (CWD) depending on true prevalence. Five CSVs of the simulation results are provided, split due to limitations in file size. The R code used to run the simulation and process the data is included. The data to replicate Table 1 and the data used to compare the simulation results to the CWD surveillance efforts of the Illinois Department of Natural Resources (IDNR) are also provided.
keywords: chronic wasting disease; cwd; cervid; test; sample size; diagnostic testing; surveillance
published: 2025-06-03
 
This dataset contains peptide imaging data obtained by matrix-assisted laser desorption/ionization trapped ion mobility spectrometry (MALDI-TIMS) from the central nervous system and select ganglia of Aplysia californica.
keywords: Neuropeptides, Isomerization, D-amino acids, MALDI-TIMS
published: 2019-06-13
 
This lexicon is the expanded/enhanced version of the Moral Foundation Dictionary created by Graham and colleagues (Graham et al., 2013). Our Enhanced Morality Lexicon (EML) contains a list of 4,636 morality related words. This lexicon was used in the following paper - please cite this paper if you use this resource in your work. Rezapour, R., Shah, S., & Diesner, J. (2019). Enhancing the measurement of social effects by capturing morality. Proceedings of the 10th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis (WASSA). Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), Minneapolis, MN. In addition, please consider citing the original MFD paper: Graham, J., Haidt, J., Koleva, S., Motyl, M., Iyer, R., Wojcik, S. P., & Ditto, P. H. (2013). Moral foundations theory: The pragmatic validity of moral pluralism. In Advances in experimental social psychology (Vol. 47, pp. 55-130). https://doi.org/10.1016/B978-0-12-407236-7.00002-4
keywords: lexicon; morality
published: 2025-02-14
 
This dataset includes the original data (including photographs as .jpg files and sound recordings as .wav files) and detailed descriptions of workflows for analyses of acoustic and morphometric data for the Neoaliturus tenellus (beet leafhopper) species complex. Files needed for different parts of the two analytical workflows are included in the "Acoustics.zip" and "PCA.zip" archives. The "Folder Structure.png" file contains a diagram of the folder structure of the two archives. Each archive contains a "ReadMe" file with instructions for repeating the analyses. File and folder names including the two-letter abbreviations TB, TD, TN and TP refer to four different putative species (operational taxonomic units, or OTUs) of the Neoaliturus tenellus complex.
keywords: Hemiptera; Cicadellidae; integrative taxonomy; courtship; morphology
published: 2025-04-23
 
These data files were used for phylogenomic analyses of Darnini and related Membracidae (Hemiptera: Auchenorrhyncha) in the referenced article by Gonzalez-Mozo et al.
- The "mem_50p_alignment.fas" file contains the aligned, concatenated nucleotide sequence data for 51 species and 492 genetic loci included in the phylogenetic analyses ("N" indicates missing data and "-" indicates an alignment gap).
- The file "Table1.rtf" lists the included species, country of origin and GenBank accession number. Species newly sequenced for this study have a Sample ID with prefix "DAR"; previously sequenced species for which data were downloaded from GenBank have "NCBI" indicated in the same column of the table.
- The file "partition_def.txt" lists the 492 genetic loci included in the alignment with their exact positions indicated by the range of numbers given at the end of each line (e.g., locus "uce-1" occupies positions 1-280 in the alignment).
- The substitution model file "mem_50p.model" contains information on the substitution models used in the partitioned maximum likelihood analysis, including the models used for different data partitions and parameter values, as output by the phylogenetic software IQ-TREE.
- Individual tree files in Newick format (plain text) are provided for the phylogeny from concatenated analysis with the best likelihood score ("mem_50p_bestLikelihoodScore"), concatenated likelihood analysis with gene concordance factors ("mem_50p_gcf") and site concordance factors ("mem_50p_scf").
- The tree file from the ASTRAL analysis is "mem_50p_astral".
- The zip archive entitled “IQ-TREE analysis results.zip” includes output from the maximum likelihood analysis of the concatenated nucleotide sequence data, including the following: (1) main output file “mem_50p.iqtree” summarizing model selection, partitioning schemes, likelihood scores, and run parameters; (2) “mem_50p.mldist” including pairwise ML distances between taxa; (3) “mem_50p.best_scheme.nex” with the best partitioning scheme identified by ModelFinder in NEXUS format and (4) “mem_50p.best_scheme”, the RAxML-compatible version of the same file.
- The “Ultrafast bootstrap results.zip” zip archive contains: (1) “mem_50p.ufboot” with the bootstrap replicate trees; (2) “mem_50p.contree” with the majority-rule consensus tree with support values; (3) “mem_50p.splits.nex”, with split support values across the replicates; (4) “mem_50p.log”, the log file.
- The “gene_trees.zip” zip archive contains the individual gene trees as input for subsequent coalescent gene tree analysis in the phylogenetic program ASTRAL.
- The file "DarniniAHE_Character Matrix.csv" contains the data for 6 morphological characters for which the ancestral states were reconstructed using the phylogenetic results from analysis of anchored-hybrid data (see article text for details).
- The file "scriptACRDarnini.txt" contains the commands used to reconstruct ancestral morphological character states using the corHMM 2.8 R package. See the Methods section of the article for more details.
keywords: Insecta; Hemiptera; anchored-hybrid enrichment; phylogeny; treehopper
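The description of "partition_def.txt" says each line ends with the range of alignment positions for one locus. A minimal Python sketch of how such a range could be read and used to cut a locus out of an aligned sequence follows. The exact line layout is not given in the description, so the sample line below is hypothetical, as are the function names. Positions are assumed to be 1-based and inclusive, which matches the example of "uce-1" occupying positions 1-280.

```python
import re

# A "start-end" range at the very end of a line, allowing trailing
# punctuation or whitespace (the real file layout is an assumption).
_TRAILING_RANGE = re.compile(r"(\d+)\s*-\s*(\d+)\D*$")


def trailing_range(line):
    """Return (start, end) parsed from the end of a partition line, or None."""
    match = _TRAILING_RANGE.search(line)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def slice_locus(sequence, start, end):
    """Extract 1-based, inclusive positions start..end from an aligned sequence."""
    return sequence[start - 1:end]


# Hypothetical line layout; only the trailing range is described in the source.
start, end = trailing_range("uce-1 = 1-280")
toy_alignment_row = "ACGTN-ACGT" * 30  # 300 toy characters, not real data
locus = slice_locus(toy_alignment_row, start, end)
print(start, end, len(locus))
```

Because "N" marks missing data and "-" marks alignment gaps in the alignment file, a slice taken this way keeps those characters in place and preserves column positions within the locus.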