Illinois Data Bank
Deposit Dataset
Find Data
Policies
Guides
Contact Us
Log in with NetID
Toggle navigation
Illinois Data Bank
Deposit Dataset
Find Data
Policies
Guides
Contact Us
Log in with NetID
Displaying 76 - 100 of 714 in total
<
1
2
3
4
5
6
7
8
9
…
28
29
>
25 per page
50 per page
Show All
Go
Clear Filters
Generate Report from Search Results
Subject Area
Life Sciences (388)
Social Sciences (139)
Physical Sciences (103)
Technology and Engineering (67)
Uncategorized
Arts and Humanities (1)
Funder
Other (223)
U.S. National Science Foundation (NSF) (203)
U.S. Department of Energy (DOE) (74)
U.S. National Institutes of Health (NIH) (69)
U.S. Department of Agriculture (USDA) (47)
Illinois Department of Natural Resources (IDNR) (20)
U.S. Geological Survey (USGS) (7)
U.S. National Aeronautics and Space Administration (NASA) (6)
Illinois Department of Transportation (IDOT) (4)
U.S. Army (2)
Publication Year
2024 (113)
2021 (108)
2022 (107)
2020 (96)
2023 (77)
2019 (72)
2018 (61)
2017 (36)
2016 (30)
2025 (9)
2009 (1)
2011 (1)
2012 (1)
2014 (1)
2015 (1)
License
CC0 (389)
CC BY (304)
custom (21)
Illinois Data Bank Dataset Search Results
Dataset Search Results
published: 2021-02-24
Bieri, Carolina A.; Dominguez, Francina (2021): Southeastern South America Soil Moisture Alteration Experiment Using CESM2. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3047783_V1
This dataset contains model output from the Community Earth System Model, Version 2 (CESM2; Danabasoglu et al. 2020). These data were used for analysis in Impacts of Large-Scale Soil Moisture Anomalies in Southeastern South America, published in the Journal of Hydrometeorology (DOI: 10.1175/JHM-D-20-0116.1). See this publication for details of the model simulations that created these data. Four NetCDF (.nc) files are included in this dataset. Two files correspond to the control simulation (FHIST_SP_control) and two files correspond to a simulation with a dry soil moisture anomaly imposed in southeastern South America (FHIST_SP_dry; see the publication mentioned in the preceding paragraph for details on the spatial extent of the imposed anomaly). For each simulation, one file corresponds to output from the atmospheric model (file names with "cam") of CESM2 and the other to the land model (file names with "clm2"). These files are raw CESM output concatenated into a single file for each simulation. All files include data from 1979-01-02 to 2003-12-31 at a daily resolution. The spatial resolution of all files is about 1 degree longitude x 1 degree latitude. Variables included in these files are listed or linked below. Variables in atmosphere model output: Vertical velocity (omega) Convective precipitation Large-scale precipitation Surface pressure Specific humidity Temperature (atmospheric profile) Reference temperature (temp. at reference height, 2 meters in this case) Zonal wind Meridional wind Geopotential height Variables in land model output: See https://www.cesm.ucar.edu/models/cesm1.2/clm/models/lnd/clm/doc/UsersGuide/history_fields_table_40.xhtml Note that not all of the variables listed at the above link are included in the land model output files in this dataset. This material is based upon work supported by the National Science Foundation under Grant No. 1454089. We acknowledge high-performance computing support from Cheyenne (doi:10.5065/D6RX99HX) provided by NCAR's Computational and Information Systems Laboratory, sponsored by the National Science Foundation. The CESM project is supported primarily by the National Science Foundation. We thank all the scientists, software engineers, and administrators who contributed to the development of CESM2. References Danabasoglu, G., and Coauthors, 2020: The Community Earth System Model Version 2 (CESM2). Journal of Advances in Modeling Earth Systems, 12, e2019MS001916, https://doi.org/10.1029/2019MS001916.
keywords:
Climate modeling; atmospheric science; hydrometeorology; hydroclimatology; soil moisture; land-atmosphere interactions
published: 2021-02-25
Ferin, Kelsie; Chen, Luoye; Zhong, Jia; Heaton, Emily; Khanna, Madhu; VanLoocke, Andy (2021): Simulated Land Allocation, Nitrogen Use, and Nitrogen Loss in the Mississippi Atchafalaya River Basin for Various RFS2 (Renewable Fuel Standard) Policy Scenarios. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3388479_V1
Total nitrogen leaching rates were calculated over the Mississippi Atchafalaya River Basin (MARB) using an integrated economic-biophysical modeling approach. Land allocation for corn production and total nitrogen application rates were calculated for crop reporting districts using the Biofuel and Environmental Policy Analysis Model (BEPAM) for 5 RFS2 policy scenarios. These were used as input in the Integrated BIosphere Simulator-Agricultural Version (Agro-IBIS) and the Terrestrial Hydrologic Model with Biogeochemistry (THMB) to calculate the nitrogen loss. Land allocation and total nitrogen application simulations were simulated for the period 2016-2030 for 303 crop reporting districts (https://www.nass.usda.gov/Data_and_Statistics/County_Data_Files/Frequently_Asked_Questions/county_list.txt). The final 2030 values are reported here. Both are stored in csv files. Units for land allocation are million ha and nitrogen application are million kg. The nitrogen leaching rates were modeled with a spatial resolution of 5' x 5' using the North American Datum of 1983 projection and stored in NetCDF files. The 30-year average is calculated over the last 30 years of the 45 years being simulated. Leaching rates are calculated in kg-N/ha.
keywords:
nitrogen leaching, bioethanol, bioenergy crops
published: 2021-03-14
Kang, Jeon-Young; Michels, Alexander; Lyu, Fangzheng; Wang, Shaohua; Agbodo, Nelson; Freeman, Vincent L; Wang, Shaowen; Anand, Padmanabhan (2021): Spatial accessibility of COVID-19 healthcare resources in Illinois, USA. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-6582453_V1
This dataset contains all the code, notebooks, datasets used in the study conducted to measure the spatial accessibility of COVID-19 healthcare resources with a particular focus on Illinois, USA. Specifically, the dataset measures spatial access for people to hospitals and ICU beds in Illinois. The spatial accessibility is measured by the use of an enhanced two-step floating catchment area (E2FCA) method (Luo & Qi, 2009), which is an outcome of interactions between demands (i.e, # of potential patients; people) and supply (i.e., # of beds or physicians). The result is a map of spatial accessibility to hospital beds. It identifies which regions need more healthcare resources, such as the number of ICU beds and ventilators. This notebook serves as a guideline of which areas need more beds in the fight against COVID-19. ## What's Inside A quick explanation of the components of the zip file * `COVID-19Acc.ipynb` is a notebook for calculating spatial accessibility and `COVID-19Acc.html` is an export of the notebook as HTML. * `Data` contains all of the data necessary for calculations: * `Chicago_Network.graphml`/`Illinois_Network.graphml` are GraphML files of the OSMNX street networks for Chicago and Illinois respectively. * `GridFile/` has hexagonal gridfiles for Chicago and Illinois * `HospitalData/` has shapefiles for the hospitals in Chicago and Illinois * `IL_zip_covid19/COVIDZip.json` has JSON file which contains COVID cases by zip code from IDPH * `PopData/` contains population data for Chicago and Illinois by census tract and zip code. * `Result/` is where we write out the results of the spatial accessibility measures * `SVI/`contains data about the Social Vulnerability Index (SVI) * `img/` contains some images and HTML maps of the hospitals (the notebook generates the maps) * `README.md` is the document you're currently reading! * `requirements.txt` is a list of Python packages necessary to use the notebook (besides Jupyter/IPython). You can install the packages with `python3 -m pip install -r requirements.txt`
keywords:
COVID-19; spatial accessibility; CyberGISX
published: 2021-05-14
Gramig, Benjamin; Khanna, Madhu; Jain, Atul (2021): An Assessment of the Impacts of Climate Change in Illinois, Chapter 4: Climate Change Impacts on Agriculture, Supplemental Materials. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8285949_V1
This document contains the Supplemental Materials for Chapter 4: Climate Change Impacts on Agriculture from the report "An Assessment of the Impacts of Climate Change in Illinois" published in 2021.
keywords:
Illinois; climate change; agriculture; impacts; adaptation; crop yield; ISAM; econometrics; days suitable for fieldwork
published: 2021-03-10
Trivellone, Valeria; Wei, Wei; Filippin, Luisa; Dietrich, Christopher H (2021): FASTA file of the final sequence alignment used in the phylogenetic analyses of Phytoplasmas detected in leafhoppers. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2694515_V1
The PhytoplasmasRef_Trivellone_etal.fas fasta file contains the original final sequence alignment used in the phylogenetic analyses of Trivellone et al. (Ecology and Evolution, in review). The 27 sequences (21 phytoplasma reference strains and 6 phytoplasmas strains from the present study) were aligned using the Muscle algorithm as implemented in MEGA 7.0 with default settings. The final dataset contains 952 positions of the F2n/R2 fragment of the 16S rRNA gene. The data analyses are further described in the cited original paper.
keywords:
Hemiptera; Cicadellidae; Mollicutes; Phytoplasma; biorepository
published: 2021-05-14
Miller, Jim; Czesny, Sergiusz; Dai, Qihong; Ellis, James; Iverson, Louis; Matthews, Jeff; Roswell, Charlie; Suski, Cory; Taft, John; Ward, Mike (2021): An Assessment of the Impacts of Climate Change in Illinois, Chapter 6: Climate Change Impacts on Ecosystems, Supplement 6.1: Scientific and Common Species Names. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-9049988_V1
Please cite as: Jim Miller, Sergiusz Czesny, Qihong Dai, James Ellis, Louis Iverson, Jeff Matthews, Charles Roswell, Cory Suski, John Taft, and Mike Ward. 2021. “Climate Change Impacts on Ecosystems: Scientific and Common Species Names”.
keywords:
Scientific names; Common names; Illinois species
published: 2021-05-14
Iverson, Louis (2021): An Assessment of the Impacts of Climate Change in Illinois, Chapter 6: Climate Change Impacts on Ecosystems, Supplemental Forest Data. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3459813_V1
Supplemental Forest Data for Chapter 6: Climate Change Impacts on Ecosystems in "An Assessment of the Impacts of Climate Change in Illinois"
published: 2021-04-19
Xia, Yushu; Wander, Michelle (2021): Response of Soil Quality Indictors including β-glucosidase, Fluorescein Diacetate Hydrolysis and Permanganate Oxidizable Carbon. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-2865725_V3
Dataset compiled by Yushu Xia and Michelle Wander for the Soil Health Institute. Data were recovered from peer reviewed literature reporting results for three soil quality indicators (SQIs) (β-glucosidase (BG), fluorescein diacetate (FDA) hydrolysis, and permanganate oxidizable carbon (POXC)) in terms of their relative response to management where soils under grassland cover, no-tillage, cover crops, residue return and organic amendments were compared to conventionally managed controls. Peer-reviewed articles published between January of 1990 and May 2018 were searched using the Thomas Reuters Web of Science database (Thomas Reuters, Philadelphia, Pennsylvania) and Google Scholar to identify studies reporting results for: “β-glucosidase”, “permanganate oxidizable carbon”, “active carbon”, “readily oxidizable carbon”, and “fluorescein diacetate hydrolysis”, together with one or more of the following: “management practice”, “tillage”, “cover crop”, “residue”, “organic fertilizer”, or “manure”. Records were tabulated to compare SQI abundance in soil maintained under a control and soil aggrading practice with the intent to contribute to SQI databases that will support development of interpretive frameworks and/or algorithms including pedo-transfer functions relating indicator abundance to management practices and site specific factors. Meta-data include the following key descriptor variables and covariates useful for development of scoring functions: 1) identifying factors for the study site (location, year of initiation of study and year in which data was reported), 2) soil textural class, pH, and SOC, 3) depth and timing of soil sampling, 4) analytical methods for SQI quantification, 5) units used in published works (i.e. equivalent mass, concentration), 6) SQI abundances, and 7) statistical significance of difference comparisons. *Note: Blank values in tables are considered unreported data.
keywords:
Soil health promoting practices; Soil quality indicators; β-glucosidase; fluorescein diacetate hydrolysis; Permanganate oxidizable carbon; Greenhouse gas emissions; Scoring curves; Soil Management Assessment Framework
published: 2021-05-09
Zuckermann, Federico (2021): Bacillus-based direct-fed microbial reduces the pathogenic synergy of a co-infection with Salmonella enterica serovar Choleraesuis and porcine reproductive and respiratory syndrome virus. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0439780_V1
Raw data and its analysis collected from a trial designed to test the impact of providing a Bacillus-based direct-fed microbial (DFM) on the syndrome resulting from orally infecting pigs with either Salmonella enterica serotype Choleraesuis (S. Choleraesuis) alone, or in combination with an intranasal challenge, three days later, with porcine reproductive and respiratory syndrome virus (PRRSV).
keywords:
excel file
published: 2022-04-19
Saleh, Ehsan; Ghaffari, Saba; Forsyth, David; Yu-Xiong, Wang (2022): Dataset for On the Importance of Firth Bias Reduction in Few-Shot Classification. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1016367_V1
This data repository includes the features and the trained backbone parameters used in the ICLR 2022 Paper "On the Importance of Firth Bias Reduction in Few-Shot Classification". The code accompanying this data is open-source and available at https://github.com/ehsansaleh/firth_bias_reduction The code and the data have three modules: 1. The "code_firth" module (10 files) relates to the basic ResNet backbones and logistic classifiers (e.g., Figures 2 and 3 in the main paper). 2. The "code_s2m2rf" module (2 files) relates to the S2M2R feature backbones and cosine classifiers (e.g., Figure 4 in the main paper). 3. The "code_dcf" module (3 files) relates to the few-shot Distribution Calibration (DC) method (e.g., Table 1 in the main paper). The relevant files for each module have the module name as a prefix in their name. 1. For instance, the "code_dcf_features.tar" file should be placed at the "features" directory of the "code_dcf" module. 2. As another example, "code_firth_features_cifarfs_novel.tar" should be placed in the "features" directory of the "code_firth" module, and it includes the features extracted from the novel split of mini-ImageNet dataset. Each tar-ball should be extracted in its relevant directory, and the md5 check-sums of the extracted files are also provided in the open-source code repository for verification. Please note that the actual datasets of images are not included here (since we do not own those datasets). However, helper scripts for automatically downloading the original datasets are also provided in the every module and sub-directory of the GitHub code repository.
keywords:
Computer Vision; Few-Shot Classification; Few-Shot Learning; Firth Bias Reduction
published: 2023-03-27
Littlefield, Alexander; Xie, Dajie; Richards, Corey; Ocier, Christian; Gao, Haibo; Messinger, Jonah; Ju, Lawrence; Gao, Jingxing; Edwards, Lonna; Braun, Paul; Goddard, Lynford (2023): Data for Enabling High Precision Gradient Index Control in Subsurface Multiphoton Lithography. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3190140_V1
This dataset contains the full data used in the paper titled "Enabling High Precision Gradient Index Control in Subsurface Multiphoton Lithography," available at https://doi.org/10.1021/acsphotonics.2c01950 . The data used for Table 1 can be found in the dataset for the related Figure 8. Some supplemental figures' data can be found in the main figures data: Figure S2's data is contained in Figure 6. Figure S4 and Table S1 data is derived from Figure 6. Figure S9 is derived from Figure 7. Figure S10 is contained in Figure 7. Figure S12 is derived from Figure 6 and the Python code prism-fringe-analysis. Figures without a data file named after them do not have any data affiliated with them and are purely graphical representations.
published: 2020-07-16
Mishra, Shubhanshu (2020): SocialMediaIE Tutorial Material for IC2S2 2020. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0851257_V1
Dataset to be for SocialMediaIE tutorial
keywords:
social media; deep learning; natural language processing
published: 2020-08-01
Rhoads, Bruce ; Lewis, Quinn; Sukhodolov, Alexander; Constantinescu, George (2020): Mixing Data for Three Small Confluences in East Central Illinois. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-1255710_V1
This data set includes information used to determine patterns of mixing at three small confluences in East Central Illinois based on differences in the temperature or turbidity of the two confluent flows.
keywords:
mixing; confluences; flow structure
published: 2020-10-15
Khanna, Madhu; Wang, Weiwei; Wang, Michael (2020): BEPAM Model Code and CABBI Simulation Results for "Assessing the Additional Carbon Savings with Biofuel". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4272529_V1
This dataset consists of various input data that are used in the GAMS model. All the data are in the format of .inc which can be read within GAMS or Notepad. Main data sources include: acreage data (acre), crop budget data ($/acre), crop yield data (e.g. bushel/acre), Soil carbon sequestration data (KgCO2/ha/yr). Model details can be found in the "Assessing the Additional Carbon Savings with Biofuel" and GAMS model package. ## File Description (1) GAMS Model.zip: This includes all the input files and scripts for running the model (2) Table*.csv: These files include the data from the tables in the manuscript (3) Figure2_3_4.csv: This contains the data used to create the figures in the manuscript (4) BaselineResults.csv: This includes a summary of the model results. (5) SensitivityResults_*.csv: Model results from the various sensitivity analyses performed (6) LUC_emission.csv: land use change emissions by crop reporting district for changes of pasturelands to annual crops.
keywords:
Biogenic carbon intensity; Corn ethanol; Economic model; Dynamic optimization; Anticipated baseline approach; Life cycle carbon intenisty
published: 2020-10-14
Dalling, James W.; Heineman, Katherine D. (2020): Multiple stem and environmental variables dataset. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4045402_V1
Data on permanent plots at Fortuna and the Panama Canal Watershed, Republic of Panama, containing counts and percent of trees with one or more multiple stems >10cm diameter, with and without palms. Accompanying environmental data includes elevation, precipitation, soil type and soil chemical variables (pH, total N, NO3, NO4, resin P, mehlich Ca, K and Mg.
keywords:
multiple stems; resprouting; Panama Canal Watershed; Fortuna Forest Reserve
published: 2021-02-18
Wang, Shaowen; Lyu, Fangzheng; Wang, Shaohua; Catlet, Charles; Padmanabhan, Anand; Soltani, Kiumars (2021): Data for Integrating CyberGIS and Urban Sensing for Reproducible Streaming Analytics. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0286574_V1
Increasingly pervasive location-aware sensors interconnected with rapidly advancing wireless network services are motivating the development of near-real-time urban analytics. This development has revealed both tremendous challenges and opportunities for scientific innovation and discovery. However, state-of-the-art urban discovery and innovation are not well equipped to resolve the challenges of such analytics, which in turn limits new research questions from being asked and answered. Specifically, commonly used urban analytics capabilities are typically designed to handle, process, and analyze static datasets that can be treated as map layers and are consequently ill-equipped in (a) resolving the volume and velocity of urban big data; (b) meeting the computing requirements for processing, analyzing, and visualizing these datasets; and (c) providing concurrent online access to such analytics. To tackle these challenges, we have developed a novel cyberGIS framework that includes computationally reproducible approaches to streaming urban analytics. This framework is based on CyberGIS-Jupyter, through integration of cyberGIS and real-time urban sensing, for achieving capabilities that have previously been unavailable toward helping cities solve challenging urban informatics problems. The files included in this dataset functions as follows: 1) Spatial_interpolation.ipynb is a python based Jupyter notebook that enables users to conduct spatial interpolation with AoT data; 2) Urban_Informatics.ipynb is a Jupyter notebook that helps to explore the AoT dataset; 3) chicago-complete.weekly.2019-09-30-to-2019-10-06.tar includes all the high-frequency urban sensing data from AoT sensors from 2019 September 30th to 2019 October 6th collected in Chicago, US; 4) sensors.csv is a processed dataset including information about the temperature in Chicago, and it is used in Spatial_interpolation.ipynb.
keywords:
CyberGIS; Urban informatics; Array of Things
published: 2020-05-17
Mishra, Sudhanshu; Prasad, Shivangi; Mishra, Shubhanshu (2020): Trained models for Multilingual Joint Fine-tuning of Transformer models for identifying Trolling, Aggression and Cyberbullying at TRAC 2020. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8882752_V1
Models and predictions for submission to TRAC - 2020 Second Workshop on Trolling, Aggression and Cyberbullying Our approach is described in our paper titled: Mishra, Sudhanshu, Shivangi Prasad, and Shubhanshu Mishra. 2020. “Multilingual Joint Fine-Tuning of Transformer Models for Identifying Trolling, Aggression and Cyberbullying at TRAC 2020.” In Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying (TRAC-2020). The source code for training this model and more details can be found on our code repository: https://github.com/socialmediaie/TRAC2020 NOTE: These models are retrained for uploading here after our submission so the evaluation measures may be slightly different from the ones reported in the paper.
keywords:
Social Media; Trolling; Aggression; Cyberbullying; text classification; natural language processing; deep learning; open source;
published: 2020-05-12
Eick, Brian (2020): Acceleration and strain data for free vibration of pre-tensioned, partially-submerged beams. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7897650_V1
The data provided herein is accelerometer and strain data taken from free vibration response of pre-tensioned, partially submerged steel beam specimens (modulus of elasticity assumed = 29,000 ksi). The specimens were subjected to various levels of pre-tension, and various levels of submersion in water. The purpose of the testing was to quantify the effects of partial submersion on the vibrating frequencies of pretensioned beams. Three specimens were tested, each with different cross section (but identical cross-sectional area). The different cross sections allow investigation of the effects of specimen width as the specimen vibrates through water. The testing procedure was as follows: 1) Apply a specified level of tension in the beam. Measure tension via 3 strain gages. 2) Submerge the specimens to a specified depth of water 3) Excite the beams with either a hammer impact or a pull-and-release method (physically pull the middle of the bar and quickly release) 4) Measure the free vibration of the beam with 2 accelerometers. Schematic drawings of the test setup and the test specimens are provided, as is a picture of the test setup.
keywords:
free vibration; beam; partially-submerged; prestressed;
published: 2020-05-30
Long, Stephen Patrick (2020): Original data for "Light, Not Age, Underlies the Q9 Maladaptation of Maize and Miscanthus Photosynthesis to Self-Shading". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4821336_V1
Original leaf gas exchange and absorptance data used in the Collison et al. (2020) Light, Not Age, Underlies the Q9 Maladaptation of Maize and Miscanthus Photosynthesis to Self-Shading - Frontiers in Plant Science doi: 10.3389/fpls.2020.00783
keywords:
C4 photosynthesis; canopy; bioenergy; food security; quantum yield; shade acclimation; photosynthetic light-use efficiency; leaf aging
published: 2020-07-01
Rykhlevskii, Andrei; Huff, Kathryn D. (2020): SaltProc output for TAP MSR and MSBR online reprocessing depletion simulations. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-7364919_V1
keywords:
molten salt; fuel cycle; reprocessing; refueling
published: 2020-07-15
Molloy, Erin K. (2020): Data from: Supertree-like methods for genome-scale species tree estimation. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4004605_V1
This repository includes scripts and datasets for Chapter 6 of my PhD dissertation, " Supertree-like methods for genome-scale species tree estimation," that had not been published previously. This chapter is based on the article: Molloy, E.K. and Warnow, T. "FastMulRFS: Fast and accurate species tree estimation under generic gene duplication and loss models." Bioinformatics, In press. https://doi.org/10.1093/bioinformatics/btaa444. The results presented in my PhD dissertation differ from those in the Bioinformatics article, because I re-estimated species trees using FastMulRF and MulRF on the same datasets in the original repository (https://doi.org/10.13012/B2IDB-5721322_V1). To re-estimate species trees, (1) a seed was specified when running MulRF, and (2) a different script (specifically preprocess_multrees_v3.py from https://github.com/ekmolloy/fastmulrfs/releases/tag/v1.2.0) was used for preprocessing gene trees (which were then given as input to MulRF and FastMulRFS). Note that this preprocessing script is a re-implementation of the original algorithm for improved speed (a bug fix also was implemented). Finally, it was brought to my attention that the simulation in the Bioinformatics article differs from prior studies, because I scaled the species tree by 10 generations per year (instead of 0.9 years per generation, which is ~1.1 generations per year). I re-simulated datasets (true-trees-with-one-gen-per-year-psize-10000000.tar.gz and true-trees-with-one-gen-per-year-psize-50000000.tar.gz) using 0.9 years per generation to quantify the impact of this parameter change (see my PhD dissertation or the supplementary materials of Bioinformatics article for discussion).
keywords:
Species tree estimation; gene duplication and loss; statistical consistency; MulRF, FastRFS
published: 2021-10-11
Peng, Jianhao; Ochoa, Idoia (2021): ClonalKinetic Data and Intermediate Results of SimiC. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3975180_V1
This dataset contains the ClonalKinetic dataset that was used in SimiC and its intermediate results for comparison. The Detail description can be found in the text file 'clonalKinetics_Example_data_description.txt' and 'ClonalKinetics_filtered.DF_data_description.txt'. The required input data for SimiC contains: 1. ClonalKinetics_filtered.clustAssign.txt => cluster assignment for each cell. 2. ClonalKinetics_filtered.DF.pickle => filtered scRNAseq matrix. 3. ClonalKinetics_filtered.TFs.pickle => list of driver genes. The results after running SimiC contains: 1. ClonalKinetics_filtered_L10.01_L20.01_Ws.pickle => inferred GRNs for each cluster 2. ClonalKinetics_filtered_L10.01_L20.01_AUCs.pickle => regulon activity scores for each cell and each driver gene. <b>NOTE:</b> “ClonalKinetics_filtered.rds” file which is mentioned in “ClonalKinetics_filtered.DF_data_description.txt” is an intermediate file and the authors have put all the processed in the pickle/txt file as described in the filtered data text.
keywords:
GRNs;SimiC;RDS;ClonalKinetic
published: 2024-01-04
Blind-Doskocil, Leanne; Trapp, Robert J.; Nesbitt, Stephen W. (2024): Radar analyzed quasi-linear convective system mesovortices during the Propagation, Evolution, and Rotation in Linear Storms (PERiLS) Project. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-3906187_V1
This is a collection of 31 quasi-linear convective system (QLCS) mesovortices (MVs) that were manually identified and analyzed using the lowest elevation scan of the nearest relevant Weather Surveillance Radar–1988 Doppler (WSR-88D) during the two years (springs of 2022 and 2023) of the Propagation, Evolution, and Rotation in Linear Storms (PERiLS) field campaign. Throughout the two years of PERiLS, a total of nine intensive observing periods (IOPs) occurred (see https://catalog.eol.ucar.edu/perils_2022/missions and https://catalog.eol.ucar.edu/perils_2023/missions for exact IOP dates/times). However, only six of these IOPs (specifically, IOPs 2, 3, and 4 from both years) are included in this dataset. The inclusion criteria were based on the presence of strictly QLCS MVs within the C-band On Wheels (COW) domain, one of the research radars deployed in the field for the PERiLS project. Further details on how MVs were identified are provided below. This analysis was completed using the Gibson Ridge radar-viewing software (GR2Analyst). Each MV had to be produced by a QLCS, defined as a continuous area of 35 dBZ radar reflectivity over at least 100 km when viewed from the lowest elevation scan. The MVs analyzed also had to pass through/near the COW’s domain at some point during their lifetimes to allow for additional analysis using the COW data. Tornadic (TOR), wind-damaging (WD), and non-damaging (ND) MVs were analyzed. ND MVs were ones that usually had a tornado warning placed on them but did not produce any damage and persisted for five or more radar scans; this was done to target the strongest MVs that forecasters thought could be tornadic. The QLCS MVs were identified using objective criteria, which included the existence of a circulation with a maximum differential velocity (dV; i.e., the difference between the maximum outbound and minimum inbound velocities at a constant range) of at least 20 kt over a distance ≤ 7 km. The following radar-based characteristics were catalogued for each QLCS MV at the lowest elevation angle of the nearest WSR-88D: latitude and longitude locations of the MV, the genesis to decay time of the MV, the maximum dV across the MV, the maximum rotational velocity (Vrot; i.e., dV divided by two), diameter of the MV, the range from the radar of the MV center, and the height above radar level of the MV center. In the Excel sheet, there are a total of 37 sheets. 32 of the 37 sheets are for each MV that was examined. One of those MVs (sheet titled 'EFU_tor_iop3') was not included in the final count of MVs (31). This MV produced an EFU tornado and only tornadoes that were given ratings were used to calculate MV statistics. The 31 MV sheets that were used to calculate MV statistics are labeled following the convention 'mv#_iop#_qlcs'. ‘mv#’ is the unique number that was assigned to each MV for clear identification, 'iop#' is the IOP in which the MV occurred, 'qlcs' denotes that the MV was produced by a QLCS, and the 2023 IOPs are denoted by ‘_2023’ after ‘qlcs’ in the sheet name. In these sheets, there are notes on what was visually seen in the radar data, damage associated with each MV (using the National Centers for Environmental Information (NCEI) database), and the characteristics of the MV at each time step of its lifetime. The yellow rows in each of the sheets indicate the last row of data included in the pretornadic, predamaging (wind damage), and pre-nondamaging statistics. The orange boxes in the notes column indicate any reports that were in NCEI but not in GR2Analyst. There are also sheets that examine pretornadic and predamaging diameter trends, box and whisker plot statistics of the overall characteristics of the different types of MVs, and the overall characteristics of each MV, with one Excel sheet (‘combined_qlcs_mvs’) examining the characteristics of each MV over its entire lifetime and one Excel sheet (‘combined_qlcs_mvs_before_report’) examining the characteristics of each MV before it first produced damage or had a tornado warning placed on it.
keywords:
quasi-linear convective system; QLCS; tornado; radar; mesovortex; PERiLS; low-level rotation; tornadic; nontornadic; wind-damaging; Propagation, Evolution, and Rotation in Linear Storms; tornado warning; C-band On Wheels
published: 2024-05-23
Xing, Yuqing; Bae, Seokjin; Ritz, Ethan; Yang, Fan; Birol, Turan; Salinas , Andrea N. Capa ; Ortiz, Brenden R.; Wilson , Stephen D.; Wang, Ziqiang; Fernandes, Rafael M.; Madhavan, Vidya (2024): Data for manuscript entitled "Optical Manipulation of the Charge Density Wave state in RbV3Sb5". University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-4197245_V1
This dataset consists of all the figure files that are part of the main text and supplementary of the manuscript titled "Optical manipulation of the charge density wave state in RbV3Sb5". For detailed information on the individual files refer to the readme file.
keywords:
kagome superconductor; optics; charge density wave
published: 2024-08-06
Xing, Yuqing; Bae, Seokjin; Madhavan, Vidya (2024): Raw topographies for the study of magnetic field response of RbV3Sb5. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-0274244_V1
This is the raw topographies (without linear background subtraction) related to the publication: https://www.nature.com/articles/s41586-024-07519-5