Illinois Data Bank Dataset Search Results
Results
published:
2024-05-13
Gopalakrishnappa, Chandana; Li, Zeqian; Kuehn, Seppe
(2024)
Supplemental data for the paper titled 'Environmental modulators of algae-bacteria interactions at scale'. Each of the excel workbooks corresponding to datasets 1, 2, and 3 contain a README sheet explaining the reported data. Dataset 4 comprising microscopy data contains a README text file describing the image files.
keywords:
Algae-bacteria interactions; high-throughput; microfluidic-droplet platform
published:
2022-12-31
Maffeo, Christopher; Wilson, Jim; Quednau, Lauren; Aksimentiev, Aleksei
(2022)
Trajectory data for Nature Nanotechnology manuscript "DNA double helix, a tiny electromotor" that demonstrates how an electric field applied along the helical axis of a DNA or RNA molecule will generate an electroosmotic flow that causes the duplex to spin about that axis, much like a turbine.
keywords:
All-atom MD simulation; DNA; nanotechnology; motors and rotors
published:
2021-05-14
This is the complete dataset for the "Anomalous density fluctuations in a strange metal" Proceedings of the National Academy of Sciences publication (https://doi.org/10.1073/pnas.1721495115). This is an integration of the Zenodo dataset which includes raw M-EELS data.
<b>METHODOLOGICAL INFORMATION</b>
1. Description of methods used for collection/generation of data: Data have been collected with a M-EELS instrument and according to the data acquisition protocol described in the original PNAS publication and in SciPost Phys. 3, 026 (2017) (doi: 10.21468/SciPostPhys.3.4.026)
2. Methods for processing the data: Raw data were collected with a channeltron-based M-EELS apparatus described in the reference PNAS publication and analyzed according to the procedure outlined both in the PNAS paper and in SciPost Phys. 3, 026 (2017) (doi: 10.21468/SciPostPhys.3.4.026). The raw M-EELS spectra at each momentum have been subject to minor data processing involving:
(a) averaging of different acquisitions at the same conditions,
(b) energy binning,
(c) division of an effective Coulomb matrix element (which yields a structure factor S(q,\omega)),
(d) antisymmetrization (which yields the imaginary chi)
All these procedures are described in the PNAS paper.
3. Instrument- or software-specific information needed to interpret the data: These data are simple .txt or .dat files which can be read with any standard data analysis software, notably Python notebooks, MatLab, Origin, IgorPro, and others. We do not include scripts in order to provide maximum flexibility.
4. Relationship between files, if important: We divided in different folders raw data, structure factors and imaginary chi.
<b>DATA-SPECIFIC INFORMATION</b>
There are 8 folders within the Data_public_deposition_v1.zip. Each folder contain data needed to create the corresponding figure in the publication.
<b>1. Fig1:</b> This folder contains 21 DAT files needed to plot the theory data in panels C and D, following this naming conventions:
[chiA]or[chiB]or[Pi]_q_number.dat
With chiA is the imaginary RPA charge susceptibility with a Coulomb interaction of electronically weakly coupled layers
chiB is the imaginary RPA charge susceptibility with the usual 4\pi e^2/q^2 Coulomb interaction.
Pi is the imaginary Lindhard polarizability.
q is momentum in reciprocal lattice units
Number is the numerical momentum value in reciprocal lattice units
<b>2. Fig2:</b> Files needed to plot Fig. 2 of the PNAS paper. Contains 3 folders as listed below. The files in this folder are named following this convention: Bi2212_295K_(1,-1)_50eV_161107_q_number_2.16_avg.dat,
295K is the sample temperature
(1,-1) is the momentum direction in reciprocal lattice units
50 eV is the incident e beam energy
161107 is the start date of the experiment in yymmdd format
Q is the momentum
Number is the momentum in reciprocal lattice units
2.16 is the energy range covered by the data in eV
Avg identifies averaged data
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
<b>3. Fig3:</b> Files needed to plot Fig. 3 of the PNAS paper. OP/ OD prefix identifies optimally doped or overdosed sample data, respectively.
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
<b>4. Fig4:</b> Files needed to plot Fig. 4 of the PNAS paper. The _fit_parameters.dat file contains the fit parameters extracted according to the fit procedure described in the manuscript and at all momenta.
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
<b>5. FigS1:</b> Files needed to plot Fig. S1 of the PNAS paper. There are 5 files in this folder. DAT files are M-EELS data following the prior naming convention, while the two .txt files are digitized data from N. Nücker, U. Eckern, J. Fink, and P. Müller, Long-Wavelength Collective Excitations of Charge Carriers in High-Tc Superconductors, Phys. Rev. B 44, 7155(R) (1991), and K. H. G. Schulte, The interplay of Spectroscopy and Correlated Materials, Ph.D. thesis, University of Groningen (2002).
<b>6. FigS2:</b> Files needed to plot Fig. S2 of the PNAS paper.
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
<b>7. FigS3:</b> Files needed to plot Fig. S3 of the PNAS paper. There are 2 files in this folder:
20K_phi_0_q_0.dat: is a M-EELS raw intensity at zero momentum transfer on Bi2212 at 20 K
295K_phi_0_q_0.dat: is a M-EELS raw intensity at zero momentum transfer on Bi2212 at 295 K
<b>8. FigS4:</b> Files needed to plot Fig. S4 of the PNAS paper. The _fit_parameters.dat file contains the fit parameters extracted according to the fit procedure described in the manuscript and at all momenta.
ImChi: is the imaginary susceptibility obtained by antisymmetryzing the structure factor
Raw_avg_data: raw averaged M-EELS spectra
Sqw: Structure factors derived from the M-EELS spectra
keywords:
Momentum resolved electron energy loss spectroscopy (M-EELS); cuprates; plasmons; strange metal
published:
2022-05-26
Madhavan, Vidya; Aishwarya, Anuva
(2022)
The data files are for the paper entitled: Long-lifetime spin excitations near domain walls in 1T-TaS2 to be published in PNAS. The data was obtained on a 300 mK custom designed Unisoku scanning tunneling microscope using the Nanonis module. All the data files have been named based on the Figure numbers that they represent.
keywords:
Mott Insulator; Spins; Charge Density Wave; Domain walls; Long lifetime
published:
2022-08-06
Madhavan, Vidya; Aishwarya, Anuva
(2022)
This dataset consists of all the files and codes that are part of the manuscript (main text and supplement) titled "Spin-selective tunneling from nanowires of the candidate topological Kondo insulator SmB6". For detailed information on the individual files refer to the specific readme files.
keywords:
Topology; Kondo Inuslator; Spin; Scanning tunneling microscopy; antiferromagnetism
published:
2018-08-29
This dataset contains best estimates of the particle size distribution and measurements of the radar reflectivity factor and total water content for instances where ground-based radar and airborne microphysical observations were considered collocated with each other.
keywords:
MC3E; MCS; GPM; microphysics; radar; aircraft; ice
published:
2022-02-07
Karakoc, Deniz Berfin; Wang, Junren; Konar, Megan
(2022)
This dataset provides estimates of agricultural and food commodity flows [kg] between all county pairs within the United States for the years 2007, 2012, and 2017. The database provides 206.3 million data points, since pairwise information is provided between 3134 counties, for 7 commodity categories, and 3 time periods. The commodity categories correspond to the Standardized Classification of Transported Goods and are:
- SCTG 1: Iive animals and fish
- SCTG 2: cereal grains
- SCTG 3: agricultural products (except for animal feed, cereal grains, and forage products)
- SCTG 4: animal feed, eggs, honey, and other products of animal origin
- SCTG 5: meat, poultry, fish, seafood, and their preparations
- SCTG 6: milled grain products and preparations, and bakery products
- SCTG 7: other prepared foodstuffs, fats and oils
For additional information, please see the related paper by Karakoc et al. (2022) in Environmental Research Letters.
keywords:
food flows; high-resolution; county-scale; time-series; United States
published:
2021-04-12
Urco Cordero, Juan M.; Kamalabadi, Farzad; Kamaci, Ulas; Harding, Brian J.; Frey, Harald U.; Mende, Stephen B.; Huba, Joe D.; England, Scott L.; Immel, Thomas J.
(2021)
Conjugate photoelectron energy spectra derived from coincident FUV and radio measurements. These are outputs of simulations from the semi-empirical SAMI2-PE (Varney et al. 2012) for the night of January 4, 2020.
keywords:
Conjugate photoelectrons, SAMI2-PE, ICON
published:
2024-04-10
Konar, Megan; Ruess, Paul J.; Wanders, Niko; Bierkens, Marc F.P.
(2024)
This dataset provides estimates of total Irrigation Water Use (IWU) by crop, county, water source, and year for the Continental United States. Total irrigation from Surface Water Withdrawals (SWW), total Groundwater Withdrawals (GWW), and nonrenewable Groundwater Depletion (GWD) is provided for 20 crops and crop groups from 2008 to 2020 at the county spatial resolution.
In total, there are nearly 2.5 million data points in this dataset (3,142 counties; 13 years; 3 water sources; and 20 crops). This dataset supports the paper by Ruess et al (2024) "Total irrigation by crop in the Continental United States from 2008 to 2020", Scientific Data, doi: 10.1038/s41597-024-03244-w
When using, please cite as:
Ruess, P.J., Konar, M., Wanders, N., and Bierkens, M.F.P. (2024) Total irrigation by crop in the Continental United States from 2008 to 2020, Scientific Data, doi: 10.1038/s41597-024-03244-w
keywords:
water use; irrigation; surface water; groundwater; groundwater depletion; counties; crops; time series
published:
2022-06-15
Wong, Tony; Oudshoorn, Luuk; Sofovich, Eliyahu; Green, Alex; Shah, Charmi; Indebetouw, Remy; Meixner, Margaret; Hacar, Alvaro; Nayak, Omnarayani; Tokuda, Kazuki; Bolatto, Alberto D.; Chevance, Melanie; De Marchi, Guido; Fukui, Yasuo; Hirschauer, Alec S.; Jameson, K. E.; Kalari, Venu; Lebouteiller, Vianney; Looney, Leslie W.; Madden, Suzanne C.; Onishi, Toshikazu; Roman-Duval, Julia; Rubio, Monica; Tielens, A. G. G. M.
(2022)
12CO and 13CO emission maps of the 30 Doradus molecular cloud in the Large Magellanic Cloud, obtained with the Atacama Large Millimeter/submillimeter Array (ALMA) during Cycle 7. See the associated article in the Astrophysical Journal, and README file, for details. Please cite the article if you use these data.
keywords:
Radio astronomy
published:
2025-08-13
Tang, Wenhan; Arabas, Sylwester; Curtis, Jeffrey H.; Knopf, Daniel A.; West, Matthew; Riemer, Nicole
(2025)
This dataset contains the values directly shown in the figures of the article "The impact of aerosol mixing state on immersion freezing: Insights from classical nucleation theory and particle-resolved simulations". This article is in preparation for submission to the journal Atmospheric Chemistry and Physics. The dataset consists of 15 NetCDF files processed from the raw output of the PartMC model. It does not include the theoretical values of frozen fraction, which can be computed using the equations provided in the paper.
keywords:
Aerosol mixing state; Ice nucleating particles; Classical nucleation theory
published:
2024-07-28
Xing, Yuqing; Bae, Seokjin; Madhavan, Vidya
(2024)
This is a set of topographies to study the magnetic field response of RbV3Sb5 (related to Fig.4 of https://www.nature.com/articles/s41586-024-07519-5)
published:
2020-10-27
keywords:
Phase equilibria; Granite; Quartz; Feldspar
published:
2021-02-10
Stickley, Samuel; Fraterrigo, Jennifer
(2021)
This dataset consists of microclimatic temperature and vegetation structure maps at a 3-meter spatial resolution across the Great Smoky Mountains National Park. Included are raster models for sub-canopy, near-surface, minimum and maximum temperature averaged across the study period, season, and month during the growing season months of March through November from 2006-2010. Also available are the topographic and vegetation inputs developed for the microclimate models, including LiDAR-derived vegetation height, LiDAR-derived vegetation structure within four height strata, solar insolation, distance-to-stream, and topographic convergence index (TCI).
keywords:
microclimate buffering; forest vegetation structure; temperature; Appalachian Mountains; climate downscaling; understory; LiDAR
published:
2021-08-15
Felix, Hanau; Hannes, Rost; Ochoa, Idoia
(2021)
This data set contains mass spectrometry data used for the publication "mspack: efficient lossless and lossy mass spectrometry data compression".
keywords:
mass-spectrometry data; compression; proteomics
published:
2022-01-31
Dominguez, Francina
(2022)
This dataset contains results from WRF simulations over northern South America. The Orinoco Low-Level Jet (OLLJ) and the Cross-Equatorial Moisture Transport are important circulation structures of the climate of tropical South America. We explore the sensitivity of the OLLJ and cross-equatorial transport to the representation of surface fluxes and turbulence by using two different Land Surface Model (LSM) schemes (Noah and CLM) and three Planetary Boundary Layer (PBL) schemes (YSU, QNSE and MYNN).
keywords:
WRF; Orinoco LLJ; preicpitation
published:
2024-11-15
BL30K is a synthetic dataset rendered using Blender with ShapeNet's data. We break the dataset into six segments, each with approximately 5K videos. The videos are organized in a similar format as DAVIS and YouTubeVOS, so dataloaders for those datasets can be used directly. Each video is 160 frames long, and each frame has a resolution of 768*512. There are 3-5 objects per video, and each object has a random smooth trajectory -- we tried to optimize the trajectories in a greedy fashion to minimize object intersection (not guaranteed), with occlusions still possible (happen a lot in reality). See [Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion (MiVOS), CVPR 2022] for details.
published:
2021-08-04
Sabrina, Sadia; Lewis, Quinn; Rhoads, Bruce
(2021)
This dataset contains data derived from large-scale particle velocimetry measurements obtained at the confluence of the Saline Branch and an unnamed tributary in Illinois. The data were collected using two cameras positioned about the confluence, one mounted on a cable and the other mounted on a tripod. A description of the content of the files can be found in Description of Files.rtf.
keywords:
confluence; hydrodynamics; LSPIV; flow structure; stagnation
published:
2022-07-10
Winogradoff, David; Chou, Han-Yi; Maffeo, Christopher; Aksimentiev, Aleksei
(2022)
keywords:
Nuclear pore complex; system files; trajectory files
published:
2025-03-13
ALMA Band 4 and 7 observations of the dust continuum in the Class 0 protostellar system L1448 IRS3B. We include the selfcal script, imaging scripts, fits files, and the python scripts for the figures in the paper.
keywords:
ALMA; Band 4; Band 6; polarization; L1448 IRS3B
published:
2025-03-19
Bieri, Carolina A.; Dominguez, Francina; Miguez-Macho, Gonzalo; Fan, Ying
(2025)
This repository includes HRLDAS Noah-MP model output generated as part of Bieri et al. (2025) - Implementing deep soil and dynamic root uptake in Noah-MP (v4.5): Impact on Amazon dry-season transpiration.
These data are distributed in two different formats: Raw model output files and subsetted files that include data for a specific variable. All files are .nc format (NetCDF) and aggregated into .tar files to facilitate download. Given the size of these datasets, Globus transfer is the best way to download them.
Raw model output for four model experiments is available: FD (control), GW, SOIL, and ROOT. See the associated publication for information on the different experiments. These data span an approximately 20 year period from 01 Jun 2000 to 31 Dec 2019. The data have a spatial resolution of 4 km and a temporal frequency of 3 hours. These data are for a domain in the southern Amazon basin (see Figure 1 in the associated publication). Data for each experiment is available as a .tar file which includes 3-hourly NetCDF files. All default Noah-MP output variables are included in each file. As a result, the .tar files are quite large and may take many hours or even days to transfer depending on your network speed and local configurations. These files are named 'noahmp_output_2000_2019_EXP.tar', where EXP is the name of the experiment (FD, GW, SOIL, or ROOT).
Subsetted model output at a daily temporal resolution for all four model experiments is also available. These .tar files include the following variables: water table depth (ZWT), latent heat flux (LH), sensible heat flux (HFX), soil moisture (SOIL_M), canopy evaporation (ECAN), ground evaporation (EDIR), transpiration (ETRAN), rainfall rate at the surface (QRAIN), and two variables that are specific to the ROOT experiment: ROOTACTIVITY (root activity function) and GWRD (active root water uptake depth). There is one file for each variable within the tarred files. These files are named 'noahmp_output_subset_2000_2019_EXP.tar', where EXP is the name of the experiment (FD, GW, SOIL, or ROOT).
Finally, there is a sample dataset with raw 3-hourly output from the ROOT experiment for one day. The purpose of this sample dataset is to allow users to confirm if these data meet their needs before initiating a full transfer via Globus. This file is named 'noahmp_output_sample_ROOT.tar'.
The README.txt file provides information on the Noah-MP output variables in these datasets, among other specifications.
Information on HRLDAS Noah-MP and names/definitions of model output variables that are useful in working with these data are available here: http://dx.doi.org/10.5065/ew8g-yr95. Note that some output variables may be listed in this document under a different variable name, so searching for the long name (e.g. 'baseflow' instead of 'QRF') is recommended.
Information on additional output variables that were added to the model as part of this study is available here: https://github.com/bieri2/bieri-et-al-2025-EGU-GMD/tree/DynaRoot.
Model code, configuration files, and forcing data used to carry out the model simulations are linked in the related resources section.
keywords:
Land surface model; NetCDF
published:
2022-08-29
Winogradoff, David; Chou, Han-Yi; Maffeo, Christopher; Aksimentiev, Aleksei
(2022)
Example scripts and configuration files needed to perform select simulations described in the manuscript "Percolation transition prescribes protein size-specific barrier to passive transport through the nuclear pore complex."
keywords:
Nuclear Pore Complex; simulation setup
published:
2021-02-01
These datasets provide the basis of our analysis in the paper - The Potential Impact of a Clean Energy Society On Air Quality. All datasets here are from the model output (CAM4-chem). All the simulations were run to steady-state and only the outputs used in the analysis are archived here.
keywords:
clean energy; ozone; particulates
published:
2021-10-13
Lyu, Fangzheng; Xu, Zewei; Ma, Xinlin; Wang, Shaohua; Li, Zhiyu; Wang, Shaowen
(2021)
Drainage network analysis is fundamental to understanding the characteristics of surface hydrology. Based on elevation data, drainage network analysis is often used to extract key hydrological features like drainage networks and streamlines. Limited by raster-based data models, conventional drainage network algorithms typically allow water to flow in 4 or 8 directions (surrounding grids) from a raster grid. To resolve this limitation, this paper describes a new vector-based method for drainage network analysis that allows water to flow in any direction around each location. The method is enabled by rapid advances in Light Detection and Ranging (LiDAR) remote sensing and high-performance computing. The drainage network analysis is conducted using a high-density point cloud instead of Digital Elevation Models (DEMs) at coarse resolutions. Our computational experiments show that the vector-based method can better capture water flows without limiting the number of directions due to imprecise DEMs. Our case study applies the method to Rowan County watershed, North Carolina in the US. After comparing the drainage networks and streamlines detected with corresponding reference data from US Geological Survey generated from the Geonet software, we find that the new method performs well in capturing the characteristics of water flows on landscape surfaces in order to form an accurate drainage network.
This dataset contains all the code, notebooks, datasets used in the study conducted for the research publication titled " A Vector-Based Method for Drainage Network Analysis Based on LiDAR Data ".
## What's Inside
A quick explanation of the components
* `A Vector Approach to Drainage Network Analysis Based on LiDAR Data.ipynb` is a notebook for finding the drainage network based on LiDAR data
*`Picture1.png` is a picture representing the pseudocode of our new algorithm
* HPC` folder contains codes for running the algorithm with sbatch in HPC
** `execute.sh` is a bash script file that use sbatch to conduct large scale analysis for the algorithm
** `run.sh` is a bash script file that calls the script file `execute.sh` for large scale calculation for the algorithm
** `run.py` includes the codes implemented for the algorithm
* `Rowan Creek Data` includes data that are used in the study
** `3_1.las` and `3_2.las ` are the LiDAR data files that is used in our analysis presented in the paper. Users may use this data file to reproduce our results and may replace it with their own LiDAR file to run this method over different areas
** `reference` folder includes reference data from USGS
*** `reference_3_1.tif` and `reference_3_2.tif` are reference data for the drainage system analysis retrieved from USGS.
keywords:
CyberGIS; Drainage System Analysis; LiDAR
published:
2022-06-22
Kang, Jeon-Young; Farkhad, Bita Fayaz; Chan, Man-pui Sally; Michels, Alexander; Albarracin, Dolores; Wang, Shaowen
(2022)
This dataset helps to investigate the Spatial Accessibility to HIV Testing, Treatment, and Prevention Services in Illinois and Chicago, USA.
The main components are: population data, healthcare data, GTFS feeds, and road network data. The core components are:
1) `GTFS` which contains GTFS (<a href="https://gtfs.org/">General Transit Feed Specification</a>) data which is provided by Chicago Transit Authority (CTA) from <a href="https://developers.google.com/transit/gtfs">Google's GTFS feeds</a>. Documentation defines the format and structure of the files that comprise a GTFS dataset: <a href="https://developers.google.com/transit/gtfs/reference?csw=1">https://developers.google.com/transit/gtfs/reference?csw=1</a>.
2) `HealthCare` contains shapefiles describing HIV healthcare providers in Chicago and Illinois respectively. The services come from <a href="https://locator.hiv.gov/">Locator.HIV.gov</a>.
3) `PopData` contains population data for Chicago and Illinois respectively. Data come from The American Community Survey and <a href="https://map.aidsvu.org/map">AIDSVu</a>. AIDSVu (https://map.aidsvu.org/map) provides data on PLWH in Chicago at the census tract level for the year 2017 and in the State of Illinois at the county level for the year 2016. The American Community Survey (ACS) provided the number of people aged 15 to 64 at the census tract level for the year 2017 and at the county level for the year 2016. The ACS provides annually updated information on demographic and socio economic characteristics of people and housing in the U.S.
4) `RoadNetwork` contains the road networks for Chicago and Illinois respectively from <a href="https://www.openstreetmap.org/copyright">OpenStreetMap</a> using the Python <a href="https://osmnx.readthedocs.io/en/stable/">osmnx</a> package.
<b>The abstract for our paper is:</b>
Accomplishing the goals outlined in “Ending the HIV (Human Immunodeficiency Virus) Epidemic: A Plan for America Initiative” will require properly estimating and increasing access to HIV testing, treatment, and prevention services. In this research, a computational spatial method for estimating access was applied to measure distance to services from all points of a city or state while considering the size of the population in need for services as well as both driving and public transportation. Specifically, this study employed the enhanced two-step floating catchment area (E2SFCA) method to measure spatial accessibility to HIV testing, treatment (i.e., Ryan White HIV/AIDS program), and prevention (i.e., Pre-Exposure Prophylaxis [PrEP]) services. The method considered the spatial location of MSM (Men Who have Sex with Men), PLWH (People Living with HIV), and the general adult population 15-64 depending on what HIV services the U.S. Centers for Disease Control (CDC) recommends for each group. The study delineated service- and population-specific accessibility maps, demonstrating the method’s utility by analyzing data corresponding to the city of Chicago and the state of Illinois. Findings indicated health disparities in the south and the northwest of Chicago and particular areas in Illinois, as well as unique health disparities for public transportation compared to driving. The methodology details and computer code are shared for use in research and public policy.
keywords:
HIV;spatial accessibility;spatial analysis;public transportation;GIS