Illinois Data Bank Dataset Search Results
Results
published:
2025-12-09
Chase, Marissa H.; Fraterrigo, Jennifer M.; Charles, Brian; Harmon-Threatt, Alexandra
(2025)
The dataset includes bee community data from a study conducted down in southern Illinois across three forested public land sites. Bee diversity and abundance data, as well as environmental variables, are included for each plot. Each plot was visited a total of four times.
keywords:
wild bees; forest management; resource availability
published:
2025-04-26
Alvarez, Jennifer; Fraterrigo, Jennifer; Dalling, James
(2025)
Census data collected at Trelease Woods in 1936 with information on tree species, stem count, diameter at breast height (DBH), and basal area. The plot boundaries from the 1936 census were georeferenced to subset 2018 census data for a direct comparison between the two census years.
keywords:
old-growth; temperate forest; species composition; forest dynamics; historical data
published:
2025-11-07
Lee, Ye-Gi; Kang, Nam Kyu; Kim, Chanwoo; Tran, Vinh; Cao, Mingfeng; Yoshikuni, Yasuo; Zhao, Huimin; Jin, Yong-Su
(2025)
This study presents a cost-effective strategy for producing organic acids from glucose and xylose using the acid-tolerant yeast, Issatchenkia orientalis. I. orientalis was engineered to produce lactic acid from xylose, and the resulting strain, SD108XL, successfully converted sorghum hydrolysates into lactic acid. In order to enable low-pH fermentation, a self-buffering strategy, where the lactic acid generated by the SD108XL strain during fermentation served as a buffer, was developed. As a result, the SD108 strain produced 67 g/L of lactic acid from 73 g/L of glucose and 40 g/L of xylose, simulating a sugar composition of sorghum biomass hydrolysates. Moreover, techno-economic analysis underscored the efficiency of the self-buffering strategy in streamlining the downstream process, thereby reducing production costs. These results demonstrate the potential of I. orientalis as a platform strain for the cost-effective production of organic acids from cellulosic hydrolysates.
keywords:
Conversion;Gene Editing;Hydrolysate;Metabolic Engineering
published:
2021-02-01
These datasets provide the basis of our analysis in the paper - The Potential Impact of a Clean Energy Society On Air Quality. All datasets here are from the model output (CAM4-chem). All the simulations were run to steady-state and only the outputs used in the analysis are archived here.
keywords:
clean energy; ozone; particulates
published:
2025-11-15
Hassell Jr., James E.; Arellano Perez, Angel David; Vasudevan, Krithika; Ressler, Reed; Garcia, Gabriela; Parr, Madison; Vierkant, Valerie; Bayer, Hugo; Maren, Stephen
(2025)
This dataset contains all the raw data, figures, and Prism files corresponding to each experiment performed for the paper “Hippocampal ensembles regulate circuit-induced relapse of extinguished fear.”
keywords:
Hippocampal ensembles; circuit-induced relapse, fear extinction.
published:
2021-10-13
Lyu, Fangzheng; Xu, Zewei; Ma, Xinlin; Wang, Shaohua; Li, Zhiyu; Wang, Shaowen
(2021)
Drainage network analysis is fundamental to understanding the characteristics of surface hydrology. Based on elevation data, drainage network analysis is often used to extract key hydrological features like drainage networks and streamlines. Limited by raster-based data models, conventional drainage network algorithms typically allow water to flow in 4 or 8 directions (surrounding grids) from a raster grid. To resolve this limitation, this paper describes a new vector-based method for drainage network analysis that allows water to flow in any direction around each location. The method is enabled by rapid advances in Light Detection and Ranging (LiDAR) remote sensing and high-performance computing. The drainage network analysis is conducted using a high-density point cloud instead of Digital Elevation Models (DEMs) at coarse resolutions. Our computational experiments show that the vector-based method can better capture water flows without limiting the number of directions due to imprecise DEMs. Our case study applies the method to Rowan County watershed, North Carolina in the US. After comparing the drainage networks and streamlines detected with corresponding reference data from US Geological Survey generated from the Geonet software, we find that the new method performs well in capturing the characteristics of water flows on landscape surfaces in order to form an accurate drainage network.
This dataset contains all the code, notebooks, datasets used in the study conducted for the research publication titled " A Vector-Based Method for Drainage Network Analysis Based on LiDAR Data ".
## What's Inside
A quick explanation of the components
* `A Vector Approach to Drainage Network Analysis Based on LiDAR Data.ipynb` is a notebook for finding the drainage network based on LiDAR data
*`Picture1.png` is a picture representing the pseudocode of our new algorithm
* HPC` folder contains codes for running the algorithm with sbatch in HPC
** `execute.sh` is a bash script file that use sbatch to conduct large scale analysis for the algorithm
** `run.sh` is a bash script file that calls the script file `execute.sh` for large scale calculation for the algorithm
** `run.py` includes the codes implemented for the algorithm
* `Rowan Creek Data` includes data that are used in the study
** `3_1.las` and `3_2.las ` are the LiDAR data files that is used in our analysis presented in the paper. Users may use this data file to reproduce our results and may replace it with their own LiDAR file to run this method over different areas
** `reference` folder includes reference data from USGS
*** `reference_3_1.tif` and `reference_3_2.tif` are reference data for the drainage system analysis retrieved from USGS.
keywords:
CyberGIS; Drainage System Analysis; LiDAR
published:
2021-10-28
Suski, Cory; Curtis-Quick, Jocelyn
(2021)
Bigheaded carp were collected from the Illinois and Des Plaines Rivers, parts of the Illinois Waterway, from May to November 2018. A total of 93 fish were collected during sampling for a study comprised of 40 females, 41 males, and 12 unsexed fish. GC/MS metabolite profiling analysis detected 180 compounds.
Livers from carp at the leading edge had differences in energy use and metabolism, and suppression of protective mechanisms relative to downstream fish; differences were consistent across time. This body of work provides evidence that water quality is linked to carp movement in the Illinois River. As water quality in this region continues to improve, consideration of this impact on carp spread is essential to protect the Great Lakes.
keywords:
water quality; metabolites; range expansion; energy; contaminants
published:
2022-06-22
Kang, Jeon-Young; Farkhad, Bita Fayaz; Chan, Man-pui Sally; Michels, Alexander; Albarracin, Dolores; Wang, Shaowen
(2022)
This dataset helps to investigate the Spatial Accessibility to HIV Testing, Treatment, and Prevention Services in Illinois and Chicago, USA.
The main components are: population data, healthcare data, GTFS feeds, and road network data. The core components are:
1) `GTFS` which contains GTFS (<a href="https://gtfs.org/">General Transit Feed Specification</a>) data which is provided by Chicago Transit Authority (CTA) from <a href="https://developers.google.com/transit/gtfs">Google's GTFS feeds</a>. Documentation defines the format and structure of the files that comprise a GTFS dataset: <a href="https://developers.google.com/transit/gtfs/reference?csw=1">https://developers.google.com/transit/gtfs/reference?csw=1</a>.
2) `HealthCare` contains shapefiles describing HIV healthcare providers in Chicago and Illinois respectively. The services come from <a href="https://locator.hiv.gov/">Locator.HIV.gov</a>.
3) `PopData` contains population data for Chicago and Illinois respectively. Data come from The American Community Survey and <a href="https://map.aidsvu.org/map">AIDSVu</a>. AIDSVu (https://map.aidsvu.org/map) provides data on PLWH in Chicago at the census tract level for the year 2017 and in the State of Illinois at the county level for the year 2016. The American Community Survey (ACS) provided the number of people aged 15 to 64 at the census tract level for the year 2017 and at the county level for the year 2016. The ACS provides annually updated information on demographic and socio economic characteristics of people and housing in the U.S.
4) `RoadNetwork` contains the road networks for Chicago and Illinois respectively from <a href="https://www.openstreetmap.org/copyright">OpenStreetMap</a> using the Python <a href="https://osmnx.readthedocs.io/en/stable/">osmnx</a> package.
<b>The abstract for our paper is:</b>
Accomplishing the goals outlined in “Ending the HIV (Human Immunodeficiency Virus) Epidemic: A Plan for America Initiative” will require properly estimating and increasing access to HIV testing, treatment, and prevention services. In this research, a computational spatial method for estimating access was applied to measure distance to services from all points of a city or state while considering the size of the population in need for services as well as both driving and public transportation. Specifically, this study employed the enhanced two-step floating catchment area (E2SFCA) method to measure spatial accessibility to HIV testing, treatment (i.e., Ryan White HIV/AIDS program), and prevention (i.e., Pre-Exposure Prophylaxis [PrEP]) services. The method considered the spatial location of MSM (Men Who have Sex with Men), PLWH (People Living with HIV), and the general adult population 15-64 depending on what HIV services the U.S. Centers for Disease Control (CDC) recommends for each group. The study delineated service- and population-specific accessibility maps, demonstrating the method’s utility by analyzing data corresponding to the city of Chicago and the state of Illinois. Findings indicated health disparities in the south and the northwest of Chicago and particular areas in Illinois, as well as unique health disparities for public transportation compared to driving. The methodology details and computer code are shared for use in research and public policy.
keywords:
HIV;spatial accessibility;spatial analysis;public transportation;GIS
published:
2020-09-27
Vandewalle, Rebecca; Barley, William; Padmanabhan, Anand; Katz, Daniel S.; Wang, Shaowen
(2020)
This dataset contains R codes used to produce the figures submitted in the manuscript titled "Understanding the multifaceted geospatial software ecosystem: a survey approach". The raw survey data used to populate these charts cannot be shared due to the survey consent agreement.
keywords:
R; figures; geospatial software
published:
2025-10-02
Jin, Yong-Su; Rao, Christopher; Ye, Quanhui; Oh, Hyunjoon; Tohidifar, Payman; Koh, Hyun Gi
(2025)
For economic and sustainable biomanufacturing, the oleaginous yeast Rhodotorula toruloides has emerged as a promising platform for producing biofuels, pharmaceuticals, and other valuable chemicals. However, genetic manipulation of R. toruloides has been limited by its high GC content and the lack of a replicating plasmid, necessitating gene integration into the genome of the yeast. To address these challenges, we developed the RT-EZ (R. toruloides Efficient Zipper) toolkit, a versatile tool based on Golden Gate assembly, designed to streamline R. toruloides engineering with improved efficiency and flexibility. The RT-EZ toolkit simplifies vector construction by incorporating new features such as bidirectional promoters and 2A peptides, color-based screening using RFP, and sequences optimized for both Agrobacterium tumefaciens-mediated transformation (ATMT) and easy linearization, enabling straightforward selection and transformation. Notably, the RT-EZ kit can be used to construct an expression cassette with four different genes in one assembly reaction, significantly improving vector construction speed and efficiency. The utility of the RT-EZ toolkit was demonstrated through the successful synthesis of arachidonic acid in R. toruloides by coexpressing fatty acid elongases and desaturases. This result underscores the potential of the RT-EZ toolkit to advance synthetic biology in R. toruloides, providing a streamlined method for addressing genetic engineering challenges in the yeast.
keywords:
gene editing; genome engineering
published:
2018-08-16
Portier, Evan; Silver, Whendee; Yang, Wendy H.
(2018)
This dataset includes data on soil properties, soil N pools, and soil N fluxes presented in the manuscript, "Effects of an invasive perennial forb on gross soil nitrogen cycling and nitrous oxide fluxes," submitted to Ecology for peer-reviewed publication. Please refer to that publication for details about methodologies used to generate these data and for the experimental design.
keywords:
pepperweed; nitrogen cycling; nitrous oxide; invasive species; Bay Delta
published:
2021-06-24
Egan, Maximillian; Larsen, Ryan; Sadaghiani, Sepideh
(2021)
This dataset contains EEG and Temperature data acquired from inside the bore of an MRI scanner during scanning with two different types of fMRI sequences: single-band and and multi-band. The EEG data were acquired from the heads of adult humans undergoing scanning, and can be used to assess differences in EEG data quality due to sequence type. The temperature data were acquired from a watermelon phantom and can be used to assess heating differences due to sequence type.
keywords:
Simultaneous EEG-fMRI, Multi-band fMRI, Safety, Heating
published:
2020-11-20
Jaikumar, Nikhil; Clemente, Tom; Long, Steve; Ge, Zhengxiang; Changa, Timothy
(2020)
This data set explores the effect of the cyanobacterial gene ictB on photosynthesis in sorghum, under both normal greenhouse growing temperatures (32 C / 25 C) and during and after an 8 day chilling stress (10 C / 5 C). IctB is a cyanobacterial gene of unknown function, which was initially thought to be involved in inorganic carbon transport into cells. While ictB is known now not to be an independently active carbon transporter in its own right, it may play a role in passive diffusion of metabolites. This transgene was introduced into sorghum by the lab of Thomas Clemente, through Agrobacterium mediated transformation, alone and in combination with the tomato sedoheptulose-1,7-bisphosphatase (SBPase) gene. Eleven events (six double construct and five single construct ictB) were involved in this study. SBPase was included because some previous experiments in C3 species and some previous modeling work, as well as its position at a metabolic branch point, indicates it plays a role as a control point for photosynthesis. A chilling treatment was included because chilling is one of the most serious ecological factors limiting the range of C4 species.
Data includes gene expression, metabolomics (at normal growing temperature), SBPase enzyme activity, biomass and photosynthetic traits at both warm temperature and during and after chilling stress.
-----------------
EXPLANATORY NOTES FOR ICTB/SBPASE SORGHUM MANUSCRIPT
Data are organized into 10 worksheets, representing an expected 10 tables that will serve a supplementary role in the final publication. These include data on gene expression, metabolomics (at normal growing temperature), SBPase enzyme activity, biomass and photosynthetic traits at both warm temperature and during and after chilling stress.
<i><b>Tables are as follows:</i></b>
1. Event_Code: for Table S1. Event codes for events and constructs. Two constructs were generated for this study, and numerous transgenic “events” (i.e. independent transformations) were carried out for each construct. A construct represents the actual vector which was introduced into the plants (complete with promoter, gene of interest, marker gene, etc.) while an event represents a single successful introduction of the transgene. Events are uniquely labeled with letter and number strings but also with a four-digit number for ease of reference, this table explains which event corresponds to each four-digit number.
2. Photosynthetic_Data: for Table S2. Photosynthetic data at greenhouse growing temperature, for ictB single construct, ictB/SBPase double construct, and wild type lines. Five ictB and six ictB/SBPase events were included. Greenhouse growing temperature was approximately 32 °C and 25 °C night. Photosynthetic parameters were measured using a Licor 6400-XT, and included parameters related to carbon dioxide uptake, water loss, and chlorophyll fluorescence.
3. Chilling_Treatment: for Table S3. Photosynthetic response to chilling treatment, for ictB single construct, and wild type lines. Four ictB events were included. Chilling treatment lasted approximately 8 days and began either 3.5 or 5.5 weeks after transplanting the plants (chilling was done in two batches). Chilling treatment involved temperatures of 10 °C day / 7 °C night in growth chambers. Photosynthetic parameters were measured at several time points during and after the chilling treatment, were measured using a Licor 6400-XT, and included parameters related to carbon dioxide uptake, water loss, and chlorophyll fluorescence.
4. SBPase_Activity: for Table S4. SBPase activity in double construct plants. These data measure in vitro substrate-saturated activity of SBPase in desalted extracts from leaf tissues, at 25 °C. Units are micromoles of SBP processed per second per m2 of leaf tissue. Five ictB/SBPase events were included.
5. 2014_gene_exp: for Table S5. Gene expression in 2014 experiment (units of cycle times). These data measure cycle times to threshold, relative to reference genes, for expression of ictB and SBPase. Six ictB single construct events and five ictB/SBPase double construct events were included. Cycle times to threshold relative to reference genes (ΔCT) are inversely related to number of transcripts relative to reference genes, as follows:
ΔCT = -log2([NictB]/[Nreference])/[1 + log2b] where b = efficiency of replication.
6. 2016_gene_exp: for Table S5. Gene expression in 2016 experiment (units of cycle times). These data measure cycle times to threshold, relative to reference genes, for expression of ictB and SBPase. Six ictB single construct events and five ictB/SBPase double construct events were included. Cycle times to threshold relative to reference genes (ΔCT) are inversely related to number of transcripts relative to reference genes, as follows:
ΔCT = -log2([NictB]/[Nreference])/[1 + log2b] where b = efficiency of replication.
7. Metabolites: for Table S7. Levels of 267 metabolites in leaf tissue. Four ictB single construct events and four ictB/SBPase double construct events were included in these analyses. Metabolites were measured in methanol-extracted samples, either by liquid chromatography / mass spectrometry or by gas chromatography / mass spectrometry, and were compared between events on a relative basis. As quantification was relative to wild type rather than on an absolute basis, no units are included.
8. Metabolite_F_values: for Table S8. F values for effects of ictB, SBPase (in cases where the model was better with a SBPase effect) and event. These analyses are done for each metabolite included in Table S7, and show effects of the explanatory variables ictB, SBPase, and individual event.
9. Biomass_2020: for Table S9. Biomass and grain yield at harvest, for ictB, ictB/SBPase and wild type sorghum plants in spring 2020. Four ictb/SBPase double construct and four ictB single construct events were included.
10. Biomass_2017: for Table S10. Biomass and grain yield at harvest, in chilled and non-chilled sorghum plants containing the ictB transgene (along with wild type controls) in fall 2017. Four ictB single construct events were included. Chilling treatment involved temperatures of 10 °C day / 7 °C night in growth chambers.
<i><b>All the variables in the file are explained as below:</i></b>
o Type (IctB-SBPase and IctB). This refers to whether a plant is wild type, single construct (contains only the ictB transgene) or double construct (contains both the ictB and SBPase transgenes).
o Code: these codes are shorter labels to refer to each transgene event for the sake of convenience.
o Alternate_Code: these codes are shorter labels to refer to each transgene event for the sake of convenience.
o Event Number: these are unique labels for each transgenic events.
o Construct Number: these are labels for each transgenic construct (either the ictB single construct or the ictB/SBPase double construct).
o year (i): this refers to the year in which the study was conducted (2014, 2016, 2017, or 2020)
o transgene or Transgenic: whether the transgene was present
o construct or Type : whether the ictB or the ictB/SBPase construct was present (double, single, wildtype):
o temp: leaf temperature during the measurement
o A: carbon assimilation rate, in μmol m-2 s-1
o gs: stomatal conductance, in mol m-2 s-1
o CI: intercellular carbon dioxide concentration, in parts per million or μL L-1
o fvfm:FV’/FM’ (maximal potential photosystem II quantum yield under light adapted conditions), dimensionless ratio
o phipsill: ΦPSII (maximal potential photosystem II quantum yield under light adapted conditions), dimensionless ratio
o qP: photochemical quenching, i.e. ratio of ΦPSII to FV’/FM’ , dimensionless ratio
o iwue: intrinsic water use efficiency, i.e. ratio of carbon assimilation rate to stomatal conductance, in units of μmol mol-1
o event: individual transgenic / transformation event
o Vmax: substrate-saturated in vitro activity of the SBPase enzyme, in μmol m-2 s-1
o ID: identification number of sample
o ΔCT1: difference in cycle times to threshold during gene expression (quantitative PCR) assay, between ictB and the reference gene GAPDH, in units of cycles
o ΔCT2: cycle times to threshold during gene expression (quantitative PCR) assay, between SBPase and the reference gene GAPDH, in units of cycles
o GAPDH: cycle times to threshold for the reference gene GAPDH (glyceraldehyde phosphate dehydrogenase)
o IctB: cycle times to threshold for the gene of interest ictB
o SBPase: cycle times to threshold for the gene of interest SBPase
o v1 to v267 represent individual metabolite (see the heading immediately above the labels v1, v2, etc.). Variables v268-v272 refer to total (summed) metabolite levels for particular pathways of interest.
o leaf: Leaf and stem dry biomass (in grams)
o seed: Seedhead dry biomass (in grams)
o biomass: Total (leaf, stem + seed head) dry biomass (in grams)
o harvind: ratio of seed head dry biomass to total dry biomass
o treatment (chilled and nonchilled): “Chilled” plants were grown under warm greenhouse conditions (32 °C day / 25 °C night) for 6 or 8 weeks, then switched to chilling temperatures under growth chamber conditions (10 °C / 7 °C night) for 8 days, and were then returned to greenhouse growing conditions.
-----------------
keywords:
ictB; SBPase; photosynthesis; sorghum; chilling
published:
2022-07-25
Related to the raw entity mentions (https://doi.org/10.13012/B2IDB-4163883_V1), this dataset represents the effects of the data cleaning process and collates all of the entity mentions which were too ambiguous to successfully link to the ChEBI ontology.
keywords:
synthetic biology; NERC data; chemical mentions; ambiguous entities
published:
2025-07-25
Mori, Jameson; Rivera, Nelda; Brown, William; Skinner, Daniel; Schlichting, Peter; Novakofski, Jan; Mateus-Pinilla, Nohra
(2025)
This dataset contains the pregnancy status of wild, white-tailed deer (Odocoileus virginianus) from northern Illinois culled as part of the Illinois Department of Natural Resources' chronic wasting disease (CWD) surveillance program. Fiscal years 2005 through 2024 are included. A fiscal year is the time between July 1st of one calendar year and June 30th of the next. Variables in this dataset include the pregnancy status, CWD infection status, age, weight, and day of mortality for each female deer, as well as the deer land cover utility (LCU) score for the TRS, township, or county from which the deer was culled. The deer population density of the county is also included. Data have been anonymized for landowner privacy reasons so that the location and year are not identifiable, but will give the same modeling results by maintaining how the data are grouped. The R code used to conduct the regression modeling is also included.
keywords:
cervid; Cervidae, chronic wasting disease; CWD; reproduction; white-tailed deer; Odocoileus virginianus; pregnancy; regression
published:
2019-04-05
Dong, Xiaoru; Xie, Jingyi; Hoang, Linh
(2019)
File Name: Inclusion_Criteria_Annotation.csv
Data Preparation: Xiaoru Dong
Date of Preparation: 2019-04-04
Data Contributions: Jingyi Xie, Xiaoru Dong, Linh Hoang
Data Source: Cochrane systematic reviews published up to January 3, 2018 by 52 different Cochrane groups in 8 Cochrane group networks.
Associated Manuscript authors: Xiaoru Dong, Jingyi Xie, Linh Hoang, and Jodi Schneider.
Associated Manuscript, Working title: Machine classification of inclusion criteria from Cochrane systematic reviews.
Description: The file contains lists of inclusion criteria of Cochrane Systematic Reviews and the manual annotation results. 5420 inclusion criteria were annotated, out of 7158 inclusion criteria available. Annotations are either "Only RCTs" or "Others". There are 2 columns in the file:
- "Inclusion Criteria": Content of inclusion criteria of Cochrane Systematic Reviews.
- "Only RCTs": Manual Annotation results. In which, "x" means the inclusion criteria is classified as "Only RCTs". Blank means that the inclusion criteria is classified as "Others".
Notes:
1. "RCT" stands for Randomized Controlled Trial, which, in definition, is "a work that reports on a clinical trial that involves at least one test treatment and one control treatment, concurrent enrollment and follow-up of the test- and control-treated groups, and in which the treatments to be administered are selected by a random process, such as the use of a random-numbers table." [Randomized Controlled Trial publication type definition from https://www.nlm.nih.gov/mesh/pubtypes.html].
2. In order to reproduce the relevant data to this, please get the code of the project published on GitHub at: https://github.com/XiaoruDong/InclusionCriteria and run the code following the instruction provided.
3. This datafile (V2) is a updated version of the datafile published at https://doi.org/10.13012/B2IDB-5958960_V1 with some minor spelling mistakes in the data fixed.
keywords:
Inclusion criteri; Randomized controlled trials; Machine learning; Systematic reviews
published:
2021-12-01
Crist, Samantha; Kopsco, Heather; Miller, Alexandria; Gronemeyer, Margaret; Mateus-Pinilla, Nohra; Smith, Rebecca
(2021)
An online knowledge, attitudes, and practices survey on ticks and tick-borne diseases was distributed to veterinary professionals in Southern and Central Illinois during summer and fall 2020. These are the raw data associated with that survey and the survey questions used.
* NOTE: "age" and "gender" variables were removed from the data to protect participants.
keywords:
ticks; veterinary medicine; tick-borne disease; survey
published:
2024-01-31
Kent, Angela; Bohn, Martin
(2024)
This dataset contains: field study design parameters, plant performance metrics, and nitrogen cycling rates associated with a field experiment that compared nitrification rates between maize lines with and without nitrification inhibition loci nitrogen fixation rates with with and without a nitrogen fixing inoculant product.
The overarching goal was to evaluate nitrogen fixation by a diazotroph inoculant and retention of nitrogen in the rhizosphere via a novel nitrification inhibition phenotype of maize.
keywords:
maize; microbiome; nitrogen cycling; nitrification; nitrogen fixation
published:
2024-04-15
Lyu, Zhiheng; Lehan, Yao; Zhisheng, Wang; Chang, Qian; Zuochen, Wang; Jiahui, Li; Yufeng, Wang; Qian, Chen
(2024)
The dataset contains trajectories of Pt nanoparticles in 1.98 mM NaBH4 and NaCl, tracked under liquid-phase TEM. The coordinates (x, y) of nanoparticles are provided, together with the conversion factor that translates pixel size to actual distance. In the file, ∆t denotes the time interval and NaN indicates the absence of a value when the nanoparticle has not emerged or been tracked. The labeling of nanoparticles in the paper is also noted in the second row of the file.
keywords:
nanomotor; liquid-phase TEM
published:
2020-09-02
Schneider, Jodi; Ye, Di; Hill, Alison
(2020)
Citation context annotation. This dataset is a second version (V2) and part of the supplemental data for Jodi Schneider, Di Ye, Alison Hill, and Ashley Whitehorn. (2020) "Continued post-retraction citation of a fraudulent clinical trial report, eleven years after it was retracted for falsifying data". Scientometrics. In press, DOI: 10.1007/s11192-020-03631-1
Publications were selected by examining all citations to the retracted paper Matsuyama 2005, and selecting the 35 citing papers, published 2010 to 2019, which do not mention the retraction, but which mention the methods or results of the retracted paper (called "specific" in Ye, Di; Hill, Alison; Whitehorn (Fulton), Ashley; Schneider, Jodi (2020): Citation context annotation for new and newly found citations (2006-2019) to retracted paper Matsuyama 2005. University of Illinois at Urbana-Champaign. <a href="https://doi.org/10.13012/B2IDB-8150563_V1">https://doi.org/10.13012/B2IDB-8150563_V1</a> ). The annotated citations are second-generation citations to the retracted paper Matsuyama 2005 (RETRACTED: Matsuyama W, Mitsuyama H, Watanabe M, Oonakahara KI, Higashimoto I, Osame M, Arimura K. Effects of omega-3 polyunsaturated fatty acids on inflammatory markers in COPD. Chest. 2005 Dec 1;128(6):3817-27.), retracted in 2008 (Retraction in: Chest (2008) 134:4 (893) https://doi.org/10.1016/S0012-3692(08)60339-6).
<b>OVERALL DATA for VERSION 2 (V2)</b>
FILES/FILE FORMATS
Same data in two formats:
2010-2019 SG to specific not mentioned FG.csv - Unicode CSV (preservation format only) - same as in V1
2010-2019 SG to specific not mentioned FG.xlsx - Excel workbook (preferred format) - same as in V1
Additional files in V2:
2G-possible-misinformation-analyzed.csv - Unicode CSV (preservation format only)
2G-possible-misinformation-analyzed.xlsx - Excel workbook (preferred format)
<b>ABBREVIATIONS: </b>
2G - Refers to the second-generation of Matsuyama
FG - Refers to the direct citation of Matsuyama (the one the second-generation item cites)
<b>COLUMN HEADER EXPLANATIONS </b>
File name: 2G-possible-misinformation-analyzed. Other column headers in this file have same meaning as explained in V1. The following are additional header explanations:
Quote Number - The order of the quote (citation context citing the first generation article given in "FG in bibliography") in the second generation article (given in "2G article")
Quote - The text of the quote (citation context citing the first generation article given in "FG in bibliography") in the second generation article (given in "2G article")
Translated Quote - English translation of "Quote", automatically translation from Google Scholar
Seriousness/Risk - Our assessment of the risk of misinformation and its seriousness
2G topic - Our assessment of the topic of the cited article (the second generation article given in "2G article")
2G section - The section of the citing article (the second generation article given in "2G article") in which the cited article(the first generation article given in "FG in bibliography") was found
FG in bib type - The type of article (e.g., review article), referring to the cited article (the first generation article given in "FG in bibliography")
FG in bib topic - Our assessment of the topic of the cited article (the first generation article given in "FG in bibliography")
FG in bib section - The section of the cited article (the first generation article given in "FG in bibliography") in which the Matsuyama retracted paper was cited
keywords:
citation context annotation; retraction; diffusion of retraction; second-generation citation context analysis
published:
2021-05-10
Varela Quintela, Sebastian; Leakey, Andrew
(2021)
UAV-based high-resolution multispectral time-series orthophotos utilized to understand the relation between growth dynamics, imagery temporal resolution, and end-of-season biomass productivity of biomass sorghum as bioenergy crop. Sensor utilized is a RedEdge Micasense flown at 40 meters above ground level at the Energy Farm- UIUC in 2019.
keywords:
Unmanned aerial vehicles; High throughput phenotyping; Machine learning; Bioenergy crops
published:
2022-10-14
Dietrich, Christopher; Dmitriev, Dmitry; Takiya, Daniela; Thomas, Michael; Webb, Michael D; Zahniser, James; Zhang, Yalin
(2022)
The Membracoidea_morph_data_Final.nex text file contains the original data used in the phylogenetic analyses of Dietrich et al. (Insect Systematics and Diversity, in review). The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The complete taxon names corresponding to the 131 genus names listed under “BEGIN TAXA” are listed in Table 1 in the included PDF file “Taxa_and_characters”; the 229 morphological characters (names abbreviated under under “BEGIN CHARACTERS” are fully explained in the list of character descriptions following Table 1 in the same PDF). The data matrix follows “MATRIX” and gives the numerical values of characters for each taxon. Question marks represent missing data. The lists of characters and taxa and details on the methods used for phylogenetic analysis are included in the submitted manuscript.
keywords:
leafhopper; treehopper; evolution; Cretaceous; Eocene
published:
2021-01-25
Zenzal, T. J. ; Ward, Michael; Diehl, Rob; Buler, Jeffrey; Smolinsky, Jaclyn; Deppe, Jill; Bolus, Rachel; Celis-Murillo, Antonio; Moore, Frank
(2021)
Dataset associated with Zenzal et al. Oikos submission: Retreat, detour, or advance? Understanding the movements of birds confronting the Gulf of Mexico. https://doi.org/10.1111/oik.07834
Four CSV files were used for analysis and are related to the following subsections under the “Statistics” heading in the “Materials and Methods” section of the journal article:
1. Departing the Edge = “AIC Analysis.csv”
2. Comparing Retreating to Advancing = “Advance and Retreat Analysis.csv” and “Wind Data at Departure.csv”
3. Food Abundance = “Fruit Data.csv” and “Arthropod Data.csv”
<b>Description of variables:</b>
Year: the year in which data were collected.
Departure: the direction in which an individual departed the Bon Secour National Wildlife Refuge. “North” indicates an individual that departed ≥315° or <45°; “Circum” indicates an individual that departed east (45 – 134°) or west ( 225 – 314°); “Trans” indicates an individual that departed south (135 – 224°).
Age: the age of an individual at capture. Individuals were aged as hatch year (HY) or after hatch year (AHY) according to Pyle (1997; see related article for full citation).
Fat: the fat score of an individual at capture. Individuals were scored on a 6-point scale ranging from 0-5 following Helms and Drury (1960; see related article for full citation).
Species: the standardized four letter alphabetic code used as an abbreviation for English common names of North American Birds. SWTH: Catharus ustulatus; REVI: Vireo olivaceus; INBU: Passerina cyanea; WOTH: Hylocichla mustelina; RTHU: Archilochus colubris.
FTM_SD: stopover duration or number of days between first capture and departure from automated radio telemetry system coverage at the Bon Secour National Wildlife Refuge.
TMB_SD: stopover duration or number of days between first and last detection from automated radio telemetry systems north of Mobile Bay, AL, USA.
Mean speed north (km/hr): the northbound travel speed of individuals retreating from the Bon Secour National Wildlife Refuge by determining the time when the signal strength indicated the bird was directly east or west of the automated telemetry system and dividing the amount of time it took for an individual to move in an assumed straight path between the Refuge systems and those north of Mobile Bay, AL, USA.
Mean speed south (km/hr): the southbound travel speed of individuals advancing from north of Mobile Bay, AL, USA by determining the time when the signal strength indicated the bird was directly east or west of the automated telemetry system and dividing the amount of time it took for an individual to move in an assumed straight path between the Refuge systems and those north of Mobile Bay, AL, USA.
LN_FTM_DEP_TIME: the natural log of departure time from the Bon Secour National Wildlife Refuge. Departure time is defined as the number of hours before or after civil twilight.
LN_TMB_DEP_TIME: the natural log of departure time from north of Mobile Bay, AL, USA. Departure time is defined as the number of hours before or after civil twilight.
Paired_FTM_DEP_TIME: the departure time or number of hours before or after civil twilight from Bon Secour National Wildlife Refuge.
Paired_TMB_DEP_TIME: the departure time or number of hours before or after civil twilight from north of Mobile Bay, AL, USA.
Wind Direction: the direction from which the wind originated at the Bon Secour National Wildlife Refuge on nights when individuals were departing. “N” indicates winds from the north (≥315° or <45°); “E” indicates winds from the east (45 – 134°); “W” indicates winds from the west ( 225 – 314°); “S” indicates winds from the south (135 – 224°).
Wind Speed (m/s): the wind speed on nights when individuals were departing the Bon Secour National Wildlife Refuge.
Group: the direction the bird was traveling under specific wind conditions. Northbound individuals traveled north from Bon Secour National Wildlife Refuge. Southbound individuals traveled south from habitats north of Mobile Bay, AL, USA.
Fruit: weekly mean number of ripe fruit per meter.
Site: the site from which the data were collected. FTM is located within the Bon Secour National Wildlife Refuge. TMB is located within the Jacinto Port Wildlife Management Area.
DOY: number indicating day of year (i.e., 1 January = 001….31 December = 365).
Arthropod Biomass: estimated mean arthropod biomass from each sampling period.
<b>Note:</b> Empty cells indicate unavailable data where applicable.
keywords:
migratory birds; migration; automated telemetry; Gulf of Mexico
published:
2025-10-29
Zhang, Zhengyi; Feng, Jianqiang; Yang, Chao; Cui, Haiyang (Ocean); Harrison, Wesley; Zhong, Dongping; Wang, Binju; Zhao, Huimin
(2025)
Since the discovery of Hofmann–Löffler–Freytag reaction more than 130 years ago, both the structure and reactivity of nitrogen-centred radicals have been widely studied. Nevertheless, catalytic enantioselective intermolecular radical hydroamination remains a challenge due to the existence of side reactions, the short lifetime of nitrogen-centred radicals and lack of understanding of the fundamental catalytic steps. In the laboratory, nitrogen-centred radicals are produced with radical initiators, photocatalysts or electrocatalysts. In contrast, their generation and reaction are unknown in nature. Here we report a pure biocatalytic system for the photoenzymatic production of nitrogen-centred radicals and enantioselective intermolecular radical hydroaminations by successfully repurposing an ene-reductase through directed evolution. These reactions progress efficiently at room temperature under visible light without any external photocatalysts and exhibit excellent enantioselectivities. A detailed mechanistic study reveals that the enantioselectivity originates from the radical-addition step while the reactivity originates from the ultrafast photoinduced electron transfer from reduced flavin mononucleotide to nitrogen-containing substrates.
keywords:
Conversion;Catalysis
published:
2024-11-27
Han, Hee-Sun; Schrader, Alex; Lee, JuYeon; Yeo, Seokjin; Traniello, Ian
(2024)
Honey bee (apis mellifera) MERFISH data set prepared by the Han lab, from brains collected by the Robinson lab at UIUC. Dataset is comprised of ~22 thousand cells and 130 genes with x,y locations for each cell. Jupyter notebook file is included as an example to load the data using Scanpy.
keywords:
smFISH; single transcript spatial transcriptomics; Honey bee brain; Apis mellifera; MERFISH