Illinois Data Bank Dataset Search Results
Results
published:
2024-09-19
Klimasmith, Isaac; Kent, Angela
(2024)
The use of potentially beneficial microorganisms in agriculture (microbial inoculants) has rapidly accelerated in recent years. For microbial inoculants to be effective as agricultural tools, these organisms must be able to survive and persist in novel environments while not destabilizing the resident community or spilling over into adjacent natural ecosystems. Here, we adapt a macroecological propagule pressure model to a microbial scale and present an experimental approach for testing the role of propagule pressure in microbial inoculant introductions. We experimentally determined the risk-release relationship for an IAA-expressing Pseudomonas simiae inoculant in a model monocot system. We then used this relationship to simulate establishment outcomes under a range of application frequencies (propagule number) and inoculant concentrations (propagule size). Our simulations show that repeated inoculant applications may increase establishment, even when increased inoculant concentration does not alter establishment probabilities.
The dataset filed here includes the experimemtal datafile, and a RMarkdown file that includes all the code used in in both the modeling and anaylsis.
keywords:
microbial inoculants; invasion ecology; propagule pressure; agriculture; modeling
published:
2024-06-17
Stuchiner, Emily; Jernigan, Wyatt; Zhang, Ziliang; Eddy, William; DeLucia, Evan; Yang, Wendy
(2024)
Data includes carbon mineralization rates, potential denitrification rates, net nitrous oxide fluxes, and soil chemical properties from a laboratory incubation of soil samples collected from 20 locations across an Illinois maize field.
keywords:
denitrification; nitrous oxide; dissolved organic carbon; maize
published:
2025-05-02
This dataset contains the first-generation (1st-gen) and second-generation (2nd-gen) citation relationships to a set of focal papers. The 1st-gen citation relationships are the instances of one paper citing a focal paper. These citing papers are called "1st-gen citations." The 2nd-gen citation relationships are the instances that a paper cites a 1st-gen citation. The citing paper in the 2nd-gen citation relationship is a second-generation (2nd-gen) citation. When a 2nd-gen citation is also a 1st-gen citation, it creates a transitive closure with the focal paper.
Each focal paper has an abbreviation, which can be found below. The 1st-gen and 2nd-gen citation relationships were extracted from the Curated Open Citation Dataset (Korobskiy & Chacko, 2023), which is derived from a copy of COCI, the OpenCitations Index of Crossref Open DOI-to-DOI Citations, downloaded on May 6, 2023. Scripts used to collect this dataset can be found at https://github.com/yuanxiesa/transitive_closure_study. Each focal paper currently has two files: {abbreviation}_1st.csv contains the 1st-gen citation relationships; {abbreviation}_2nd.csv contains the 2nd-gen citation relationships.
Focal paper abbreviation == "louvain": Blondel, V. D., Guillaume, J.-L., Lambiotte, R., & Lefebvre, E. (2008). Fast unfolding of communities in large networks. Journal of Statistical Mechanics: Theory and Experiment, 2008(10), P10008. https://doi.org/10.1088/1742-5468/2008/10/P10008
Focal paper abbreviation == "lp": Raghavan, U. N., Albert, R., & Kumara, S. (2007). Near linear time algorithm to detect community structures in large-scale networks. Physical Review E, 76(3), 036106. https://doi.org/10.1103/PhysRevE.76.036106
Focal paper abbreviation == "gn": Newman, M. E. J., & Girvan, M. (2004). Finding and evaluating community structure in networks. Physical Review E, 69(2), 026113. https://doi.org/10.1103/PhysRevE.69.026113
keywords:
transitive closure; citations; community detection algorithms; OpenCitations; method papers
published:
2025-10-30
Dwivedi, Nidhi; Yamamoto, Senri; Zhao, Yunjun; Hou, Guichuan; Bowling, Forrest; Tobimatsu, Yuki; Liu, Chang-Jun
(2025)
Grass lignocelluloses feature complex compositions and structures. In addition to the presence of conventional lignin units from monolignols, acylated monolignols and flavonoid tricin also incorporate into lignin polymer; moreover, hydroxycinnamates, particularly ferulate, cross-link arabinoxylan chains with each other and/or with lignin polymers. These structural complexities make grass lignocellulosics difficult to optimize for effective agro-industrial applications. In the present study, we assess the applications of two engineered monolignol 4-O-methyltransferases (MOMTs) in modifying rice lignocellulosic properties. Two MOMTs confer regiospecific para-methylation of monolignols but with different catalytic preferences. The expression of MOMTs in rice resulted in differential but drastic suppression of lignin deposition, showing more than 50% decrease in guaiacyl lignin and up to an 90% reduction in syringyl lignin in transgenic lines. Moreover, the levels of arabinoxylan-bound ferulate were reduced by up to 50%, and the levels of tricin in lignin fraction were also substantially reduced. Concomitantly, up to 11 μmol/g of the methanol-extractable 4-O-methylated ferulic acid and 5–7 μmol/g 4-O-methylated sinapic acid were accumulated in MOMT transgenic lines. Both MOMTs in vitro displayed discernible substrate promiscuity towards a range of phenolics in addition to the dominant substrate monolignols, which partially explains their broad effects on grass phenolic biosynthesis. The cell wall structural and compositional changes resulted in up to 30% increase in saccharification yield of the de-starched rice straw biomass after diluted acid-pretreatment. These results demonstrate an effective strategy to tailor complex grass cell walls to generate improved cellulosic feedstocks for the fermentable sugar-based production of biofuel and bio-chemicals.
keywords:
Feedstock Production;Biomass Analytics;Genome Engineering
published:
2025-09-25
Vu-Le, The-Anh; Park, Minhyuk; Chen, Ian; Warnow, Tandy
(2025)
Dataset for "Using Stochastic Block Models for Community Detection". This contains synthetic networks with ground-truth community structure generated using synthetic network generators (specifically, ABCD+o) based on real-world networks and computed clusterings on these real-world networks.
Note:
* networks.zip contains the synthetic networks
published:
2025-11-17
Bayer , Hugo; Hassell Jr, James; Oleksiak, Cecily; Garcia, Gabriela; Hollis, Vaughan; Juliano, Vitor; Maren, Stephen
(2025)
Raw data from the article "Pharmacological stimulation of infralimbic cortex after fear conditioning facilitates subsequent fear extinction", published in Neuropsychopharmacology in 2024.
published:
2024-11-19
Salami, Malik Oyewale; McCumber, Corinne
(2024)
This project investigates retraction indexing agreement among data sources: Crossref, Retraction Watch, Scopus, and Web of Science. As of July 2024, this reassesses the April 2023 union list of Schneider et al. (2023): https://doi.org/10.55835/6441e5cae04dbe5586d06a5f. As of April 2023, over 1 in 5 DOIs had discrepancies in retraction indexing among the 49,924 DOIs indexed as retracted in at least one of Crossref, Retraction Watch, Scopus, and Web of Science (Schneider et al., 2023). Here, we determine what changed in 15 months.
Pipeline code to get the results files can be found in the GitHub repository
https://github.com/infoqualitylab/retraction-indexing-agreement in the iPython notebook 'MET-STI2024_Reassessment_of_retraction_indexing_agreement.ipynb'
Some files have been redacted to remove proprietary data, as noted in README.txt. Among our sources, data is openly available only for Crossref and Retraction Watch.
FILE FORMATS:
1) unionlist_completed_2023-09-03-crws-ressess.csv - UTF-8 CSV file
2) unionlist_completed-ria_2024-07-09-crws-ressess.csv - UTF-8 CSV file
3) unionlist-15months-period_sankey.png - Portable Network Graphics (PNG) file
4) unionlist_ria_proportion_comparison.png - Portable Network Graphics (PNG) file
5) README.txt - text file
FILE DESCRIPTION:
Description of the files can be found in README.txt
keywords:
retraction status; data quality; indexing; retraction indexing; metadata; meta-science; RISRS
published:
2025-10-10
Singh, Ramkrishna; Liu, Hui; Shanklin, John; Singh, Vijay
(2025)
Lipids accumulated in the vegetative tissues of cellulosic feedstocks can be a potential raw material for biodiesel and bioethanol production. In this work, bagasse of genetically engineered sorghum was subjected to liquid hot-water pretreatment at 170, 180, and 190 °C for different reaction time. Under the optimal pretreatment condition (170 °C, 20 min), the residue was enriched in glucan (57.39 ± 2.63 % w/w) and xylan (13.38 ± 0.49 % w/w). The total lipid content of the pretreated residue was 6.81% w/w, similar to that observed in untreated bagasse (6.30% w/w). Pretreatment improved the enzymatic digestibility of bagasse, allowing a recovery of 79% w/w and 86% w/w of glucose and xylose, respectively. The pretreatment and enzymatic saccharification resulted in a 2-fold increase in total lipid in enzymatic residue compared to the original bagasse. Thus, pretreatment and enzymatic hydrolysis enabled high sugar recovery while concentrating triglycerides and free fatty acids in the residue.
keywords:
Conversion;Feedstock Production;Feedstock Bioprocessing
published:
2018-11-21
Clark, Lindsay V.; Lipka, Alexander E.; Sacks, Erik J.
(2018)
This set of scripts accompanies the manuscript describing the R package polyRAD, which uses DNA sequence read depth to estimate allele dosage in diploids and polyploids. Using several high-confidence SNP datasets from various species, allelic read depth from a typical RAD-seq dataset was simulated, then genotypes were estimated with polyRAD and other software and compared to the true genotypes, yielding error estimates.
keywords:
R programming language; genotyping-by-sequencing (GBS); restriction site-associated DNA sequencing (RAD-seq); polyploidy; single nucleotide polymorphism (SNP); Bayesian genotype calling; simulation
published:
2018-12-20
Dong, Xiaoru; Xie, Jingyi; Linh, Hoang
(2018)
File Name: Inclusion_Criteria_Annotation.csv
Data Preparation: Xiaoru Dong
Date of Preparation: 2018-12-14
Data Contributions: Jingyi Xie, Xiaoru Dong, Linh Hoang
Data Source: Cochrane systematic reviews published up to January 3, 2018 by 52 different Cochrane groups in 8 Cochrane group networks.
Associated Manuscript authors: Xiaoru Dong, Jingyi Xie, Linh Hoang, and Jodi Schneider.
Associated Manuscript, Working title: Machine classification of inclusion criteria from Cochrane systematic reviews.
Description: The file contains lists of inclusion criteria of Cochrane Systematic Reviews and the manual annotation results. 5420 inclusion criteria were annotated, out of 7158 inclusion criteria available. Annotations are either "Only RCTs" or "Others". There are 2 columns in the file:
- "Inclusion Criteria": Content of inclusion criteria of Cochrane Systematic Reviews.
- "Only RCTs": Manual Annotation results. In which, "x" means the inclusion criteria is classified as "Only RCTs". Blank means that the inclusion criteria is classified as "Others".
Notes:
1. "RCT" stands for Randomized Controlled Trial, which, in definition, is "a work that reports on a clinical trial that involves at least one test treatment and one control treatment, concurrent enrollment and follow-up of the test- and control-treated groups, and in which the treatments to be administered are selected by a random process, such as the use of a random-numbers table." [Randomized Controlled Trial publication type definition from https://www.nlm.nih.gov/mesh/pubtypes.html].
2. In order to reproduce the relevant data to this, please get the code of the project published on GitHub at: https://github.com/XiaoruDong/InclusionCriteria and run the code following the instruction provided.
keywords:
Inclusion criteria, Randomized controlled trials, Machine learning, Systematic reviews
published:
2020-06-02
Xue, Qingquan; Dietrich, Christopher; Zhang, Yalin
(2020)
The text file contains the original data used in the phylogenetic analyses of Xue et al. (2020: Systematic Entomology, in press). The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The first six lines of the file identify the file as NEXUS, indicate that the file contains data for 89 taxa (species) and 2676 characters, indicate that the first 2590 characters are DNA sequence and the last 86 are morphological, that gaps inserted into the DNA sequence alignment and inapplicable morphological characters are indicated by a dash, and that missing data are indicated by a question mark. The file contains aligned nucleotide sequence data for 5 gene regions and 86 morphological characters. The positions of data partitions are indicated in the mrbayes block of commands for the phylogenetic program MrBayes at the end of the file (Subset1 = 16S gene; Subset2 = 28S gene; Subset3 = COI gene; Subset 4 = Histone H3 and H2A genes). The mrbayes block also contains instructions for MrBayes on various non-default settings for that program. These are explained in the original publication. Descriptions of the morphological characters and more details on the species and specimens included in the dataset are provided in the supplementary document included as a separate pdf, also available from the journal website. The original raw DNA sequence data are available from NCBI GenBank under the accession numbers indicated in the supplementary file.
keywords:
phylogeny; DNA sequence; morphology; Insecta; Hemiptera; Cicadellidae; leafhopper; evolution; 28S rDNA; 16S rDNA; histone H3; histone H2A; cytochrome oxidase I; Bayesian analysis
published:
2024-03-21
Becker, Maria; Han, Kanyao; Werthmann, Antonina; Rezapour, Rezvaneh; Lee, Haejin; Diesner, Jana; Witt, Andreas
(2024)
Impact assessment is an evolving area of research that aims at measuring and predicting the potential effects of projects or programs. Measuring the impact of scientific research is a vibrant subdomain, closely intertwined with impact assessment. A recurring obstacle pertains to the absence of an efficient framework which can facilitate the analysis of lengthy reports and text labeling. To address this issue, we propose a framework for automatically assessing the impact of scientific research projects by identifying pertinent sections in project reports that indicate the potential impacts. We leverage a mixed-method approach, combining manual annotations with supervised machine learning, to extract these passages from project reports. This is a repository to save datasets and codes related to this project.
Please read and cite the following paper if you would like to use the data:
Becker M., Han K., Werthmann A., Rezapour R., Lee H., Diesner J., and Witt A. (2024). Detecting Impact Relevant Sections in Scientific Research. The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING).
This folder contains the following files:
evaluation_20220927.ods: Annotated German passages (Artificial Intelligence, Linguistics, and Music) - training data
annotated_data.big_set.corrected.txt: Annotated German passages (Mobility) - training data
incl_translation_all.csv: Annotated English passages (Artificial Intelligence, Linguistics, and Music) - training data
incl_translation_mobility.csv: Annotated German passages (Mobility) - training data
ttparagraph_addmob.txt: German corpus (unannotated passages)
model_result_extraction.csv: Extracted impact-relevant passages from the German corpus based on the model we trained
rf_model.joblib: The random forest model we trained to extract impact-relevant passages
Data processing codes can be found at: https://github.com/khan1792/texttransfer
keywords:
impact detection; project reports; annotation; mixed-methods; machine learning
published:
2025-06-30
Mori, Jameson; Skowron, Nicholas; Barr, Daniel; Johnson, Ben; Novakofski, Jan; Mateus-Pinilla, Nohra
(2025)
This dataset contains measurements of water loss as white-tailed deer (Odocoileus virginianus) retroypharyngeal lymph nodes air-dried in a refrigerator for 31 days. Daily weights for lymph nodes are recorded every 24 hours, as are the variables "firmness" and "surface wetness". "Firmness" is a categorical variable measuring how much the tissue deforms to the touch (soft, medium, or hard). "Surface wetness" is the amount of visible moisture on the outside of the lymph node (all, some, or none). Lymph node weights were measured until their weights stabilized for 3 consecutive days at two decimal places (ex. 3.02, 3.02, 3.02) or until the weights fluctuated only by 0.01 (ex. 3.02, 3.03, 3.02). Lymph nodes were from northern Illinois white-tailed deer collected as part of the Illinois Department of Natural Resources' ongoing chronic wasting disease (CWD) management efforts.
keywords:
cervid; lymph node; chronic wasting disease; cwd; diagnostic testing; dessication; drying; tissue
published:
2025-10-24
Choe, Kisurb; Jindra, Michael A.; Hubbard, Susan; Pfleger, Brian; Sweedler, Jonathan
(2025)
Creating controlled lipid unsaturation locations in oleochemicals can be a key to many bioengineered products. However, evaluating the effects of modifications to the acyl-ACP desaturase on lipid unsaturation is not currently amenable to high-throughput assays, limiting the scale of redesign efforts to <200 variants. Here, we report a rapid mass spectrometry (MS) assay for profiling the positions of double bonds on membrane lipids produced by Escherichia coli colonies after treatment with ozone gas. By MS measurement of the ozonolysis products of Δ6 and Δ8 isomers of membrane lipids from colonies expressing recombinant Thunbergia alata desaturase, we screened a randomly mutagenized library of the desaturase gene at 5 s per sample. Two variants with altered regiospecificity were isolated, indicated by an increase in 16:1 Δ8 proportion. We also demonstrated the ability of these desaturase variants to influence the membrane composition and fatty acid distribution of E. coli strains deficient in the native acyl-ACP desaturase gene, fabA. Finally, we used the fabA deficient chassis to concomitantly express a non-native acyl-ACP desaturase and a medium-chain thioesterase from Umbellularia californica, demonstrating production of only saturated free fatty acids.
keywords:
Conversion;Lipidomics;Mass Spectrometry
published:
2025-07-28
McCumber, Corinne; Salami, Malik Oyewale
(2025)
This project investigates retraction indexing agreement in PubMed between 2024-07-03 and 2025-05-09 in order to address an API limitation that resulted in 199 items being excluded from analysis in "Analyzing the consistency of retraction indexing". PubMed was queried on 2024-07-03 and on 2025-05-09 using the search “Retracted Publication[PT]”. PubMed is only able to return 10,000 items when queried via the E-Utilities API. When the pipeline was run 2024-07-03, the search between 2020 and 2024 returned 10,199 items, meaning that an expected 199 items indexed as retracted in PubMed were excluded. This dataset uses and compares information from PubMed as of 2025-05-09 to attempt to identify those 199 items.
keywords:
retraction status; data quality; indexing; retraction indexing; metadata; meta-science; RISRS; PubMed
published:
2025-11-06
Deshavath, Narendra Naik; Woodruff, William; Eller, Fred; Susanto, Vionna; Yang, Cindy; Rao, Christopher V.; Singh, Vijay
(2025)
Microbial oils are a sustainable biomass-derived substitute for liquid fuels and vegetable oils. Oilcane, an engineered sugarcane with superior feedstock characteristics for biodiesel production, is a promising candidate for bioconversion. This study describes the processing of oilcane stems into juice and hydrothermally pretreated lignocellulosic hydrolysate and their valorization to ethanol and microbial oil using Saccharomyces cerevisiae and engineered Rhodosporidium toruloides strains, respectively. A bioethanol titer of 106 g/L was obtained from S. cerevisiae grown on oilcane juice in a 3 L fermenter, and a lipid titer of 8.8 g/L was obtained from R. toruloides grown on oilcane hydrolysate in a 75 L fermenter. Oil was extracted from the R. toruloides cells using supercritical CO2, and the observed fatty acid profile was consistent with previous studies on this strain. These results demonstrate the feasibility of pilot-scale lipid production from oilcane hydrolysate as part of an integrated bioconversion strategy.
keywords:
Conversion;Bioproducts;Feedstock Bioprocessing;Hydrolysate
published:
2022-09-29
3DIFICE: 3-dimensional Damage Imposed on Frame structures for Investigating Computer vision-based Evaluation methods
This dataset contains 1,396 synthetic images and label maps with various types of earthquake damage imposed on reinforced concrete frame structures. Damage includes: cracking, spalling, exposed transverse rebar, and exposed longitudinal rebar. Each image has an associated label map that can be used for training machine learning algorithms to recognize the various types of damage.
keywords:
computer vision; earthquake engineering; structural health monitoring; civil engineering; structural engineering;
published:
2026-01-15
Singh, Ramkrishna; Bhagwat, Sarang; Viswanathan, Mothi Bharath; Cortes-Pena, Yoel; Eilts, Kristen; Mingfeng, Cao; Guest, Jeremy; Zhao, Huimin; Singh, Vijay
(2026)
Triacetic acid lactone (TAL) can be microbially produced and further chemically upgraded to several high-value chemicals. In this work, several acidic and basic ion-exchange resins and activated charcoal were evaluated for their ability to adsorb microbially produced TAL. Activated charcoal and a weak base resin, Dowex 66, showed similar TAL adsorption capacity of 0.18 ± 0.002 g/g. At 15% w/v activated charcoal, about 98% of TAL present in fermentation broth could be adsorbed. Further, ethanol washing allowed recovery of 72% of adsorbed TAL. A biorefinery producing TAL from sucrose was designed, simulated, and evaluated (through technoeconomic analysis) under uncertainty, for an estimated TAL minimum product selling price (MPSP) of $4.27/kg [$3.71−4.94/kg; 5th-95th percentiles] for the current state of technology and $2.83/kg [$2.46–3.29/kg] following potential near-term improvements to fermentation. Thus, this work provides an adsorptive process to recover microbially produced TAL that can be chemically upgraded to several industrial products.
keywords:
Bioproducts; Feedstock Bioprocessing
published:
2024-10-07
Kole Aspray, Elise; Ainsworth, Elizabeth; McGrath, Jesse; McGrath, Justin; Montes, Christopher; Whetten, Andrew; Ort, Donald; Long, Stephen; Puthuval, Kannan; Mies, Timothy; Bernacchi, Carl; DeLucia, Evan; Dalsing, Bradley; Leakey, Andrew; Li, Shuai; Herriott, Jelena; Miglietta, Franco
(2024)
This data set is related to the SoyFACE experiments, which are open-air agricultural climate change experiments that have been conducted since 2001. The fumigation experiments take place at the SoyFACE farm and facility in Champaign County, Illinois during the growing season of each year, typically between June and October.
This V4 contains new experimental data files, hourly fumigation files, and weather/ambient files for 2022 and 2023, since the original dataset only included files for 2001-2021. The MATLAB code has also been updated for efficiency, and explanatory files have been updated accordingly. Below are new changes in V4:
- The "SoyFACE Plot Information 2001 to 2021" file is renamed to “SoyFACE ring information 2001 to 2023.xlsx”. Data for 2022 and 2023 were added. File contains information about each year of the SoyFACE experiments, including the fumigation treatment type (CO2, O3, or a combination treatment), the crop species, the plots (also referred to as 'rings' and labeled with numbers between 2 and 31) used in each experiment, important experiment dates, and the target concentration levels or 'setpoints' for CO2 and O3 in each experiment.
- The "SoyFACE 1-Minute Fumigation Data Files" were updated to contain sub-folders for each year of the experiments (2001-2023), each of which contains sub-folders for each ring used in that year's experiments. This data set also includes hourly data files for the fumigation experiments ("SoyFACE Hourly Fumigation Data Files" folder) created from the 1-minute files, and hourly ambient/weather data files for each year of the experiments ("Hourly Weather and Ambient Data Files" folder which has also been updated to include 2022 and 2023 data). The ambient CO2 and O3 data are collected at SoyFACE, and the weather data are collected from the SURFRAD and WARM weather stations located near the SoyFACE farm.
- “Rings.xlsx” is new in this version. This file lists the rings and treatments used in each year of the SoyFACE experiments between 2001 and 2023 and is used in several of the MATLAB codes.
- “CMI Weather Data Explanation.docx” is newly added. This file contains specific information about the processing of raw weather data, which is used in the hourly weather and ambient data files.
- Files that were in RAR format in V3 are now updated and saved as ZIP format, including: Hourly Weather and Ambient Data Files.zip , SoyFACE 1-Minute Fumigation Data Files.zip , SoyFACE Hourly Fumigation Data Files.zip, and Matlab Files.zip.
- The "Fumigation Target Percentages" file was updated to add data for 2022 and 2023. This file shows how much of the time the CO2 and O3 fumigation levels are within a 10 or 20 percent margin of the target levels when the fumigation system is turned on.
- The "Matlab Files" folder contains custom code (Aspray, E.K.) that was used to clean the "SoyFACE 1-Minute Fumigation Data" files and to generate the "SoyFACE Hourly Fumigation Data" and "Fumigation Target Percentages" files. Code information can be found in the various "Explanation" files. The Matlab code changes are as follows:
1. “Data_Issues_Finder.m” code was changed to use the “Ring.xlsx” file to gather ring and treatment information based on the contents of the file rather than being hardcoded in the Matlab code itself.
2. “Data_Issues_Finder_all.m” code is new. This code is the same as the “Data_Issues_Finder.m” code except that it identifies all CO2 and O3 repeats. In contrast, the “Data_Issues_Finder.m” code only identifies CO2 and O3 repeats that occur when the fumigation system is turned on.
3. “Target_Yearly.m” code was changed to use the “Ring.xlsx” file to gather ring and treatment information based on the contents of the file rather than being hardcoded in the Matlab code itself.
4. “HourlyFumCode.m” code is new. This code uses the “Rings.xlsx” file to gather ring and treatment information based on the contents of the file instead of the user needing to define these values explicitly. This code also defines a list of all ring folders for the year selected and runs the hourly code for each ring, instead of the user having to run the hourly code for each ring individually. Finally, the code generates two dialog boxes for the user, one which allows user to specify whether they want the hourly code to be run for 1-minute fumigation files or 1-minute ambient files, and another which allows user to specify whether they would like the hourly fumigation averages to be replaced with hourly ambient averages when the fumigation system is turned off.
5. “HourlyDataFun.m” code was changed to run either “HourlyData.m” code or “HourlyDataAmb.m” code, depending on user input in the first dialog box.
6. “HourlyData.m” code was changed to replace hourly fumigation averages with hourly ambient averages when the fumigation system is turned off, depending on user input in the second dialog box.
7. “HourlyDataAmb.m” code is new. This code is similar to “HourlyData.m” code but is used to calculate hourly averages for 1-minute ambient files instead 1-minute fumigation files.
8. “batch.m” code was changed to account for new function input variables in “HourlyDataFun.m” code, along with adding header columns for “FumOutput.xlsx” and “AmbOutput.xlsx” output files generated by “HourlyData.m” and “HourlyDataAmb.m” code.
- Finally, the " * Explanation" files contain information about the column names, units of measurement, steps needed to use Matlab code, and other pertinent information for each data file. Some of them have been updated to reflect the current change of data.
keywords:
SoyFACE; agriculture; agricultural; climate; climate change; atmosphere; atmospheric change; CO2; carbon dioxide; O3; ozone; soybean; fumigation; treatment
published:
2019-03-06
Makhnenko, Roman; Tarokh, Ali
(2019)
This dataset is provided to support the statements in Tarokh, A., and R.Y. Makhnenko. 2019. Remarks on the solid and bulk responses of fluid-filled porous rock, Geophysics.
The unjacketed bulk modulus is a poroelastic parameter that can be directly measured in a laboratory test under a loading that preserves the difference between the mean stress and pore pressure constant. For a monomineralic rock, the measurement of the unjacketed bulk modulus is ignored because it is assumed to be equal to the bulk modulus of the solid phase. To examine this assumption, we tested porous sandstones (Berea and Dunnville) and limestones (Apulian and Indiana) mainly composed of quartz and calcite, respectively, under the unjacketed condition. The presence of microscale inhomogeneities, in the form of non-connected (occluded) pores, was shown to cause a considerable difference between the unjacketed bulk modulus and the bulk modulus of the solid phase. Furthermore, we found the unjacketed bulk modulus to be independent of the unjacketed pressure and Terzaghi effective pressure and therefore a constant.
keywords:
Poroelasticity; anisotropic solid skeleton; unjacketed bulk modulus; non-connected porosity
published:
2025-09-17
Zhao, Huimin; Rabinowitz, Joshua; Guest, Jeremy; Zhu, Zhixin; Bhagwat, Sarang; Li, Xi; Weilandt, Daniel; Xu, Hao; Tan, Shih-I; Tran, Vinh
(2025)
Microbial production of chemicals may suffer from inadequate cofactor provision, a challenge further exacerbated in yeasts due to compartmentalized cofactor metabolism. Here, we perform cofactor engineering through the decompartmentalization of mitochondrial metabolism to improve succinic acid (SA) production in Issatchenkia orientalis. We localize the reducing equivalents of mitochondrial NADH to the cytosol through cytosolic expression of its pyruvate dehydrogenase (PDH) complex and couple a reductive tricarboxylic acid pathway with a glyoxylate shunt, partially bypassing an NADH-dependent malate dehydrogenase to conserve NADH. Cytosolic SA production reaches a titer of 104 g/L and a yield of 0.85 g/g glucose, surpassing the yield of 0.66 g/g glucose constrained by cytosolic NADH availability. Additionally, expressing cytosolic PDH, we expand our I. orientalis platform to enhance acetyl-CoA-derived citramalic acid and triacetic acid lactone production by 1.22- and 4.35-fold, respectively. Our work establishes I. orientalis as a versatile platform to produce markedly reduced and acetyl-CoA-derived chemicals.
keywords:
bioproducts; metabolic engineering
published:
2022-04-19
Saleh, Ehsan; Ghaffari, Saba; Forsyth, David; Yu-Xiong, Wang
(2022)
This data repository includes the features and the trained backbone parameters used in the ICLR 2022 Paper "On the Importance of Firth Bias Reduction in Few-Shot Classification".
The code accompanying this data is open-source and available at https://github.com/ehsansaleh/firth_bias_reduction
The code and the data have three modules:
1. The "code_firth" module (10 files) relates to the basic ResNet backbones and logistic classifiers (e.g., Figures 2 and 3 in the main paper).
2. The "code_s2m2rf" module (2 files) relates to the S2M2R feature backbones and cosine classifiers (e.g., Figure 4 in the main paper).
3. The "code_dcf" module (3 files) relates to the few-shot Distribution Calibration (DC) method (e.g., Table 1 in the main paper).
The relevant files for each module have the module name as a prefix in their name.
1. For instance, the "code_dcf_features.tar" file should be placed at the "features" directory of the "code_dcf" module.
2. As another example, "code_firth_features_cifarfs_novel.tar" should be placed in the "features" directory of the "code_firth" module, and it includes the features extracted from the novel split of mini-ImageNet dataset.
Each tar-ball should be extracted in its relevant directory, and the md5 check-sums of the extracted files are also provided in the open-source code repository for verification.
Please note that the actual datasets of images are not included here (since we do not own those datasets). However, helper scripts for automatically downloading the original datasets are also provided in the every module and sub-directory of the GitHub code repository.
keywords:
Computer Vision; Few-Shot Classification; Few-Shot Learning; Firth Bias Reduction
published:
2018-08-06
Hoang, Linh; Cao, Linh ; Guan, Yingjun; Cheng, Yi-Yun; Schneider, Jodi
(2018)
This annotation study compared RobotReviewer's data extraction to that of three novice data extractors, using six included articles synthesized in one Cochrane review: Bailey E, Worthington HV, van Wijk A, Yates JM, Coulthard P, Afzal Z. Ibuprofen and/or paracetamol (acetaminophen) for pain relief after surgical removal of lower wisdom teeth. Cochrane Database Syst Rev. 2013; CD004624; doi:10.1002/14651858.CD004624.pub2 The goal was to assess the relative advantage of RobotReviewer's data extraction with respect to quality.
keywords:
RobotReviewer; annotation; information extraction; data extraction; systematic review automation; systematic reviewing;
published:
2019-03-06
Anderson, Nicholas L.; Harmon-Threatt, Alexandra N.
(2019)
Chronic contact exposure to realistic soil concentrations (0, 7.5, 15, and 100 ppb) of the neonicotinoid pesticide imidacloprid had species- and sex-specific effects on bee adult longevity, immature development speed, and mass. This dataset contains a life table tracking the development, mass, and deaths of a single cohort of Osmia lignaria and Megachile rotundata over the course of two summers. Other data files include files created for multi-event survival analysis to analyze the effect on development speed. Detected effects included: decreased adult longevity for female O. lignaria at the highest concentration, a trend for a hormetic effect on female M. rotundata development speed and mass (longest development time and greatest mass in the 15 ppb treatment), and decreased adult longevity and increased development speed at high imidacloprid concentrations as well as a hormetic effect on mass (lowest in the 15 ppb treatment treatment) on male M. rotundata.
keywords:
neonicotinoid; imidacloprid; bee; habitat restoration;
published:
2021-02-24
Bieri, Carolina A.; Dominguez, Francina
(2021)
This dataset contains model output from the Community Earth System Model, Version 2 (CESM2; Danabasoglu et al. 2020). These data were used for analysis in Impacts of Large-Scale Soil Moisture Anomalies in Southeastern South America, published in the Journal of Hydrometeorology (DOI: 10.1175/JHM-D-20-0116.1). See this publication for details of the model simulations that created these data.
Four NetCDF (.nc) files are included in this dataset. Two files correspond to the control simulation (FHIST_SP_control) and two files correspond to a simulation with a dry soil moisture anomaly imposed in southeastern South America (FHIST_SP_dry; see the publication mentioned in the preceding paragraph for details on the spatial extent of the imposed anomaly). For each simulation, one file corresponds to output from the atmospheric model (file names with "cam") of CESM2 and the other to the land model (file names with "clm2"). These files are raw CESM output concatenated into a single file for each simulation.
All files include data from 1979-01-02 to 2003-12-31 at a daily resolution. The spatial resolution of all files is about 1 degree longitude x 1 degree latitude. Variables included in these files are listed or linked below.
Variables in atmosphere model output:
Vertical velocity (omega)
Convective precipitation
Large-scale precipitation
Surface pressure
Specific humidity
Temperature (atmospheric profile)
Reference temperature (temp. at reference height, 2 meters in this case)
Zonal wind
Meridional wind
Geopotential height
Variables in land model output:
See https://www.cesm.ucar.edu/models/cesm1.2/clm/models/lnd/clm/doc/UsersGuide/history_fields_table_40.xhtml
Note that not all of the variables listed at the above link are included in the land model output files in this dataset.
This material is based upon work supported by the National Science Foundation under Grant No. 1454089.
We acknowledge high-performance computing support from Cheyenne (doi:10.5065/D6RX99HX) provided by NCAR's Computational and Information Systems Laboratory, sponsored by the National Science Foundation. The CESM project is supported primarily by the National Science Foundation. We thank all the scientists, software engineers, and administrators who contributed to the development of CESM2.
References
Danabasoglu, G., and Coauthors, 2020: The Community Earth System Model Version 2 (CESM2). Journal of Advances in Modeling Earth Systems, 12, e2019MS001916, https://doi.org/10.1029/2019MS001916.
keywords:
Climate modeling; atmospheric science; hydrometeorology; hydroclimatology; soil moisture; land-atmosphere interactions