Illinois Data Bank Dataset Search Results
Results
published:
2021-02-28
Ghosh, Sudipta; Riemer, Nicole; Giuliani, Graziano; Giorgi , Filippo; Ganguly, Dilip; Dey, Sagnik
(2021)
This dataset contains the RegCM4 simulations used in the article " Implementation of dynamic ageing of carbonaceous aerosols in regional climate model RegCM". This dataset was used to investigate the impact of a new aging parameterisation scheme implemented in a regional climate model RegCM4. The dataset contains two sets of simulations: Expt_fix and Expt_dyn. It consists of the seasonal mean and daily mean values of the variables that were used to create the visualizations of this study. The Expt_fix and Expt_dyn dataset contain 34 and 38 NetCDF files, respectively. The CERES_vs_2expts_new.mat file is the comparison between CERES shortwave downward flux at the surface and same model outputs from two experiments for clear sky and all sky conditions.
--------------------------------------------------
The following information about the dataset was generated on 2021-01-08 by SUDIPTA GHOSH
<b>GENERAL INFORMATION</b>
<i>1. Date of data collection (single date, range, approximate date):</i> 2019-01-01 to 2019-12-31
<i>2. Geographic location of data collection:</i> Urbana-Champaign,Illinois, USA
<i>3. Information about funding sources that supported the collection of the data:</i> This work is supported by the MoEFCC under the NCAP-COALESCE project [Grant No. 14/10/2014-CC]. The first author acknowledges DST-INSPIRE fellowship [IF150055] and Fulbright-Kalam Climate Doctoral fellowship. N. R. acknowledges funding from NSF AGS-1254428 and DOE grant DE-SC0019192. Department of Science and Technology, Funds for Improvement of Science and Technology infrastructure in universities and higher educational institutions (DST-FIST) grant (SR/FST/ESII-016/2014) are acknowledged for the computing support.
<b>DATA & FILE OVERVIEW</b>
<i>1. File List:</i> Expt_fix and Expt_dyn datasets contain the analysed seasonal means and daily means of the variables that have been used to create the visualizations of this study. Each of the Expt_fix and Expt_dyn datasets contains 34 and 38 NetCDF files, respectively.
<i>2. Relationship between files, if important:</i> NA
<i>3. Additional related data collected that was not included in the current data package:</i> No
<b>METHODOLOGICAL INFORMATION</b>
<i>1. Description of methods used for collection/generation of data: </i>
The model RegCM4 code is freely available online from <a href="http://gforge.ictp.it/gf/project/regcm/">http://gforge.ictp.it/gf/project/regcm/</a>.
The anthropogenic aerosol emissions considered for the simulations are taken from IIASA inventory. The data used can be easily accessed online <a href="http://clima-dods.ictp.it/regcm4/">http://clima-dods.ictp.it/regcm4/</a> website.
TRMM observed precipitation data can be assessed from <a href="https://giovanni.gsfc.nasa.gov/giovanni/">https://giovanni.gsfc.nasa.gov/giovanni/</a> website.
CRU temperature data is available at <a href="https://crudata.uea.ac.uk/cru/data/hrg/">https://crudata.uea.ac.uk/cru/data/hrg/</a>.
CERES satellite surface shortwave downward fluxes are available at <a href="https://ceres.larc.nasa.gov/data/">https://ceres.larc.nasa.gov/data/</a> website.
Input files for the RegCM4 model are archived in <a href="http://clima-dods.ictp.it/regcm4/">http://clima-dods.ictp.it/regcm4/</a> website.
This dataset contains the RegCM4 simulations used in the article " Implementation of dynamic ageing of carbonaceous aerosols in regional climate model RegCM ". Two sets of simulations: Expt_fix and Expt_dyn consists of the output data . This dataset only contains the analysed seasonal mean and daily mean of the variables that have been used to create the visualizations of this study. Each of Expt_fix and Expt_dyn contains 34 and 38 NetCDF files respectively. This dataset was used to investigate the impact of a new aging parameterisation scheme implemented in a regional climate model RegCM4.
<i>2. Methods for processing the data:</i> Seasonal Mean and daily average values were extracted from 6-hourly model output.
<i>3. Instrument- or software-specific information needed to interpret the data:</i> CDO-1.7.1, Grads-2.0.a9, Matlab2016b
<i>4. Standards and calibration information, if appropriate:</i> NA
<i>5. Environmental/experimental conditions:</i> NA
<i>6. Describe any quality-assurance procedures performed on the data:</i> NA
<i>7. People involved with sample collection, processing, analysis and/or submission:</i> Sudipta Ghosh, Nicole Riemer, Graziano Giuliani, Filippo Giorgi, Dilip Ganguly, Sagnik Dey
<b>DATA-SPECIFIC INFORMATION FOR: Expt_fix_data.tar.gz</b>
<i>1. Number of variables:</i> 29
<i>2. Number of cases/rows:</i> NA
<i>3. Variable List:</i> Mass concentration (Kg m-3) of BC, BC_HB, BC_HL, OC, OC_HB, OC_HL; Columnar burden (mg m-2)] of BC, BC_HL, BC_HB, OC; Dry deposition flux (mg m-2 day-1) of BC_HB, BC_HL, OC_HB, OC_HL; Wet deposition flux due washout (mg m-2 day-1) of BC_HB, BC_HL, OC_HB, OC_HL; Wet deposition flux due to rainout (mg m-2 day-1) of BC_HB, BC_HL OC_HB, OC_HL; AOD (unit less), precipitation (Kg m-2 s-1), temperature (K) , v-wind (m s-1), u-wind (m s-1), Surface shortwave downward flux (W m-2), Shortwave radiative forcing at the surface and top of atmosphere (W m-2)
<b>DATA-SPECIFIC INFORMATION FOR: Expt_dyn_data.tar.gz</b>
<i>1. Number of variables:</i> 30
<i>2. Number of cases/rows:</i> NA
<i>3. Variable List:</i> Mass concentration (Kg m-3) of BC, BC_HB, BC_HL, OC, OC_HB, OC_HL; Columnar burden (mg m-2)] of BC, BC_HL, BC_HB, OC; Dry deposition flux (mg m-2 day-1) of BC_HB, BC_HL OC_HB, OC_HL; Wet deposition flux due washout (mg m-2 day-1) of BC_HB, BC_HL OC_HB, OC_HL; Wet deposition flux due to rainout (mg m-2 day-1) of BC_HB, BC_HL OC_HB, OC_HL; AOD (unit less); precipitation (Kg m-2 s-1); temperature (K); v-wind (m s-1); u-wind (m s-1); Surface shortwave downward flux (W m-2); Shortwave radiative forcing at the surface and top of atmosphere (W m-2); ageingscale (s-1)
<b>DATA-SPECIFIC INFORMATION FOR: CERES_vs_2expts_new.mat</b>
<i>1. Number of variables:</i> 12
<i>2. Number of cases/rows:</i> NA
<i>3. Variable List:</i> Surface shortwave downward flux for clear sky (W/m-2) for CERES, Expt_fix, Expt_dyn (for winter JF and monsoon JJAS seasons); Surface shortwave downward flux for all sky conditions (W/m-2) for CERES, Expt_fix, Expt_dyn (for winter JF and monsoon JJAS seasons).
<b>NOTE:</b> The following information applies for all three (3) files:
<i> Missing data codes:</i> NA
<i>Specialized formats or other abbreviations used:</i> NA
keywords:
Carbonaceous aerosols; ageing parameterisation scheme; regional climate model; NetCDF
published:
2025-08-21
Viral vectors provide an increasingly versatile platform for transformation-free reagent delivery to plants. RNA viral vectors can be used to induce gene silencing, overexpress proteins, or introduce gene editing reagents; however, they are often constrained by carrying capacity or restricted tropism in germline cells. Site-specific recombinases that catalyze precise genetic rearrangements are powerful tools for genome engineering that vary in size and, potentially, efficacy in plants. In this work, we show that viral vectors based on tobacco rattle virus (TRV) deliver and stably express four recombinases ranging in size from ∼0.6 to ∼1.5 kb and achieve simultaneous marker removal and reporter activation through targeted excision in transgenic Nicotiana benthamiana lines. TRV vectors with Cre, FLP, CinH, and Integrase13 efficiently mediated recombination in infected somatic tissue and led to heritable modifications at high frequency. An excision-activated Ruby reporter enabled simple and high-resolution tracing of infected cell lineages without the need for molecular genotyping. Together, our experiments broaden the scope of viral recombinase delivery and offer insights into infection dynamics that may be useful in developing future viral vectors.
keywords:
gene editing; genome engineering; plant transformation
published:
2025-11-24
Li, Maolin; Harrison, Wesley; Zhang, Zhengyi; Yuan, Yujie; Zhao, Huimin
(2025)
Strategies for achieving asymmetric catalysis with azaarenes have traditionally fallen short of accomplishing remote stereocontrol, which would greatly enhance accessibility to distinct azaarenes with remote chiral centres. The primary obstacle to achieving superior enantioselectivity for remote stereocontrol has been the inherent rigidity of the azaarene ring structure. Here we introduce an ene-reductase system capable of modulating the enantioselectivity of remote carbon-centred radicals on azaarenes through a mechanism of chiral hydrogen atom transfer. This photoenzymatic process effectively directs prochiral radical centres located more than six chemical bonds, or over 6 Å, from the nitrogen atom in azaarenes, thereby enabling the production of a broad array of azaarenes possessing a remote γ-stereocentre. Results from our integrated computational and experimental investigations underscore that the hydrogen bonding and steric effects of key amino acid residues are important for achieving such high stereoselectivities.
keywords:
Conversion;Catalysis
published:
2025-10-01
Schetter, August; Lin, Cheng-Hsien; Zumpf, Colleen; Jang, Chunhwa; Hoffmann Jr., Leo; Rooney, William; Lee, DoKyoung
(2025)
Recently introduced photoperiod-sensitive (PS) biomass sorghum (Sorghum bicolor L. Moench) needs to be investigated for yield potential under different cultivation environments with reasonable nitrogen (N) inputs. The objectives of this study were to (1) evaluate the biomass yield and feedstock quality of four sorghum hybrids with different levels of PS ranging from very PS (VPS) hybrids and to moderate PS (MPS) hybrids, and (2) determine the optimal N inputs (0~168 kg N ha−1) under four environments: combinations of both temperate (Urbana, IL) and subtropical (College Station, TX) regions during 2018 and 2019. Compared to TX, the PS sorghums in central IL showed higher yield potential and steady feedstock production with an extended day length and with less precipitation variability, especially for the VPS hybrids. The mean dry matter (DM) yields of VPS hybrids were 20.5 Mg DM ha−1 and 17.7 Mg DM ha−1 in IL and TX, respectively. The highest N use efficiency occurred at a low N rate of 56 kg N ha−1 by improving approximately 33 kg DM ha−1 per 1.0 kg N ha−1 input. Approximately 70% of the PS sorghum biomass can be utilized for biofuel production, consisting of 58-65% of the cell-wall components and 4-11% of the soluble sugar. This study demonstrated that the rainfed temperate area (e.g., IL) has a great potential for the sustainable cultivation of PS energy sorghum due to their observed high yield potential, stable production, and low N requirements.
keywords:
Sustainability;Biomass Analytics;Field Data
published:
2023-07-01
Tonks, Adam; Hwang, Jeongwoo
(2023)
This is the data used in the paper "Assessment of spatiotemporal flood risk due to compound precipitation extremes across the contiguous United States".
Code from the Github repository https://github.com/adtonks/precip_extremes can be used with the data here to reproduce the paper's results. v1.0.0 of the code is also archived at https://doi.org/10.5281/zenodo.8104252
This dataset is derived from NOAA-CIRES-DOE 20th Century Reanalysis V3. The NOAA-CIRES-DOE Twentieth Century Reanalysis Project version 3 used resources of the National Energy Research Scientific Computing Center managed by Lawrence Berkeley National Laboratory which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC02-05CH11231 and used resources of NOAA's Remotely Deployed High Performance Computing Systems.
keywords:
spatiotemporal; CONUS; United States; precipitation; extremes; flooding
published:
2025-05-27
Rani, Sonia; Cao, Xi; Baptista, Alejandro E.; Hoffmann, Axel; Pfaff, Wolfgang
(2025)
This dataset contains all raw and processed data used to generate the figures in the main text and supplementary material of the paper "High dynamic-range quantum sensing of magnons and their dynamics using a superconducting qubit." The data can be used to reproduce the plots and validate the analysis. Accompanying Jupyter notebooks provide step-by-step analysis pipelines for figure generation. The dataset also includes drawings for the mechanical samples used to perform the experiment. In addition, the dataset provides ANSYS HFSS electromagnetic simulation files used to design and analyze the resonator structures and estimate field distributions.
keywords:
superconducting qubit; magnon sensing; hybrid quantum systems; spin-photon coupling; magnon decay; cavity QED
published:
2025-12-14
Fraterrigo, Jennifer; Chen, Weile
(2025)
This dataset contains information about absorptive roots from 170 plots along a latitudinal and temperature gradient in northern Alaska, including tussock sedges and deciduous alder, birch, and willow shrubs. This dataset accompanies the paper "Impacts of Arctic Shrubs on Root Traits and Belowground Nutrient Cycles Across a Northern Alaskan Climate Gradient," which was published in Frontiers in Plant Sciences.
<b>*Note:</b> in the "patch coordinates" tab, the same coordinates/elevation ("Long", "Lat", and "Elev (m)") apply to all patches that share a number. For ex: "Patch" W1, B1, and G1 share the same "Long", "Lat", and "Elev (m)" values as "Patch" A1.
keywords:
absorptive root traits; shrub expansion; Arctic; Alaskan tundra
published:
2025-07-31
Gibson, Jared; Jiang, Zhanzhi; Kou, Angela
(2025)
This repository includes data files and analysis and plotting codes for reproducing the figures in the paper "A scanning resonator for probing quantum coherent devices" arXiv:2506.22620
published:
2025-08-04
Hartman, Theodore; Studt, Jacob; VanLoocke, Andy; McDaniel, Marshall; Howe, Adina; Masters, Michael D. ; Mitchell, Corey; DeLucia, Evan H.; Heaton, Emily
(2025)
This dataset contains the data used for the publication “Aboveground rather than belowground productivity drives variability in Miscanthus x giganteus net primary productivity”. This dataset contains Miscanthus x giganteus biomass, carbon, and nitrogen tissue data for aboveground and belowground plant parts collected in 2021 for three different sites in Iowa with three different nitrogen application rates. Data at the Iowa sites were collected via biometric hand harvesting, belowground excavations, and soil coring both in-clump and beside-clump. Data were collected at two collection timepoints to calculate the contributions of belowground parts to Miscanthus x giganteus net primary productivity. This dataset also includes Miscanthus x giganteus and Switchgrass soil coring and excavation data collected in 2012 at the University of Illinois Urbana Champaign Energy Farm.
keywords:
Miscanthus; Net Primary Productivity; Excavation; Nitrogen fertilization; Translocation; Belowground Biomass; Carbon
published:
2025-09-06
4D-STEM datasets for solution-treated (CrCoNi)93Al4Ti2Nb MEA in [111], [112], and [114] zone. Data used for Ultramicroscopy article "Differentiating electron diffuse scattering via 4D-STEM spatial fluctuation and correlation analysis in complex FCC alloys". Experiment details can be found in the paper. Data-specific details are listed in the Readme file.
keywords:
4D-STEM; MEA; Electron Diffuse-Scattering; FluCor
published:
2025-09-24
Lee, Jaewon; Kwak, Suryang; Liu, Jing-Jing; Yu, Sora; Yun, Eun Ju; Kim, Dong Hyun; Liu, Cassie; Kim, Kyoung Heon; Jin, Yong-Su
(2025)
2′-Fucosyllactose (2′-FL), a human milk oligosaccharide with confirmed benefits for infant health, is a promising infant formula ingredient. Although Escherichia coli, Saccharomyces cerevisiae, Corynebacterium glutamicum, and Bacillus subtilis have been engineered to produce 2′-FL, their titers and productivities need be improved for economic production. Glucose along with lactose have been used as substrates for producing 2′-FL, but accumulation of by-products due to overflow metabolism of glucose hampered efficient production of 2′-FL regardless of a host strain. To circumvent this problem, we used xylose, which is the second most abundant sugar in plant cell wall hydrolysates and is metabolized through oxidative metabolism, for the production of 2′-FL by engineered yeast. Specifically, we modified an engineered S. cerevisiae strain capable of assimilating xylose to produce 2′-FL from a mixture of xylose and lactose. First, a lactose transporter (Lac12) from Kluyveromyces lactis was introduced. Second, a heterologous 2′-FL biosynthetic pathway consisting of enzymes Gmd, WcaG, and WbgL from E. coli was introduced. Third, we adjusted expression levels of the heterologous genes to maximize 2′-FL production. The resulting engineered yeast produced 25.5 g/L of 2′-FL with a volumetric productivity of 0.35 g/L∙h in a fed-batch fermentation with lactose and xylose feeding to mitigate the glucose repression. Interestingly, the major location of produced 2′-FL by the engineered yeast can be changed using different culture media. While 72% of the produced 2′-FL was secreted when a complex medium was used, 82% of the produced 2′-FL remained inside the cells when a minimal medium was used. As yeast extract is already used as food and animal feed ingredients, 2′-FL enriched yeast extract can be produced cost-effectively using the 2′-FL-accumulating yeast cells.
keywords:
Conversion;Genome Engineering
published:
2025-08-05
Zhu, Minjiang; Sanders, Derrick M.; Kim, Yun Seong; Shah, Rohan ; Hossain, Mohammad Tanver; Ewoldt, Randy H.; Tawfick, Sameh H.; Geubelle, Philippe H.
(2025)
published:
2019-12-20
Wang, Yu; Burgess, Steven J. ; de Becker, Elsa ; Long, Stephen P.
(2019)
This dynamic photosynthesis model of soybean canopy is developed by Yu Wang (yuwangcn@illinois.edu), IGB, University of Illinois.
If you want to know more details, please check the following publication
Yu Wang, Steven J. Burgess, Elsa de Becker, Stephen P. Long. Photosynthesis in the fleeting shadows: An overlooked opportunity for increasing crop productivity? The Plant Journal.
keywords:
Matlab; Soybean canopy; photosynthesis model
published:
2019-12-12
Kamuda, Mark; Huff, Kathryn
(2019)
This dataset contains gamma-ray spectra templates for a source interdiction and uranium enrichment measurement task. This dataset also contains Keras machine learning models trained using datasets created using these templates.
keywords:
gamma-ray spectroscopy; neural networks; machine learning; isotope identification; uranium enrichment; sodium iodide; NaI(Tl)
published:
2025-07-11
Zhixin, Zhang; Jinho, Lim; Haoyang, Ni; Jian-Min, Zuo; Axel, Hoffmann
(2025)
This dataset includes experimental data supporting the findings in the manuscript "Magnetostriction and Temperature Dependent Gilbert Damping in Boron Doped Fe80Ga20 Thin Films". It contains raw data for X-Ray diffraction, high resolution transmission electron microscopy, magnetic hysteresis loop measurement, magnetostriction measurement, and temperature dependent magnetic damping measurement.
keywords:
magnetostriction; magnetic damping; magnetoelasticity; magnon-phonon coupling
published:
2025-10-10
Clark, Teresa J.; Schwender, Jorg
(2025)
Upregulation of triacylglycerols (TAGs) in vegetative plant tissues such as leaves has the potential to drastically increase the energy density and biomass yield of bioenergy crops. In this context, constraint-based analysis has the promise to improve metabolic engineering strategies. Here we present a core metabolism model for the C4 biomass crop Sorghum bicolor (iTJC1414) along with a minimal model for photosynthetic CO2 assimilation, sucrose and TAG biosynthesis in C3 plants. Extending iTJC1414 to a four-cell diel model we simulate C4 photosynthesis in mature leaves with the principal photo-assimilatory product being replaced by TAG produced at different levels. Independent of specific pathways and per unit carbon assimilated, energy content and biosynthetic demands in reducing equivalents are about 1.3 to 1.4 times higher for TAG than for sucrose. For plant generic pathways, ATP- and NADPH-demands per CO2 assimilated are higher by 1.3- and 1.5-fold, respectively. If the photosynthetic supply in ATP and NADPH in iTJC1414 is adjusted to be balanced for sucrose as the sole photo-assimilatory product, overproduction of TAG is predicted to cause a substantial surplus in photosynthetic ATP. This means that if TAG synthesis was the sole photo-assimilatory process, there could be an energy imbalance that might impede the process. Adjusting iTJC1414 to a photo-assimilatory rate that approximates field conditions, we predict possible daily rates of TAG accumulation, dependent on varying ratios of carbon partitioning between exported assimilates and accumulated oil droplets (TAG, oleosin) and in dependence of activation of futile cycles of TAG synthesis and degradation. We find that, based on the capacity of leaves for photosynthetic synthesis of exported assimilates, mature leaves should be able to reach a 20% level of TAG per dry weight within one month if only 5% of the photosynthetic net assimilation can be allocated into oil droplets. From this we conclude that high TAG levels should be achievable if TAG synthesis is induced only during a final phase of the plant life cycle.
keywords:
Feedstock Production;Modeling
published:
2019-12-03
These are the alignments of transcriptome data used for the analysis of members of Heteroptera. This dataset is analyzed in "Deep instability in the phylogenetic backbone of Heteroptera is only partly overcome by transcriptome-based phylogenomics" published in Insect Systematics and Diversity.
keywords:
Heteroptera; Hemiptera; Phylogenomics; transcriptome
published:
2023-03-27
Littlefield, Alexander; Xie, Dajie; Richards, Corey; Ocier, Christian; Gao, Haibo; Messinger, Jonah; Ju, Lawrence; Gao, Jingxing; Edwards, Lonna; Braun, Paul; Goddard, Lynford
(2023)
This dataset contains the full data used in the paper titled "Enabling High Precision Gradient Index Control in Subsurface Multiphoton Lithography," available at https://doi.org/10.1021/acsphotonics.2c01950 .
The data used for Table 1 can be found in the dataset for the related Figure 8.
Some supplemental figures' data can be found in the main figures data:
Figure S2's data is contained in Figure 6.
Figure S4 and Table S1 data is derived from Figure 6.
Figure S9 is derived from Figure 7.
Figure S10 is contained in Figure 7.
Figure S12 is derived from Figure 6 and the Python code prism-fringe-analysis.
Figures without a data file named after them do not have any data affiliated with them and are purely graphical representations.
published:
2019-05-20
Lao, Yuyang; Schiffer, Peter
(2019)
This is the experimental data of tetris artificial spin ice. The islands are made of Permalloy materials with size of 170 nm by 470 nm by 2.5 nm. The systems are measured at a temperature where the islands are fluctuating around room temperature. The data is recorded as photoemission electron microscopy intensity. More details about the dataset can be found in the file Note.txt and Tetris_data_list.xlsx
Note:
2 files name bl11_teris600_033 and bl11_tetris600_2_135 are not recorded in the excel sheet because they are corrupted during the measurement. Any data that is not recorded in the excel sheet is either corrupted or of low quality.
From files *_028 to *_049, tetris is spelled with “t” while in the raw data folder without “t”. This is a typo. Throughout the dataset, tetris and teris are supposed to have the same meaning.
keywords:
artificial spin ice
published:
2020-10-15
Khanna, Madhu; Wang, Weiwei; Wang, Michael
(2020)
This dataset consists of various input data that are used in the GAMS model. All the data are in the format of .inc which can be read within GAMS or Notepad. Main data sources include: acreage data (acre), crop budget data ($/acre), crop yield data (e.g. bushel/acre), Soil carbon sequestration data (KgCO2/ha/yr). Model details can be found in the "Assessing the Additional Carbon Savings with Biofuel" and GAMS model package.
## File Description
(1) GAMS Model.zip: This includes all the input files and scripts for running the model
(2) Table*.csv: These files include the data from the tables in the manuscript
(3) Figure2_3_4.csv: This contains the data used to create the figures in the manuscript
(4) BaselineResults.csv: This includes a summary of the model results.
(5) SensitivityResults_*.csv: Model results from the various sensitivity analyses performed
(6) LUC_emission.csv: land use change emissions by crop reporting district for changes of pasturelands to annual crops.
keywords:
Biogenic carbon intensity; Corn ethanol; Economic model; Dynamic optimization; Anticipated baseline approach; Life cycle carbon intenisty
published:
2019-08-05
Skinner, Rachel; Dietrich, Christopher; Walden, Kimberly; Gordon, Eric; Sweet, Andrew; Podsiadlowski, Lars; Petersen, Malte; Simon, Chris; Takiya, Daniela; Johnson, Kevin
(2019)
The data in this directory corresponds to:
Skinner, R.K., Dietrich, C.H., Walden, K.K.O., Gordon, E., Sweet, A.D., Podsiadlowski, L., Petersen, M., Simon, C., Takiya, D.M., and Johnson, K.P.
Phylogenomics of Auchenorrhyncha (Insecta: Hemiptera) using Transcriptomes: Examining Controversial Relationships via Degeneracy Coding and Interrogation of Gene Conflict.
Systematic Entomology.
Correspondance should be directed to: Rachel K. Skinner, rskinn2@illinois.edu
If you use these data, please cite our paper in Systematic Entomology.
The following files can be found in this dataset:
Amino_acid_concatenated_alignment.phy: the amino acid alignment used in this analysis in phylip format.
Amino_acid_raxml_partitions.txt (for reference only): the partitions for the amino acid alignment, but a partitioned amino acid analysis was not performed in this study.
Amino_acid_concatenated_tree.newick: the best maximum likelihood tree with bootstrap values in newick format.
ASTRAL_input_gene_trees.tre: the concatenated gene tree input file for ASTRAL
README_pie_charts.md: explains the the scripts and data needed to recreate the pie charts figure from our paper. There is also another
Corresponds to the following files:
ASTRAL_species_tree_EN_only.newick: the species tree with only effective number (EN) annotation
ASTRAL_species_tree_pp1_only.newick: the species tree with only the posterior probability 1 (main topology) annotation
ASTRAL_species_tree_q1_only.newick: the species tree with only the quartet scores for the main topology (q1)
ASTRAL_species_tree_q2_only.newick: the species tree with only the quartet scores for the first alternative topology (q2)
ASTRAL_species_tree_q3_only.newick: the species tree with only the quartet scores for the second alternative topology (q3)
print_node_key_files.py: script needed to create the following files:
node_keys.key: text file with node IDs and topologies
complete_q_scores.key: text file with node IDs multiplied q scores
EN_node_vals.key: text file with node IDs and EN values
create_pie_charts_tree.py: script needed to visualize the tree with pie charts, pp1, and EN values plotted at nodes
ASTRAL_species_tree_full_annotation.newick: the species tree with full annotation from the ASTRAL analysis.
NOTE: It may be more useful to examine individual value files if you want to visualize the tree,
e.g., in figtree, since the full annotations are extensive and can make viewing difficult.
Complete_NT_concatenated_alignment.phy: the nucleotide alignment that includes unmodified third codon positions. The alignment is in phylip format.
Complete_NT_raxml_partitions.txt: the raxml-style partition file of the nucleotide partitions
Complete_NT_concatenated_tree.newick: the best maximum likelihood tree from the concatenated complete analysis NT with bootstrap values in newick format
Complete_NT_partitioned_tree.newick: the best maximum likelihood tree from the partitioned complete NT analysis with bootstrap values in newick format
Degeneracy_coded_nt_concatenated_alignment.phy: the degeneracy coded nucleotide alignment in phylip format
Degeneracy_coded_nt_raxml_partitions.txt: the raxml-style partition file for the degeneracy coded nucleotide alignment
Degeneracy_coded_nt_concatenated_tree.newick: the best maximum likelihood tree from the degeneracy-coded concatenated analysis with bootstrap values in newick format
Degeneracy_coded_nt_partitioned_tree.newick: the best maximum likelihood tree from the degeneracy-coded partitioned analysis with bootstrap values in newick format
count_ingroup_taxa.py: script that counts the number of ingroup and/or outgroup taxa present in an alignment
keywords:
Auchenorrhyncha; Hemiptera; alignment; trees
published:
2025-10-10
Dong, Chang; Shi, Zhuwei; Huang, Lei; Zhao, Huimin; Xu, Zhinan; Lian, Jiazhang
(2025)
Mitochondrion is generally considered as the most promising subcellular organelle for compartmentalization engineering. Much progress has been made in reconstituting whole metabolic pathways in the mitochondria of yeast to harness the precursor pools (i.e., pyruvate and acetyl-CoA), bypass competing pathways, and minimize transportation limitations. However, only a few mitochondrial targeting sequences (MTSs) have been characterized (i.e., MTS of COX4), limiting the application of compartmentalization engineering for multigene biosynthetic pathways in the mitochondria of yeast. In the present study, based on the mitochondrial proteome, a total of 20 MTSs were cloned and the efficiency of these MTSs in targeting heterologous proteins, including the Escherichia coli FabI and enhanced green fluorescence protein (EGFP) into the mitochondria was evaluated by growth complementation and confocal microscopy. After systematic characterization, six of the well-performed MTSs were chosen for the colocalization of complete biosynthetic pathways into the mitochondria. As proof of concept, the full α-santalene biosynthetic pathway consisting of 10 expression cassettes capable of converting acetyl-coA to α-santalene was compartmentalized into the mitochondria, leading to a 3.7-fold improvement in the production of α-santalene. The newly characterized MTSs should contribute to the expanded metabolic engineering and synthetic biology toolbox for yeast mitochondrial compartmentalization engineering.
keywords:
Conversion;Metabolic Engineering
published:
2019-08-29
This is the published ortholog set derived from whole genome data used for the analysis of members of the B. tabaci complex of whiteflies. It includes the concatenated alignment and individual gene alignments used for analyses (Link to publication: https://www.mdpi.com/1424-2818/11/9/151).
published:
2025-10-08
Kim, Sang Yeol; Stessman, Dan J.; Wright, David A.; Spalding, Martin H.; Huber, Steven; Ort, Donald
(2025)
Rubisco activase (Rca) facilitates the release of sugar‐phosphate inhibitors from the active sites of Rubisco and thereby plays a central role in initiating and sustaining Rubisco activation. In Arabidopsis, alternative splicing of a single Rca gene results in two Rca isoforms, Rca‐α and Rca‐β. Redox modulation of Rca‐α regulates the function of Rca‐α and Rca‐β acting together to control Rubisco activation. Although Arabidopsis Rca‐α alone less effectively activates Rubisco in vitro , it is not known how CO2 assimilation and plant growth are impacted. Here, we show that two independent transgenic Arabidopsis lines expressing Rca‐α in the absence of Rca‐β (“Rca‐α only” lines) grew more slowly in various light conditions, especially under low light or fluctuating light intensity, and in a short day photoperiod compared to wildtype. Photosynthetic induction was slower in the Rca‐α only lines, and they maintained a lower rate of CO2 assimilation during both photoperiod types. Our findings suggest Rca oligomers composed of Rca‐α only are less effective in initiating and sustaining the activation of Rubisco than when Rca‐β is also present. Currently there are no examples of any plant species that naturally express Rca‐α only but numerous examples of species expressing Rca‐β only. That Rca‐α exists in most plant species, including many C3 and C4 food and bioenergy crops, implies its presence is adaptive under some circumstances.
keywords:
Feedstock Production;Biomass Analytics;Phenomics
published:
2025-10-24
Maitra, Shraddha; Singh, Vijay
(2025)
Sweet sorghum is typically cultivated for the food and fodder market. Recently, sweet sorghum varieties are being metabolically transitioned to enhance energy density by accumulating oil droplets in their vegetative tissues for bioenergy applications. Owing to the high biomass yield of sorghum, the transgenic lines can compete with oil-seed crops for biodiesel yield per unit area. In the initial phase of transgenic development, a high-throughput phenotyping method can bridge the gap between the production pipeline and analysis to improve the efficiency of the process. To meet the requirement, the present study extends the application of time-domain 1H-NMR spectroscopy for rapid quantification and characterization of the total in-situ lipids of sweet sorghum ‘ramada’ to lay the groundwork for analyzing the upcoming large quantity of transgenic samples. NMR technology has been successfully established for analyzing lipid contents of vegetative tissues of non-transgenic variety. The multiexponential analysis of spin-lattice (T1) relaxation spectra obtained from TD-NMR aided the investigation of the dynamics of the free and bound lipid fraction with plant development. The total lipid concentration of bagasse and leaves of non-transgenic sweet sorghum remained unchanged throughout the plant development. Leaves displayed a higher percentage of bound lipids as compared to bagasse. A significant variation in the lipid concentration of juice was observed at the different growth stages with a maximum lipid accumulation of 1.21 ± 0.04% w/w at the boot stage that decreased with further maturity of the plant.
keywords:
Conversion;Biomass Analytics;Lipidomics;Metabolomics