Illinois Data Bank Dataset Search Results
Results
published:
2020-01-20
Zhang, Jun; Wuebbles, Donald; Kinnison, Douglas; Saiz López, Alfonso
(2020)
This datasets provide basis of our analysis in the paper - Revising the Ozone Depletion Potentials for Short-Lived Chemicals such as CF3I and CH3I. All datasets here are from the model output (CAM4-chem). All the simulations (background and perturbation) were run to steady-state and only the last year outputs used in analysis are archived here.
keywords:
Illinois Data Bank; NetCDF; Ozone Depletion Potential; CF3I and CH3I
published:
2020-03-03
Schneider, Jodi; Ye, Di
(2020)
This second version (V2) provides additional data cleaning compared to V1, additional data collection (mainly to include data from 2019), and more metadata for nodes. Please see NETWORKv2README.txt for more detail.
keywords:
citations; retraction; network analysis; Web of Science; Google Scholar; indirect citation
published:
2020-11-05
Miller, Andrew; Raudabaugh, Daniel
(2020)
This version 2 dataset contains 34 files in total with one (1) additional file, called "Culture-dependent Isolate table with taxonomic determination and sequence data.csv". The remaining files (33) are identical to version 1. The following is the information about the new file and its variables:
<b>Culture-dependent Isolate table with taxonomic determination and sequence data.csv</b>: Culture table with assigned taxonomy from NCBI. Single direction sequence for each isolate is include if one could be obtained. Sequence is derived from ITS1F-ITS4 PCR amplicons, with Sanger sequencing in one direction using ITS5. The files contains 20 variables with explanation as below:
IsolateNumber : unique number identify each isolate cultured
Time: season in which the sample was collected
Location: the specific name of the location
Habitat: type of habitat : either stream or peatland
State: state in the USA in which the specific location is located
Incubation_pH ID: pH of the medium during isolation of fungal cultures
Genus: phylogenetic genus of the fungal isolates (determined by sequence similarity)
Sequence_quality: base call quality of the entire sequence used for blast analysis, if known
%_coverage: sequence coverage reported from GenBank
%_ID: sequence similarity reported from GenBank
Life_style : ecological life style if known
Phylum: phylogenetic phylum as indicated by Index Fungorum
Subphylum: phylogenetic subphylum as indicated by Index Fungorum
Class: phylogenetic class as indicated by Index Fungorum
Subclass: phylogenetic subclass as indicated by Index Fungorum
Order: phylogenetic order as indicated by Index Fungorum
Family: phylogenetic Family as indicated by Index Fungorum
ITS5_Sequence: single direction sequence used for sequence similarity match using blastn. Primer ITS5
Fasta: sequence with nomenclature in a fasta format for easy cut and paste into phylogenetic software
Note: blank cells mean no data is available or unknown.
keywords:
ITS1 forward reads; Illumina; peatlands; streams; bogs; fens
published:
2020-02-12
Price, Edward; Spyreas, Greg; Matthews, Jeffrey
(2020)
This is the dataset used in the Landscape Ecology publication of the same name. This dataset consists of the following files:
NWCA_Int_Veg.txt
NWCA_Reg_Veg.txt
NWCA_Site_Attributes.txt
NWCA_Int_Veg.txt is a site and plot by species matrix. Column labeled SITES consists of site IDs. Column labeled Plots consist of Plot ID numbers. All other columns represent species abundances (estimates of percent cover, summed across five plots).
NWCA_Reg_Veg.txt is a site by species matrix of species abundances. Column labeled SITES consist of site IDs. All other columns represent species abundances (estimates of percent cover within individual plots).
NWCA_Site_Attributes.txt is a matrix of site attributes. Column labeled SITES consist of site IDs. Column labeled AA_CENTER_LAT consist of latitudinal coordinates for the Assessment Area center point in decimal degrees. Column labeled AA_CENTER_LONG consist of longitudinal coordinates for the Assessment Area center point in decimal degrees. Column REFPLUS_NWCA represents disturbance gradient classes including MIN (minimally disturbed), L (least disturbed), I (intermediate), M (most disturbed). Column REFPLUS_NWCA2 represents revised disturbance gradient classes based on protocols described in the article. These revised classes were used for analysis. Column labeled STRESS_HEAVYMETAL represents heavy metal stressor classes, used to ascertain which wetlands were missing soil data. Classes in the STRESS_HEAVYMETAL column include Low, Moderate, High, and Missing. Sites with Missing STRESS_HEAVYMETAL classes were removed from analysis.
More information about this dataset: All of the data used in this analysis was gathered from the National Wetlands Condition Assessment. Wetland surveys were conducted from 4/4/2011 to 11/2/2011. The entire National Wetlands Condition Assessment Dataset, which includes 3640 unique taxonomic identities of plants, can be found at: https://www.epa.gov/national-aquatic-resource-surveys/data-national-aquatic-resource-surveys
keywords:
Anthropogenic disturbance; β-Diversity; Biotic homogenization; Phalaris arundinacea; reed canary grass; Wetlands
published:
2019-05-07
Detmer, Thomas; Wahl, David
(2019)
Data set of trophic cascade in mesocosms experiments for zooplankton (biomass and body size) and phytoplankton (chlorophyll a concentration) caused by Bluegill as well as zooplankton production in those same treatment groups. Zooplankton were collected by tube sampler and phytoplankton were collected through grab samples.
keywords:
Trophic cascades; size-selective predation; compensatory mechanisms; biomanipulation; invasive fish; Daphnia; Moina
published:
2019-07-04
Software (Matlab .m files) for the article: Lying in Wait: Modeling the Control of Bacterial Infections via Antibiotic-Induced Proviruses. The files can be used to reproduce the analysis and figures in the article.
keywords:
Matlab codes; antibiotic-induced dynamics
published:
2024-12-05
Salami, Malik Oyewale; McCumber, Corinne
(2024)
This project investigates retraction indexing agreement among data sources: BCI, BIOABS, CCC, Compendex, Crossref, GEOBASE, MEDLINE, PubMed, Retraction Watch, Scopus, and Web of Science Core. Post-retraction citation may be partly due to authors’ and publishers' challenges in systematically identifying retracted publications. To investigate retraction indexing quality, we investigate the agreement in indexing retracted publications between 11 database sources, restricting to their coverage, resulting in a union list of 85,392 unique items. We also discuss common errors in indexing retracted publications. Our results reveal low retraction indexing agreement scores, indicating that databases widely disagree on indexing retracted publications they cover, leading to a lack of consistency in what publications are identified as retracted. Our findings highlight the need for clear and standard practices in the curation and management of retracted publications.
Pipeline code to get the result files can be found in the GitHub repository
https://github.com/infoqualitylab/retraction-indexing-agreement in the ‘src’ file containing iPython notebooks:
The ‘unionlist_completed-ria_2024-07-09.csv’ file has been redacted to remove proprietary data, as noted below in README.txt. Among our sources, data is openly available only for Crossref, PubMed, and Retraction Watch.
FILE FORMATS:
1) unionlist_completed-ria_2024-07-09.csv - UTF-8 CSV file
2) README.txt - text file
keywords:
retraction status; data quality; indexing; retraction indexing; metadata; meta-science; RISRS
published:
2019-05-10
Pradhan, Dikshant; Jensen, Paul
(2019)
Data necessary for production of figures presented in "Efficient enzyme coupling algorithms identify functional pathways in genome-scale metabolic models" by Pradhan et al.
keywords:
Efficient enzyme coupling algorithms identify functional pathways in genome-scale metabolic models;
published:
2012-07-01
Mirarab, Siavash; Ngyuen, Nam-Phuong; Warnow, Tandy
(2012)
This dataset provides the data for Mirarab, Siavash, Nam Nguyen, and Tandy Warnow. "SEPP: SATé-enabled phylogenetic placement." Biocomputing 2012. 2012. 247-258.
published:
2020-04-22
Endres, A. Bryan; Endres, Renata; Krstinić Nižić, Marinela
(2020)
Data on Croatian restaurant allergen disclosures on restaurant websites, on-line menus and social media comments
keywords:
restaurant; allergen; disclosure; tourism
published:
2019-05-31
Krichels, Alexander
(2019)
This dataset includes all data presented in the manuscript entitled: "Dynamic controls on field-scale soil nitrous oxide hot spots and hot moments across a microtopographic gradient"
keywords:
denitrification; depressions; microtopography; nitrous oxide; soil oxygen; soil temperature
published:
2025-01-26
Liu, Shanshan; Vlachokostas, Alex; Kontou, Eleftheria
(2025)
Data and code supporting the paper titled "Leveraging electric vehicles as a resiliency solution for residential backup power during outages" by Shanshan Liu, Alex Vlachokostas, and Eleftheria Kontou. The data and the code enable spatiotemporal analytics and assessment of electric vehicle charging demand, remaining driving range, residential energy use, and vehicle-to-home (V2H) energy system resilience metrics.
keywords:
Electric vehicles; Power outages; Vehicle-to-home energy system; Residential loads; Bidirectional energy exchange
published:
2018-03-01
The data set consists of Illumina sequences derived from 48 sediment samples, collected in 2015 from Lake Michigan and Lake Superior for the purpose of inventorying the fungal diversity in these two lakes. DNA was extracted from ca. 0.5g of sediment using the MoBio PowerSoil DNA isolation kits following the Earth Microbiome protocol. PCR was completed with the fungal primers ITS1F and fITS7 using the Fluidigm Access Array. The resulting amplicons were sequenced using the Illumina Hi-Seq2500 platform with rapid 2 x 250nt paired-end reads. The enclosed data sets contain the forward read files for both primers, both fixed-header index files, and the associated map files needed to be processed in QIIME. In addition, enclosed are two rarefied OTU files used to evaluate fungal diversity. All decimal latitude and decimal longitude coordinates of our collecting sites are also included.
File descriptions:
Great_lakes_Map_coordinates.xlsx = coordinates of sample sites
QIIME Processing ITS1 region: These are the raw files used to process the ITS1 Illumina reads in QIIME. ***only forward reads were processed
GL_ITS1_HW_mapFile_meta.txt = This is the map file used in QIIME.
ITS1F_Miller_Fludigm_I1_fixedheader.fastq = Index file from Illumina. Headers were fixed to match the forward reads (R1) file in order to process in QIIME
ITS1F_Miller_Fludigm_R1.fastq = Forward Illumina reads for the ITS1 region.
QIIME Processing ITS2 region: These are the raw files used to process the ITS2 Illumina reads in QIIME. ***only forward reads were processed
GL_ITS2_HW_mapFile_meta.txt = This is the map file used in QIIME.
ITS7_Miller_Fludigm_I1_Fixedheaders.fastq = Index file from Illumina. Headers were fixed to match the forward reads (R1) file in order to process in QIIME
ITS7_Miller_Fludigm_R1.fastq = Forward Illumina reads for the ITS2 region.
Resulting OTU Table and OTU table with taxonomy
ITS1 Region
wahl_ITS1_R1_otu_table.csv = File contains Representative OTUs based on ITS1 region for all the R1 data and the number of each OTU found in each sample.
wahl_ITS1_R1_otu_table_w_tax.csv = File contains Representative OTUs based on ITS1 region for all the R1 and the number of each OTU found in each sample along with taxonomic determination based on the following database: sh_taxonomy_qiime_ver7_97_s_31.01.2016_dev
ITS2 Region
wahl_ITS2_R1_otu_table.csv = File contains Representative OTUs based on ITS2 region for all the R1 data and the number of each OTU found in each sample.
wahl_ITS2_R1_otu_table_w_tax.csv = File contains Representative OTUs based on ITS2 region for all the R1 data and the number of each OTU found in each sample along with taxonomic determination based on the following database: sh_taxonomy_qiime_ver7_97_s_31.01.2016_dev
Rarified illumina dataset for each ITS Region
ITS1_R1_nosing_rare_5000.csv = Environmental parameters and rarefied OTU dataset for ITS1 region.
ITS2_R1_nosing_rare_5000.csv = Environmental parameters and rarefied OTU dataset for ITS2 region.
Column headings:
#SampleID = code including researcher initials and sequential run number
BarcodeSequence =
LinkerPrimerSequence = two sequences used CTTGGTCATTTAGAGGAAGTAA or GTGARTCATCGAATCTTTG
ReversePrimer = two sequences used GCTGCGTTCTTCATCGATGC or TCCTCCGCTTATTGATATGC
run_prefix = initials of run operator
Sample = location code, see thesis figures 1 and 2 for mapped locations and Great_lakes_Map_coordinates.xlsx for exact coordinates.
DepthGroup = S= shallow (50-100 m), MS=mid-shallow (101-150 m), MD=mid-deep (151-200 m), and D=deep (>200 m)"
Depth_Meters = Depth in meters
Lake = lake name, Michigan or Superior
Nitrogen %
Carbon %
Date = mm/dd/yyyy
pH = acidity, potential of Hydrogen (pH) scale
SampleDescription = Sample or control
X = sequential run number
OTU ID = Operational taxonomic unit ID
keywords:
Illumina; next-generation sequencing; ITS; fungi
published:
2020-04-22
Nest survival and Fledgling production data for Bell's Vireo and Willow Flycatcher nests.
keywords:
Bell's Vireo;Willow Flycatcher;habitat selection;fitness;
published:
2020-02-05
Zahniser, James; Dietrich, Christopher
(2020)
The Delt_Comb.NEX text file contains the original data used in the phylogenetic analyses of Zahniser & Dietrich, 2013 (European Journal of Taxonomy, 45: 1-211). The text file is marked up according to the standard NEXUS format commonly used by various phylogenetic analysis software packages. The file will be parsed automatically by a variety of programs that recognize NEXUS as a standard bioinformatics file format. The first nine lines of the file indicate the file type (Nexus), that 152 taxa were analyzed, that a total of 3971 characters were analyzed, the format of the data, and specification for two symbols used in the dataset. There are four datasets separated into blocks, one each for: 28S rDNA gene, Histone H3 gene, morphology, and insertion/deletion characters scored based on the alignment of the 28S rDNA dataset. Descriptions of the morphological characters and more details on the species and specimens included in the dataset are provided in the publication using this dataset. A text file, Delt_morph_char.txt, is available here that states the morphological characters and characters states that were scored in the Delt_Comb.NEX dataset. The original DNA sequence data are available from NCBI GenBank under the accession numbers indicated in publication. Chromatogram files for each sequencing read are available from the first author upon request.
keywords:
phylogeny; DNA sequence; morphology; parsimony analysis; Insecta; Hemiptera; Cicadellidae; leafhopper; evolution; 28S rDNA; histone H3; bayesian analysis
published:
2019-02-26
Neumann, Elizabeth; Comi, Troy; Rubakhin, Stanislav; Sweedler, Jonathan
(2019)
We have recently created an approach for high throughput single cell measurements using matrix assisted laser desorption / ionization mass spectrometry (MALDI MS) (J Am Soc Mass Spectrom. 2017, 28, 1919-1928. doi: 10.1007/s13361-017-1704-1. Chemphyschem. 2018, 19, 1180-1191. doi: 10.1002/cphc.201701364). While chemical detail is obtained on individual cells, it has not been possible to correlate the chemical information with canonical cell types.
Now we combine high-throughput single cell mass spectrometry with immunocytochemistry to determine lipid profiles of two known cell types, astrocytes and neurons from the rodent brain, with the work appearing as “Lipid heterogeneity between astrocytes and neurons revealed with single cell MALDI MS supervised by immunocytochemical classification” (DOI: 10.1002/anie.201812892).
Here we provide the data collected for this study. The dataset provides the raw data and script files for the rodent cerebral cells described in the manuscript.
keywords:
Single cell analysis; mass spectrometry; astrocyte; neuron; lipid analysis
published:
2019-06-12
Miller, Andrew; Raudabaugh, Daniel
(2019)
The data set contains Supplemental data sets for the Manuscript entitled "Where are they hiding? Testing the body snatchers hypothesis in pyrophilous fungi."
Environmental sampling: Amplification of nuclear DNA regions (ITS1 and ITS2) were completed using the Fluidigm Access Array and the resulting amplicons were sequenced on an Illumina MiSeq v2 platform runs using rapid 2 × 250 nt paired-end reads. Illumina sequencing run amplicons that were size selected into <500nt and >500nt sub-pools, then remixed together <500nt: >500nt by nM concentration in a 1x:3x proportion. All amplification and sequencing steps were performed at the Roy J. Carver Biotechnology Center at the University of Illinois Urbana-Champaign.
ITS1 region primers consisted of ITS1F (5'-CTTGGTCATTTAGAGGAAGTAA-'3) and ITS2 (5'-GCTGCGTTCTTCATCGATGC-'3).
ITS2 region primers consisted of fITS7 (5'-GTGARTCATCGAATCTTTG-'3) and ITS4 (5'-TCCTCCGCTTATTGATATGC-'3).
Supplemental files 1 through 5 contain the raw data files.
Supplemental 1 is the ITS1 Illumina MiSeq forward reads and Supplemental 2 is the corresponding index files.
Supplemental 3 is the ITS2 Illumina MiSeq forward reads and Supplemental 4 is the corresponding index files.
Supplemental 5 is the map file needed to process the forward reads and index files in QIIME.
Supplemental 6 and 7 contain the resulting QIIME 1.9.1. OTU tables along with UNITE, NCBI, and CONSTAX taxonomic assignments in addition to the representative OTU sequence.
Numeric samples within the OTU tables correspond to the following:
1 Brachythecium sp.
2 Usnea cornuta
3 Dicranum sp.
4 Leucodon julaceus
5 Lobaria quercizans
6 Rhizomnium sp.
7 Dicranum sp.
8 Thuidium delicatulum
9 Myelochroa aurulenta
10 Atrichum angustatum
11 Dicranum sp.
12 Hypnum sp.
13 Atrichum angustatum
14 Hypnum sp.
15 Thuidium delicatulum
16 Leucobryum sp.
17 Polytrichum commune
18 Atrichum angustatum
19 Atrichum angustatum
20 Atrichum crispulum
21 Bryaceae
22 Leucobryum sp.
23 Conocephalum conicum
24 Climacium americanum
25 Atrichum angustatum
26 Huperzia serrata
27 Polytrichum commune
28 Diphasiastrum sp.
29 Anomodon attenuatus
30 Bryoandersonia sp.
31 Polytrichum commune
32 Thuidium delicatulum
33 Brachythecium sp.
34 Leucobryum glaucum
35 Bryoandersonia sp.
36 Anomodon attenuatus
37 Pohlia sp.
38 Cinclidium sp.
39 Hylocomium splendens
40 Polytrichum commune
41 negative control
42 Soil
43 Soil
44 Soil
45 Soil
46 Soil
47 Soil
If a sample number is not present within the OTU table; either no sequences were obtained or no sequences passed the quality filtering step in QIIME.
Supplemental 8 contains the Summary of unique species per location.
published:
2020-06-03
Zachwieja, Alexandra
(2020)
This dataset provides files for use in analysis of human land preference across Australasia, and in a localized analysis of land preference in Laos and Vietnam. All files can be imported into ArcGIS for visualization, and re-analyzed using the open source Maxent species distribution modeling program. CSV files contain known human presence sites for model validation. ASC files contain geographically coded environmental data for mean annual temperature and mean annual precipitation during the Last Glacial Maximum, as well as downward slope data. All ASC files are in the WGS 1984 Mercator map projection for visualization in ArcGIS and can be opened as text files in text editors supporting large file sizes.
keywords:
human dispersal; ecological niche modeling; Australasia; Late Pleistocene; land preference
published:
2023-09-01
Chakraborty, Sulagna; Steckler, Teresa; Gronemeyer, Peg; Mateus-Pinilla, Nohra; Smith, Rebecca
(2023)
An online and paper knowledge, attitudes, and practices survey on ticks and tick-borne diseases (TBD) was distributed to farmers in Illinois during summer 2020 to spring 2022 (paper version titled Final Draft Farmer KAP_v.SoftCopy_Revised.docx). These are the raw data associated with that survey and the survey questions used (FarmerTickKAPdata.csv, data dictionary in Data Description.docx). We have added calculated values (columns 286 to end, code for calculation in FarmerKAPvariableCalculation.R), including: the tick knowledge score, TBD knowledge score, and total knowledge score, which are the sum of the total number of correct answers in each category, and score percent, which are the proportion of correct answers in each category.
keywords:
ticks; survey; tick-borne disease; farmer
published:
2019-07-29
Christensen, Sarah; Molloy, Erin K.; Vachaspati, Pranjal; Warnow, Tandy
(2019)
Datasets used in the study, "TRACTION: Fast non-parametric improvement of estimated gene trees," accepted at the Workshop on Algorithms in Bioinformatics (WABI) 2019.
keywords:
Gene tree correction; horizontal gene transfer; incomplete lineage sorting
published:
2021-06-17
Dominguez, Francina; Yang, Zhao
(2021)
Model output dataset (6-hourly) from the Weather Research and Forecasting (WRF) model simulations over South America with the added capability of water vapor tracers to track the moisture that originates over the Amazon and the La Plata river basins. The simulations were performed for the period 2003-2013 at 20-km horizontal resolution fully coupled with the Noah-MP land surface model. Limited number of original output variables sufficient for reproducing the analyses in papers that cite this dataset are included here. The attached wrfout_southamerica_readme.txt contains detailed information about the file format and variables. For the complete model dataset, contact francina@illinois.edu.
keywords:
WRF; Amazon; La Plata; South America; Numerical tracers
published:
2019-07-08
Kehoe, Adam K.; Torvik, Vetle I.
(2019)
# Overview
These datasets were created in conjunction with the dissertation "Predicting Controlled Vocabulary Based on Text and Citations: Case Studies in Medical Subject Headings in MEDLINE and Patents," by Adam Kehoe.
The datasets consist of the following:
* twin_not_abstract_matched_complete.tsv: a tab-delimited file consisting of pairs of MEDLINE articles with identical titles, authors and years of publication. This file contains the PMIDs of the duplicate publications, as well as their medical subject headings (MeSH) and three measures of their indexing consistency.
* twin_abstract_matched_complete.tsv: the same as above, except that the MEDLINE articles also have matching abstracts.
* mesh_training_data.csv: a comma-separated file containing the training data for the model discussed in the dissertation.
* mesh_scores.tsv: a tab-delimited file containing a pairwise similarity score based on word embeddings, and MeSH hierarchy relationship.
## Duplicate MEDLINE Publications
Both the twin_not_abstract_matched_complete.tsv and twin_abstract_matched_complete.tsv have the same structure. They have the following columns:
1. pmid_one: the PubMed unique identifier of the first paper
2. pmid_two: the PubMed unique identifier of the second paper
3. mesh_one: A list of medical subject headings (MeSH) from the first paper, delimited by the "|" character
4. mesh_two: a list of medical subject headings from the second paper, delimited by the "|" character
5. hoopers_consistency: The calculation of Hooper's consistency between the MeSH of the first and second paper
6. nonhierarchicalfree: a word embedding based consistency score described in the dissertation
7. hierarchicalfree: a word embedding based consistency score additionally limited by the MeSH hierarchy, described in the dissertation.
## MeSH Training Data
The mesh_training_data.csv file contains the training data for the model discussed in the dissertation. It has the following columns:
1. pmid: the PubMed unique identifier of the paper
2. term: a candidate MeSH term
3. cit_count: the log of the frequency of the term in the citation candidate set
4. total_cit: the log of the total number the paper's citations
5. citr_count: the log of the frequency of the term in the citations of the paper's citations
6. total_citofcit: the log of the total number of the citations of the paper's citations
7. absim_count: the log of the frequency of the term in the AbSim candidate set
8. total_absim_count: the log of the total number of AbSim records for the paper
9. absimr_count: the log of the frequency of the term in the citations of the AbSim records
10. total_absimr_count: the log of the total number of citations of the AbSim record
11. log_medline_frequency: the log of the frequency of the candidate term in MEDLINE.
12. relevance: a binary indicator (True/False) if the candidate term was assigned to the target paper
## Cosine Similarity
The mesh_scores.tsv file contains a pairwise list of all MeSH terms including their cosine similarity based on the word embedding described in the dissertation. Because the MeSH hierarchy is also used in many of the evaluation measures, the relationship of the term pair is also included. It has the following columns:
1. mesh_one: a string of the first MeSH heading.
2. mesh_two: a string of the second MeSH heading.
3. cosine_similarity: the cosine similarity between the terms
4. relationship_type: a string identifying the relationship type, consisting of none, parent/child, sibling, ancestor and direct (terms are identical, i.e. a direct hierarchy match).
The mesh_model.bin file contains a binary word2vec C format file containing the MeSH term embeddings. It was generated using version 3.7.2 of the Python gensim library (https://radimrehurek.com/gensim/).
For an example of how to load the model file, see https://radimrehurek.com/gensim/models/word2vec.html#usage-examples, specifically the directions for loading the "word2vec C format."
keywords:
MEDLINE;MeSH;Medical Subject Headings;Indexing
published:
2024-07-08
Chong, Jer Pin; Minnaert-Grote, Jamie; Zaya, David N.; Ashley, Mary V.; Coons, Janice; Ramp Neal, Jennifer M.; Molano-Flores, Brenda
(2024)
A population genetics study was conducted on three plant taxa in the genus Physaria that are found on the Kaibab Plateau (Arizona, USA). Physaria kingii subsp. kaibabensis is endemic to the Kaibab Plateau, and is of conservation concern because of its rarity, limited range, and potential threats to its long-term persistence. Additionally, the taxon is a candidate for federal protection under the Endangered Species Act. It was not clear how genetically isolated P. k. subsp. kaibabensis was from Physaria kingii subsp. latifolia, which is a widespread subspecies found throughout the southwestern USA, including on the Kaibab Plateau. Additionally, other authors have suggested that P. k. subsp. kaibabensis may hybridize with Physaria arizonica, a different species that is also widespread and found on and off the Kaibab Plateau. We conducted a population genetics study of all three groups to better determine the conservation status of P. k. subsp. kaibabensis. Genetic data are in the form of nuclear DNA microsatellites for 13 loci (all apparently diploid). Additionally, we have included location information for the collection sites. We collected tissue samples from on and off the Kaibab Plateau. The overall findings are shared in a manuscript being submitted for peer-review.
keywords:
Physaria kingii; Kaibab Plateau; endemism; conservation genetics; rare species biology
published:
2023-07-10
Harmon-Threatt, Alexandra N.; Anderson, Nicholas L.
(2023)
Bee movement between habitat patches in a naturally fragmented ecosystem depended on species, patch, and matrix variables. Using a mark-recapture methodology in the naturally fragmented Ozark glade ecosystem, we assessed the importance of bee size, nesting biology, the distance between patches (e.g., isolation), and nesting and floral resources in habitat patches and the surrounding matrix on bee movement.
This dataset includes seven data files, three R code files, and a QGIS tool. Three of the data files include information collected at the study sites with regard to bees and matrix and patch characteristics. The other four data files are spatial files used to quantify the characteristics of the forest canopy between the study sites and the edge-to-edge distances between the study sites. R code in the R Markdown file recreates the analysis and data presentation for the associated publication. R script files contain processes for calculating some of the explanatory variables used in the analysis. The QGIS tool can be used as the first step to obtaining average values from a raster file where the cells are large relative to the areas of interest (AOI) that you would like to characterize. The second step is contained in one of the aforementioned R scripts.
Detected effects included: Larger bees were more likely to move between patches. Bee movement was less likely as the distance between patches increased. However, relatively short distances (~50 m) inhibited movement more than our a priori expectations. Bees were unlikely to move away from home patches with abundant and diverse floral and below-ground nesting resources. When home patches were less resource-rich, bee movement depended on the characteristics of the away patch or the matrix. In these cases, bees were more likely to move to away patches with greater below-ground nesting and floral resources. Matrix habitats with more available floral and below-ground nesting resources appear to impede movement to neighboring patches, potentially because they already provide supplemental resources for bees.
keywords:
habitat fragmentation; bees; movement; mark-recapture; nesting resources; floral resources; isolation
published:
2024-10-12
Langeslay, Blake; Juarez, Gabriel
(2024)
Simulation data used to generate plots in the associated paper ("Strain rate controls alignment in growing bacterial monolayers").