Datasets for phylogenomics of the leafhopper genus Neoaliturus Distant

Sinaiko, Guy; Cao, Yanghui; Dietrich, Christopher H.

doi:10.13012/B2IDB-8336414_V2

Datasets for phylogenomics of the leafhopper genus Neoaliturus Distant

Cite this dataset:

Sinaiko, Guy; Cao, Yanghui; Dietrich, Christopher H. (2024): Datasets for phylogenomics of the leafhopper genus Neoaliturus Distant. University of Illinois Urbana-Champaign. https://doi.org/10.13012/B2IDB-8336414_V2

Use this persistent URL to link to this dataset:

Metadata


Dataset Description	The following files include specimen information, DNA sequence data, and additional information on the analyses used to reconstruct the phylogeny of the leafhopper genus Neoaliturus as described in the Methods section of the original paper: 1. Taxon_sampling.csv: contains data on the individual specimens from which DNA was extracted, including sample code, taxon name, collection data (locality, date and name of collector) and museum unique identifier. 2. Alignments.zip: a ZIP archive containing 432 separate FASTA files representing the aligned nucleotide sequences of individual gene loci used in the analysis. 3. Concatenated_Matrix.fa: is a FASTA file containing the concatenated individual gene alignments used for the maximum likelihood analysis in IQ-TREE. 4. Genes_and_Loci.rtf: identifies the individual genes and loci used in the analysis. The partition name is the same as the name of the individual alignment file in the zipped Alignments folder. 5. Partitions_best_scheme.nex: is a text file in the standard NEXUS format that indicates the names of the individual data partitions and their locations in the concatenated matrix, and also indicates the substitution model for each partition. 6. (New in this version 2) Scripts & Description.zip includes 8 custom shell or perl scripts used to assemble the DNA sequence data by perform reciprocal blast searches between the reference sequences and assemblies for each sample, extract the best sequences based on the blast searches, screen the hits for each locus and keep only the best result, and generate the nucleotide sequence dataset for the predicted orthologues (see the file description.txt for details). 7. (New in this version 2) Full_genetic_distances_matrix.csv shows the genetic distances between pairs of samples in the datset (proportion of nucleotides that differ between samples).
Subject	Life Sciences
Keywords	leafhopper; phylogeny; anchored-hybrid-enrichment; DNA sequence; insect
License	CC0
Funder	U.S. National Science Foundation (NSF)-Grant:DEB-1639601
Corresponding Creator	Christopher H. Dietrich
Downloaded	707 times

Versions in Illinois Data Bank

Version	DOI	Comment	Publication Date
2	10.13012/B2IDB-8336414_V2	Added 2 new files per journal's suggestion: Scripts_and_description.zip and Full_genetic_distances_matrix.csv.	2024-04-05
1	10.13012/B2IDB-8336414_V1		2024-01-18

Files

Change Log

Contact the Research Data Service for help interpreting this log.

Dataset	update: {"all_globus"=>[nil, true]}	2026-01-16T15:38:52Z
Dataset	update: {"all_medusa"=>[nil, true]}	2026-01-16T15:36:21Z
RelatedMaterial	destroy: {"material_type"=>"Article", "availability"=>nil, "link"=>"https://doi.org/10.1016/j.ympev.2024.108071", "uri"=>nil, "uri_type"=>nil, "citation"=>"Sinaiko, G., Cao, Y., & Dietrich, C. H. (2024). Phylogenomics of the leafhopper genus Neoaliturus Distant, 1918 (Hemiptera: Cicadellidae: Deltocephalinae) reveals genetically divergent lineages in the invasive beet leafhopper. Molecular Phylogenetics and Evolution, 195, 108071.", "dataset_id"=>2675, "selected_type"=>"Article", "datacite_list"=>nil, "note"=>nil, "feature"=>nil}	2025-02-10T22:49:50Z
RelatedMaterial	create: {"material_type"=>"Article", "availability"=>nil, "link"=>"https://doi.org/10.1016/j.ympev.2024.108071", "uri"=>nil, "uri_type"=>nil, "citation"=>"Sinaiko, G., Cao, Y., & Dietrich, C. H. (2024). Phylogenomics of the leafhopper genus Neoaliturus Distant, 1918 (Hemiptera: Cicadellidae: Deltocephalinae) reveals genetically divergent lineages in the invasive beet leafhopper. Molecular Phylogenetics and Evolution, 195, 108071.", "dataset_id"=>2675, "selected_type"=>"Article", "datacite_list"=>nil, "note"=>nil, "feature"=>nil}	2025-02-09T15:33:20Z
Dataset	update: {"publisher"=>["University of Illinois at Urbana-Champaign", "University of Illinois Urbana-Champaign"]}	2025-02-09T15:33:20Z
RelatedMaterial	destroy: {"material_type"=>"Article", "availability"=>nil, "link"=>"", "uri"=>"", "uri_type"=>"", "citation"=>"Sinaiko, G., Cao, Y., & Dietrich, C. H. (in review). Phylogenomics of the leafhopper genus Neoaliturus Distant, 1918 (Hemiptera: Cicadellidae: Deltocephalinae) reveals genetically divergent lineages in the invasive beet leafhopper. Submitted to Molecular Phylogenetics and Evolution ", "dataset_id"=>2675, "selected_type"=>"Article", "datacite_list"=>"", "note"=>"", "feature"=>false}	2025-01-08T23:48:52Z
Dataset	update: {"publication_state"=>["version candidate under curator review", "released"], "release_date"=>[nil, Fri, 05 Apr 2024]}	2024-04-05T21:32:02Z
Dataset	update: {"description"=>["The following files include specimen information, DNA sequence data, and additional information on the analyses used to reconstruct the phylogeny of the leafhopper genus Neoaliturus as described in the Methods section of the original paper:\r\n1.\tTaxon_sampling.csv: contains data on the individual specimens from which DNA was extracted, including sample code, taxon name, collection data (locality, date and name of collector) and museum unique identifier.\r\n2.\tAlignments.zip: a ZIP archive containing 432 separate FASTA files representing the aligned nucleotide sequences of individual gene loci used in the analysis.\r\n3.\tConcatenated_Matrix.fa: is a FASTA file containing the concatenated individual gene alignments used for the maximum likelihood analysis in IQ-TREE.\r\n4.\tGenes_and_Loci.rtf: identifies the individual genes and loci used in the analysis. The partition name is the same as the name of the individual alignment file in the zipped Alignments folder.\r\n5.\tPartitions_best_scheme.nex: is a text file in the standard NEXUS format that indicates the names of the individual data partitions and their locations in the concatenated matrix, and also indicates the substitution model for each partition.\r\n6.\tScripts & Description.zip includes 8 custom shell or perl scripts used to assemble the DNA sequence data by perform reciprocal blast searches between the reference sequences and assemblies for each sample, extract the best sequences based on the blast searches, screen the hits for each locus and keep only the best result, and generate the nucleotide sequence dataset for the predicted orthologues (see the file description.txt for details).\r\n7.\tFull_genetic_distances_matrix.csv shows the genetic distances between pairs of samples in the datset (proportion of nucleotides that differ between samples).", "The following files include specimen information, DNA sequence data, and additional information on the analyses used to reconstruct the phylogeny of the leafhopper genus Neoaliturus as described in the Methods section of the original paper:\r\n1.\tTaxon_sampling.csv: contains data on the individual specimens from which DNA was extracted, including sample code, taxon name, collection data (locality, date and name of collector) and museum unique identifier.\r\n2.\tAlignments.zip: a ZIP archive containing 432 separate FASTA files representing the aligned nucleotide sequences of individual gene loci used in the analysis.\r\n3.\tConcatenated_Matrix.fa: is a FASTA file containing the concatenated individual gene alignments used for the maximum likelihood analysis in IQ-TREE.\r\n4.\tGenes_and_Loci.rtf: identifies the individual genes and loci used in the analysis. The partition name is the same as the name of the individual alignment file in the zipped Alignments folder.\r\n5.\tPartitions_best_scheme.nex: is a text file in the standard NEXUS format that indicates the names of the individual data partitions and their locations in the concatenated matrix, and also indicates the substitution model for each partition.\r\n6.\t(New in this version 2) Scripts & Description.zip includes 8 custom shell or perl scripts used to assemble the DNA sequence data by perform reciprocal blast searches between the reference sequences and assemblies for each sample, extract the best sequences based on the blast searches, screen the hits for each locus and keep only the best result, and generate the nucleotide sequence dataset for the predicted orthologues (see the file description.txt for details).\r\n7.\t(New in this version 2) Full_genetic_distances_matrix.csv shows the genetic distances between pairs of samples in the datset (proportion of nucleotides that differ between samples)."], "version_comment"=>["The original files were submitted to IDB before the associated manuscript (submitted for publication) was reviewed. Referees suggested including 2 additional files, i.e., the scripts used in the bioinformatics pipeline and the full matrix of genetic distances among sequenced individuals in the dataset. The previously included files remain unchanged. We could create a separate IDB deposit for the 2 new files or add them as version 2 of the prevvious dataset. Which is the best option?", "Added 2 new files per journal's suggestion: Scripts_and_description.zip and Full_genetic_distances_matrix.csv."]}	2024-04-04T16:41:15Z
RelatedMaterial	update: {"note"=>[nil, ""]}	2024-04-04T16:38:28Z
Dataset	update: {"description"=>["The following files include specimen information, DNA sequence data, and additional information on the analyses used to reconstruct the phylogeny of the leafhopper genus Neoaliturus as described in the Methods section of the original paper:\r\n1.\tTaxon_sampling.csv: contains data on the individual specimens from which DNA was extracted, including sample code, taxon name, collection data (locality, date and name of collector) and museum unique identifier.\r\n2.\tAlignments.zip: a ZIP archive containing 432 separate FASTA files representing the aligned nucleotide sequences of individual gene loci used in the analysis.\r\n3.\tConcatenated_Matrix.fa: is a FASTA file containing the concatenated individual gene alignments used for the maximum likelihood analysis in IQ-TREE.\r\n4.\tGenes_and_Loci.rtf: identifies the individual genes and loci used in the analysis. The partition name is the same as the name of the individual alignment file in the zipped Alignments folder.\r\n5.\tPartitions_best_scheme.nex: is a text file in the standard NEXUS format that indicates the names of the individual data partitions and their locations in the concatenated matrix, and also indicates the substitution model for each partition.", "The following files include specimen information, DNA sequence data, and additional information on the analyses used to reconstruct the phylogeny of the leafhopper genus Neoaliturus as described in the Methods section of the original paper:\r\n1.\tTaxon_sampling.csv: contains data on the individual specimens from which DNA was extracted, including sample code, taxon name, collection data (locality, date and name of collector) and museum unique identifier.\r\n2.\tAlignments.zip: a ZIP archive containing 432 separate FASTA files representing the aligned nucleotide sequences of individual gene loci used in the analysis.\r\n3.\tConcatenated_Matrix.fa: is a FASTA file containing the concatenated individual gene alignments used for the maximum likelihood analysis in IQ-TREE.\r\n4.\tGenes_and_Loci.rtf: identifies the individual genes and loci used in the analysis. The partition name is the same as the name of the individual alignment file in the zipped Alignments folder.\r\n5.\tPartitions_best_scheme.nex: is a text file in the standard NEXUS format that indicates the names of the individual data partitions and their locations in the concatenated matrix, and also indicates the substitution model for each partition.\r\n6.\tScripts & Description.zip includes 8 custom shell or perl scripts used to assemble the DNA sequence data by perform reciprocal blast searches between the reference sequences and assemblies for each sample, extract the best sequences based on the blast searches, screen the hits for each locus and keep only the best result, and generate the nucleotide sequence dataset for the predicted orthologues (see the file description.txt for details).\r\n7.\tFull_genetic_distances_matrix.csv shows the genetic distances between pairs of samples in the datset (proportion of nucleotides that differ between samples)."]}	2024-04-04T14:18:53Z
Dataset	update: {"hold_state"=>["version candidate under curator review", "none"]}	2024-04-04T14:11:40Z
Dataset	update: {"version_comment"=>[nil, "The original files were submitted to IDB before the associated manuscript (submitted for publication) was reviewed. Referees suggested including 2 additional files, i.e., the scripts used in the bioinformatics pipeline and the full matrix of genetic distances among sequenced individuals in the dataset. The previously included files remain unchanged. We could create a separate IDB deposit for the 2 new files or add them as version 2 of the prevvious dataset. Which is the best option?"]}	2024-04-03T22:17:44Z
RelatedMaterial	create: {"material_type"=>"Dataset", "availability"=>nil, "link"=>"https://doi.org/10.13012/B2IDB-8336414_V1", "uri"=>"10.13012/B2IDB-8336414_V1", "uri_type"=>"DOI", "citation"=>"Sinaiko, Guy; Cao, Yanghui; Dietrich, Christopher H. (2024): Datasets for phylogenomics of the leafhopper genus Neoaliturus Distant. University of Illinois at Urbana-Champaign. https://doi.org/10.13012/B2IDB-8336414_V1", "dataset_id"=>2675, "selected_type"=>"Dataset", "datacite_list"=>"IsNewVersionOf", "note"=>nil, "feature"=>nil}	2024-04-03T22:14:57Z
RelatedMaterial	create: {"material_type"=>"Article", "availability"=>nil, "link"=>"", "uri"=>"", "uri_type"=>"", "citation"=>"Sinaiko, G., Cao, Y., & Dietrich, C. H. (in review). Phylogenomics of the leafhopper genus Neoaliturus Distant, 1918 (Hemiptera: Cicadellidae: Deltocephalinae) reveals genetically divergent lineages in the invasive beet leafhopper. Submitted to Molecular Phylogenetics and Evolution ", "dataset_id"=>2675, "selected_type"=>"Article", "datacite_list"=>"", "note"=>"", "feature"=>false}	2024-04-03T22:14:57Z
Funder	create: {"name"=>"U.S. National Science Foundation (NSF)", "identifier"=>"10.13039/100000001", "identifier_scheme"=>"DOI", "grant"=>"DEB-1639601", "dataset_id"=>2675, "code"=>"NSF"}	2024-04-03T22:14:57Z
Creator	create: {"family_name"=>"Dietrich", "given_name"=>"Christopher H.", "identifier"=>"0000-0003-4005-4305", "email"=>"chdietri@illinois.edu", "is_contact"=>true, "row_position"=>3}	2024-04-03T22:14:57Z
Dataset	update: {"corresponding_creator_name"=>[nil, "Christopher H. Dietrich"], "corresponding_creator_email"=>[nil, "chdietri@illinois.edu"], "nested_updated_at"=>[Wed, 03 Apr 2024 22:14:57.557647000 UTC +00:00, Wed, 03 Apr 2024 22:14:57.658062000 UTC +00:00]}	2024-04-03T22:14:57Z
Creator	create: {"family_name"=>"Cao", "given_name"=>"Yanghui", "identifier"=>"0000-0002-0515-0767", "email"=>"caoyh@nwafu.edu.cn", "is_contact"=>false, "row_position"=>2}	2024-04-03T22:14:57Z
Creator	create: {"family_name"=>"Sinaiko", "given_name"=>"Guy", "identifier"=>"0000-0002-3047-7526", "email"=>"guysinaiko@gmail.com", "is_contact"=>false, "row_position"=>1}	2024-04-03T22:14:57Z

Datasets for phylogenomics of the leafhopper genus Neoaliturus Distant

Metadata

Dataset Description

Subject

Keywords

License

Funder

Corresponding Creator

Downloaded

Versions in Illinois Data Bank

Files

Change Log