Illinois Data Bank - Dataset

A newer version of this dataset is available. View the latest version.
Version DOI Comment Publication Date
2 10.13012/B2IDB-0569467_V2 Added additional information, new file as well as some corrections. Please refer to the dataset description for list of changes. 2019-05-16
1 10.13012/B2IDB-0569467_V1 2018-11-19

3.77 KB File
3.29 MB File
4.27 MB File
867 MB File
726 MB File
183 MB File
892 MB File
701 MB File
184 MB File
772 MB File
592 MB File
172 MB File
835 MB File
671 MB File
164 MB File
481 MB File
439 MB File
3.6 MB File
818 KB File
694 KB File
19.6 KB File
1.78 MB File

Contact the Research Data Service for help interpreting this log.

RelatedMaterial update: {"note"=>[nil, ""], "feature"=>[nil, false]} 2024-01-29T19:50:09Z
RelatedMaterial update: {"note"=>[nil, ""], "feature"=>[nil, false]} 2024-01-29T19:50:08Z
RelatedMaterial update: {"uri"=>["", "/10.1101/469130"], "uri_type"=>["", "DOI"], "datacite_list"=>["", "IsSupplementTo"], "note"=>[nil, ""], "feature"=>[nil, false]} 2024-01-29T19:50:08Z
RelatedMaterial update: {"uri"=>["", "https://github.com/ekmolloy/njmerge"], "uri_type"=>["", "URL"], "datacite_list"=>["", "IsSupplementedBy"], "note"=>[nil, ""], "feature"=>[nil, false]} 2024-01-29T19:50:08Z
RelatedMaterial create: {"material_type"=>"Dataset", "availability"=>nil, "link"=>"https://doi.org/10.13012/B2IDB-0569467_V2", "uri"=>"10.13012/B2IDB-0569467_V2", "uri_type"=>"DOI", "citation"=>"", "dataset_id"=>719, "selected_type"=>"Dataset", "datacite_list"=>"IsPreviousVersionOf"} 2019-05-16T19:38:01Z
Dataset update: {"description"=>["This repository includes scripts and datasets for the paper, \"Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge.\" All data files in this repository are for analyses using the logdet distance matrix computed on the concatenated alignment. Data files for analyses using the average gene-tree internode distance matrix can be downloaded from the Illinois Data Bank (https://doi.org/10.13012/B2IDB-1424746_V1). The latest version of NJMerge can be downloaded from Github (https://github.com/ekmolloy/njmerge).\r\n\r\n***When downloading datasets, please note that the following errors.***\r\n\r\nIn tools.zip, the compare_trees.py and the compare_tree_lists.py scripts incorrectly refer to the symmetric difference rate as the Robinson-Foulds error rate. Because the symmetric difference rate and the Robinson-Foulds error rate are equal for binary trees, this does not impact the species tree error rates reported in the study. This can impact the gene tree error rates reported in the study (see data-gene-trees.csv in data.zip), as FastTree-2 returns trees with polytomies whenever 3 or more sequences in the input alignment are identical. Note that the symmetric difference rate is always greater than or equal to the Robinson-Foulds error rate, so the gene tree error rates reported in the study are more conservative.", "This repository includes scripts and datasets for the paper, \"Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge.\" All data files in this repository are for analyses using the logdet distance matrix computed on the concatenated alignment. Data files for analyses using the average gene-tree internode distance matrix can be downloaded from the Illinois Data Bank (https://doi.org/10.13012/B2IDB-1424746_V1). The latest version of NJMerge can be downloaded from Github (https://github.com/ekmolloy/njmerge).\r\n\r\n***When downloading datasets, please note that the following errors.***\r\n\r\nIn tools.zip, the compare_trees.py and the compare_tree_lists.py scripts incorrectly refer to the normalized symmetric difference as the normalized Robinson-Foulds distance. Because the normalized symmetric difference and the normalized Robinson-Foulds distance are equal for binary trees, this does not impact the species tree error rates reported in the study. This can impact the gene tree error rates reported in the study (see data-gene-trees.csv in data.zip), as FastTree-2 returns trees with polytomies whenever 3 or more sequences in the input alignment are identical. Note that the symmetric difference rate is always greater than or equal to the Robinson-Foulds error rate, so the gene tree error rates reported in the study are more conservative."]} 2019-04-07T04:30:31Z
Dataset update: {"description"=>["This repository includes scripts and datasets for the paper, \"Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge.\" All data files in this repository are for analyses using the logdet distance matrix computed on the concatenated alignment. Data files for analyses using the average gene-tree internode distance matrix can be downloaded from the Illinois Data Bank (https://doi.org/10.13012/B2IDB-1424746_V1). The latest version of NJMerge can be downloaded from Github (https://github.com/ekmolloy/njmerge).\r\n\r\n***When downloading datasets, please note that the following errors.***\r\n\r\nIn tools.zip, the compare_trees.py script incorrectly refers to the symmetric difference rate as the Robinson-Foulds error rate. Because the symmetric difference rate and the Robinson-Foulds error rate are equal for binary trees, this does not impact the species tree error rates reported in the study. This can impact the gene tree error rates reported in the study (see data-gene-trees.csv in data.zip), as FastTree-2 returns trees with polytomies whenever 3 or more sequences in the input alignment are identical. Note that the symmetric difference rate is always greater than or equal to the Robinson-Foulds error rate, so the gene tree error rates reported in the study are more conservative.", "This repository includes scripts and datasets for the paper, \"Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge.\" All data files in this repository are for analyses using the logdet distance matrix computed on the concatenated alignment. Data files for analyses using the average gene-tree internode distance matrix can be downloaded from the Illinois Data Bank (https://doi.org/10.13012/B2IDB-1424746_V1). The latest version of NJMerge can be downloaded from Github (https://github.com/ekmolloy/njmerge).\r\n\r\n***When downloading datasets, please note that the following errors.***\r\n\r\nIn tools.zip, the compare_trees.py and the compare_tree_lists.py scripts incorrectly refer to the symmetric difference rate as the Robinson-Foulds error rate. Because the symmetric difference rate and the Robinson-Foulds error rate are equal for binary trees, this does not impact the species tree error rates reported in the study. This can impact the gene tree error rates reported in the study (see data-gene-trees.csv in data.zip), as FastTree-2 returns trees with polytomies whenever 3 or more sequences in the input alignment are identical. Note that the symmetric difference rate is always greater than or equal to the Robinson-Foulds error rate, so the gene tree error rates reported in the study are more conservative."]} 2019-03-12T15:09:47Z
Dataset update: {"description"=>["This repository includes scripts and datasets for the paper, \"Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge.\" All data files in this repository are for analyses using the logdet distance matrix computed on the concatenated alignment. Data files for analyses using the average gene-tree internode distance matrix can be downloaded from the Illinois Data Bank (https://doi.org/10.13012/B2IDB-1424746_V1). The latest version of NJMerge can be downloaded from Github (https://github.com/ekmolloy/njmerge).", "This repository includes scripts and datasets for the paper, \"Statistically consistent divide-and-conquer pipelines for phylogeny estimation using NJMerge.\" All data files in this repository are for analyses using the logdet distance matrix computed on the concatenated alignment. Data files for analyses using the average gene-tree internode distance matrix can be downloaded from the Illinois Data Bank (https://doi.org/10.13012/B2IDB-1424746_V1). The latest version of NJMerge can be downloaded from Github (https://github.com/ekmolloy/njmerge).\r\n\r\n***When downloading datasets, please note that the following errors.***\r\n\r\nIn tools.zip, the compare_trees.py script incorrectly refers to the symmetric difference rate as the Robinson-Foulds error rate. Because the symmetric difference rate and the Robinson-Foulds error rate are equal for binary trees, this does not impact the species tree error rates reported in the study. This can impact the gene tree error rates reported in the study (see data-gene-trees.csv in data.zip), as FastTree-2 returns trees with polytomies whenever 3 or more sequences in the input alignment are identical. Note that the symmetric difference rate is always greater than or equal to the Robinson-Foulds error rate, so the gene tree error rates reported in the study are more conservative."]} 2019-03-11T12:42:05Z
RelatedMaterial update: {"uri"=>[nil, ""], "uri_type"=>[nil, ""], "datacite_list"=>[nil, ""]} 2019-01-02T17:15:48Z
RelatedMaterial update: {"uri"=>[nil, ""], "uri_type"=>[nil, ""], "datacite_list"=>[nil, ""]} 2019-01-02T17:15:48Z
RelatedMaterial update: {"uri"=>[nil, ""], "uri_type"=>[nil, ""], "datacite_list"=>[nil, ""]} 2019-01-02T17:15:48Z
Dataset update: {"version_comment"=>[nil, ""], "subject"=>[nil, "Life Sciences"]} 2019-01-02T17:15:48Z