HIPPI Dataset
Dataset Description |
This archive contains all the alignments and trees used in the HIPPI paper [1]. The pfam.tar archive contains the PFAM families
./X/Y/initial.fasttree
where X is a Pfam family, Y is the cross-fold set (0, 1, 2, or 3). Inside the folder
The query.tar archive contains the query sequences for each cross-fold set. The associated query sequences for a cross-fold Y is labeled as query.Y.Z.fas,
[1] Nguyen, Nam-Phuong D, Mike Nute, Siavash Mirarab, and Tandy Warnow. (2016) HIPPI: Highly Accurate Protein Family Classification with Ensembles of HMMs. To appear in BMC Genomics. |
Subject |
Life Sciences |
Keywords |
HIPPI dataset; ensembles of profile Hidden Markov models; Pfam |
License |
CC0 |
Funder |
U.S. National Science Foundation (NSF)-Grant:DBI-1461364 |
Funder |
U.S. National Science Foundation (NSF)-Grant:ABI-1458652 |
Funder |
U.S. National Science Foundation (NSF)-Grant:III:AF:1513629 |
Funder |
University of Illinois at Urbana-Champaign |
Corresponding Creator |
Tandy Warnow |
Downloaded |
859 times |
| Version | DOI | Comment | Publication Date |
|---|---|---|---|
| 1 | 10.13012/B2IDB-6795126_V1 | 2016-08-16 |
Contact the Research Data Service for help interpreting this log.
| RelatedMaterial | update: {"datacite_list"=>["IsSupplementTo,IsCitedBy", "IsSupplementTo"], "note"=>[nil, ""], "feature"=>[nil, false]} | 2023-12-13T19:27:34Z |
| Dataset | update: {"version_comment"=>[nil, ""], "subject"=>[nil, "Life Sciences"]} | 2018-02-09T16:04:29Z |
| RelatedMaterial | update: {"citation"=>["Nguyen, Nam-Phuong D, Mike Nute, Siavash Mirarab, and Tandy Warnow. HIPPI: Highly Accurate Protein Family Classification with Ensembles of HMMs. 2016. To appear in BMC Genomics.", "Nguyen, Nam-Phuong D, Mike Nute, Siavash Mirarab, and Tandy Warnow. HIPPI: Highly Accurate Protein Family Classification with Ensembles of HMMs. 2016. BMC Genomics. doi:10.1186/s12864-016-3097-0"]} | 2016-11-15T20:06:18Z |
| RelatedMaterial | update: {"link"=>["", "http://dx.doi.org/10.1186/s12864-016-3097-0"], "uri"=>["", "10.1186/s12864-016-3097-0"], "uri_type"=>["", "DOI"], "datacite_list"=>["", "IsSupplementTo,IsCitedBy"]} | 2016-11-15T14:32:42Z |
| Creator | create: {"family_name"=>"Warnow", "given_name"=>"Tandy", "identifier"=>"", "email"=>"warnow@illinois.edu", "is_contact"=>true, "row_position"=>4} | 2016-08-26T15:00:54Z |
| Creator | create: {"family_name"=>"Mirarab", "given_name"=>"Siavash", "identifier"=>"", "email"=>"smirarab@gmail.com", "is_contact"=>false, "row_position"=>3} | 2016-08-26T15:00:54Z |
| Creator | create: {"family_name"=>"Nute", "given_name"=>"Mike", "identifier"=>"", "email"=>"nute2@illinois.edu", "is_contact"=>false, "row_position"=>2} | 2016-08-26T15:00:54Z |
| Creator | update: {"is_contact"=>[true, false]} | 2016-08-26T15:00:54Z |
| Dataset | update: {"corresponding_creator_name"=>["Nam-phuong Nguyen", "Tandy Warnow"], "corresponding_creator_email"=>["namphuon@cs.utah.edu", "warnow@illinois.edu"]} | 2016-08-26T15:00:54Z |
| RelatedMaterial | update: {"uri"=>[nil, ""], "uri_type"=>[nil, ""], "datacite_list"=>[nil, ""]} | 2016-08-25T20:58:20Z |