Illinois Data Bank

Seven ROSE datasets in high and low fragmentation conditions

This is a general description of the datasets included in this upload; details of each dataset can be found in the README.txt inside each compressed folder. The upload contains two archives:
1. ROSE-HF.tar.gz
2. ROSE-LF.tar.gz

HF (high fragmentary): 50% of the sequences are made fragmentary; the fragments have an average length of 25% of the original sequence length, with a standard deviation of 60 bp.
LF (low fragmentary): 25% of the sequences are made fragmentary; the fragments have an average length of 50% of the original sequence length, with a standard deviation of 60 bp.
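
To make the HF and LF definitions above concrete, here is a minimal Python sketch of how such a fragmentation step could look. It is not the script used to build these datasets. The function name `fragment_sequences`, the normal distribution for fragment lengths, the uniformly random start position, and the random choice of which sequences to fragment are all assumptions made for illustration; only the percentages and the 60 bp standard deviation come from the description.

```python
import random

# Parameters taken from the HF/LF descriptions; everything else is assumed.
CONDITIONS = {
    "HF": {"fraction": 0.50, "mean_ratio": 0.25, "sd": 60},
    "LF": {"fraction": 0.25, "mean_ratio": 0.50, "sd": 60},
}


def fragment_sequences(seqs, condition, seed=0):
    """Return a new dict (name -> sequence) with a fraction of sequences shortened.

    seqs maps sequence names to unaligned sequences (no gap characters).
    """
    params = CONDITIONS[condition]
    rng = random.Random(seed)
    names = sorted(seqs)
    n_frag = round(len(names) * params["fraction"])
    chosen = set(rng.sample(names, n_frag))  # assumed: chosen uniformly at random

    out = {}
    for name in names:
        seq = seqs[name]
        if name not in chosen or not seq:
            out[name] = seq  # left at full length
            continue
        # Assumed: target length drawn from a normal distribution.
        target = rng.gauss(params["mean_ratio"] * len(seq), params["sd"])
        length = max(1, min(len(seq), round(target)))
        # Assumed: the fragment is a contiguous substring at a random offset.
        start = rng.randint(0, len(seq) - length)
        out[name] = seq[start:start + length]
    return out
```

For example, `fragment_sequences(seqs, "HF")` would shorten about half of the entries in `seqs`, while `fragment_sequences(seqs, "LF")` would shorten about a quarter of them.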

The seven ROSE datasets made fragmentary are: 1000L1, 1000L3, 1000L4, 1000M3, 1000S1, 1000S2 and 1000S4.
"ROSE-HF.tar.gz" contains HF versions of the seven ROSE datasets.
"ROSE-LF.tar.gz" contains LF versions of the seven ROSE datasets.
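
Since the per-dataset details live in the README.txt files inside the archives, a short Python sketch for locating them may help. It uses only the standard-library `tarfile` module; the helper name `list_readmes` is hypothetical, and the internal folder layout of the archives is not described here, so the sketch simply searches every member path.

```python
import tarfile


def list_readmes(archive_path):
    """Return the paths of all README.txt members inside a .tar.gz archive."""
    with tarfile.open(archive_path, "r:gz") as tar:
        return sorted(
            member.name
            for member in tar.getmembers()
            if member.isfile() and member.name.split("/")[-1] == "README.txt"
        )
```

Calling `list_readmes("ROSE-HF.tar.gz")` after downloading the archive would list the README files without extracting anything.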

Life Sciences
ROSE; simulation; fragmentary
CC0
U.S. National Science Foundation (NSF)-Grant:1458652
Chengze Shen
Version: 1
DOI: 10.13012/B2IDB-6128941_V1
Comment: (none)
Publication Date: 2021-11-19

168 MB File
202 MB File

Contact the Research Data Service for help interpreting this log.

RelatedMaterial create: {"material_type"=>"Article", "availability"=>nil, "link"=>"https://doi.org/10.1089/cmb.2021.0585", "uri"=>"10.1089/cmb.2021.0585", "uri_type"=>"DOI", "citation"=>"Shen, Chengze, Minhyuk Park, and Tandy Warnow. 2022. “WITCH: Improved Multiple Sequence Alignment Through Weighted Consensus Hidden Markov Model Alignment.” Journal of Computational Biology : A Journal of Computational Molecular Cell Biology, May. doi:10.1089/cmb.2021.0585.", "dataset_id"=>2118, "selected_type"=>"Article", "datacite_list"=>"IsCitedBy"} 2022-05-23T16:25:57Z
Dataset update: {"version_comment"=>[nil, ""], "subject"=>[nil, "Life Sciences"]} 2022-02-09T17:55:37Z