readme file for dataset
The datasets used in the paper:
Rfold: An exact algorithm for computing local base pairing probabilities
Hisanori Kiryu; Taishin Kin; Kiyoshi Asai
Bioinformatics 2007; doi: 10.1093/bioinformatics/btm591
RNA sequences are taken from the Rfam 7.0 database.
dataset1~4.stk correspond to Dataset1~4 in the above paper.
(see the paper for detailed descriptions of these datasets)
dataset1: 151 RNA sequences from the Rfam 7.0 database.
dataset2: e = 100, 300, 500, and 1000 random bases are appended
to both ends of each sequence in dataset1.
dataset3: the RNA sequences of dataset1 and random sequences of length 1000,
concatenated alternately.
dataset4: 10 random sequences, each 10,000 bases long.
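
The constructions described for dataset2 and dataset3 can be illustrated with a short Python sketch. This is not the authors' generation script. The helper names (random_rna, add_flanks, interleave), the uniform base composition, the fixed seed, and the choice to start dataset3 with an RNA sequence rather than a random one are all assumptions made for illustration; see the paper for the actual procedure.

```python
import random

BASES = "ACGU"  # assumption: random bases drawn uniformly from A, C, G, U


def random_rna(length, rng):
    """Return a random RNA string of the given length."""
    return "".join(rng.choice(BASES) for _ in range(length))


def add_flanks(seq, e, rng):
    """dataset2-style: put e random bases on both ends of seq."""
    return random_rna(e, rng) + seq + random_rna(e, rng)


def interleave(seqs, spacer_len, rng):
    """dataset3-style: alternate each sequence with a random spacer.

    Assumption: the result starts with a real sequence and ends
    with a random spacer.
    """
    parts = []
    for s in seqs:
        parts.append(s)
        parts.append(random_rna(spacer_len, rng))
    return "".join(parts)


rng = random.Random(0)  # fixed seed so the sketch is reproducible

# Toy stand-ins for the Rfam sequences of dataset1 (hypothetical).
toy = ["GGGAAACCC", "GCGCUUCGGCGC"]

flanked = [add_flanks(s, 100, rng) for s in toy]  # e = 100
combined = interleave(toy, 1000, rng)             # spacers of length 1000
```

With e = 100, each flanked sequence is 200 bases longer than its original, and the original sits at offset 100.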
Citation:
Rfold: An exact algorithm for computing local base pairing probabilities
Hisanori Kiryu; Taishin Kin; Kiyoshi Asai
Bioinformatics 2007; doi: 10.1093/bioinformatics/btm591
Reference:
Sam Griffiths-Jones, Alex Bateman, Mhairi Marshall, Ajay Khanna, and
Sean R Eddy. Rfam: an RNA family database.
Nucleic Acids Res, 31(1):439-441, 2003.