Personal tools
You are here: Home Bioinformatcs Tools for RNAs Stem Kernels README
Document Actions

README

by Kengo Sato last modified 2008-01-17 01:13

Click here to get the file

Size 2.0 kB - File type text/x-readme

File contents

	       DAG kernels for structural RNA analysis
		  -- profile-profile stem kernels --

                                    Kengo SATO <sato-kengo@aist.go.jp>

1. Build

You can make the kernel matrix calculator for the profile-profile stem
kernels as follows:

./configure &&  make

Several options for the configure script can be specified. You can see
more details with --help option.

The configure script will find required libraries:

* Boost C++ library >= 1.33.1 (http://www.boost.org)
* Vienna RNA packagae >= 1.6  (http://www.tbi.univie.ac.at/~ivo/RNA/)

If you have the MPI system, you may give '--with-mpi' option for the
configure script to build the executable that can run in parallel.


2. Learning a Model

Typical usage of the stem kernel is the following command line:

% stem_kernel_lite -n km.dat +1 pos.fa -1 neg.fa

km.dat is the resulting pre-computed kernel matrix that can be
accepted by LIBSVM (e.g. svm-train -t 4 -b 1 km.dat km.model),
pos.fa and neg.fa are positive sequences and negative sequences,
respectively, written in the FASTA format.
The option -n makes km.dat nomalized to avoid sequence length bias.

The executable can accept the following sequence formats:
* FASTA format,
* CLUSTAL format,
* MAF format.
Furthermore, the filename expansion '*' can be accepted such as:

% stem_kernel_lite -n km.dat +1 'pos-*.aln' -1 'neg-*.aln'


3. Prediction by The Model

% stem_kernel_lite -n x.dat +1 pos.fa -1 neg.fa \\
      --test +1 seq1.fa -1 seq2.fa ....

Before the option --test, you should put the same options as when the
model were learned. After the option --test, you put the class labels
and sequence files to be predicted. Then, you can predict the class of
sequences using LIBSVM (e.g. svm-predict -b 1 x.dat km.model output). 


4. References

Sakakibara, Y., Popendorf, K., Ogawa, N., Asai, K. and Sato, K.: Stem
kernels for RNA sequence analyses. J Bioinform Comput Biol, 2007, 5,
1103-1122.

Sato, K., Kin, T., Asai, K. and Sakakibara, Y.: Directed acyclic graph
kernels for structural RNA analysis. submitted.

Powered by Plone CMS, the Open Source Content Management System

This site conforms to the following standards: