BLAT
BLAT——The Blast Like Alignment Tool
基本命令:
blat database.fasta query.fasta output.psl
主要用途:
适用于较大运算量并且需要找共线性的序列比对
BLAT结果用图形输出可以通过psl2gff.pl以及gff2ps实现。
先安装gawk;
然后:
psl2gff.pl output.sql output.gff
gff2ps output.gff > output.ps
gff2ps -C brown.a4.rc output.gff >output.ps(此时ps矢量图输出经过brown.a4.rc预处理)
psl2gff.pl[is available to convert the PSL format (BLAT) to GFF format]可从http://www.sanger.ac.uk/Software/formats/GFF/下载
Repeatmasker: 屏蔽重复序列
安装(Installation):
download crossmatch
and put crossmatch in /usr/local
then do the following procedure:
1. Download RepeatMasker
Latest Version Released 10/6/2006: RepeatMasker-open-3-1-6.tar.gz
2. Unpack Distribution
Unpack the distribution in your home directory or in a location where it may be shared with other users of your system ( ie. /usr/local/ ).
* cp RepeatMasker-open-3-#-#.tar.gz /usr/local
* cd /usr/local
* gunzip RepeatMasker-open-3-#-#.tar.gz
* tar xvf RepeatMasker-open-3-#-#.tar
3. Install RepeatMasker Libraries
Download the libraries from http://www.girinst.org and unpack in the RepeatMasker program directory. This version supports the following libraries ( it is always best to use the most current ): repeatmaskerlibraries-20061006.tar.gz and later.
* cp repeatmaskerlibraries-20061006.tar.gz /usr/local/RepeatMasker/
* cd /usr/local/RepeatMasker
* gunzip repeatmaskerlibraries-20061006.tar.gz
* tar xvf repeatmaskerlibraries-20061006.tar
* rm repeatmaskerlibraries-20061006.tar
4. Run Configure Script
The program requires some configuration before use. There is now a script which assists in making these changes.
* cd /usr/local/RepeatMasker
* perl ./configure
BLASTCLUST使用说明: http://www.pdg.cnb.uam.es/cursos ... tAndClus/index.html
blastclust manually:
use cpus: 4
inputfile: proteins.fsa
outputfile: cluster_60_80_complete.ssv
protein identity: >60%
coverage: >80%
blastclust -a 4 -i proteins.fsa -o cluster_60_80_complete.ssv -S 60 -L 0.80 -e F
if blastclust -a 4 -i proteins.fsa -o cluster_60_80_complete.ssv -S 60 -L 0.80 -e F -p F
then the input file is nucleotides, not proteins
MUSCLE(global align) uses two types of similarity measure: the fractional identity D computed from a global alignment of the two sequences, and measures obtained by k-mer counting.(http://www.biomedcentral.com/1471-2105/5/113) 作者: xmubingo 时间: 2011-3-7 16:20
挺全的。:lol