1 / 28

FASTA and BLAST

FASTA and BLAST. FASTA: Introduction. FASTA   (pronounced FAST-Aye) stands for FAST-All, reflecting the fact that it can be used for a fast protein comparison or a fast nucleotide comparison This is achieved by performing optimized searches for local alignments using a substitution matrix.

Jeffrey
Download Presentation

FASTA and BLAST

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. FASTA and BLAST

  2. FASTA: Introduction • FASTA  (pronounced FAST-Aye) stands for FAST-All, reflecting the fact that it can be used for a fast protein comparison or a fast nucleotide comparison • This is achieved by performing optimized searches for local alignments using a substitution matrix. • The high speed of this program is achieved by using the observed pattern of word hits to identify potential matches before attempting the more time consuming optimized search.

  3. fasta • http://fasta.bioch.virginia.edu/ • http://www.ebi.ac.uk/fasta33/ • http://www.ebi.ac.uk/fasta33/genomes.html

  4. FASTA sequence format

  5. FASTA Compares a protein sequence to another protein sequence or to a protein database • FASTX/FASTY Compares a DNA sequence to a protein sequence database, translating the DNA sequence in three forward (or reverse) frames and allowing frameshifts.

  6. TFASTX/TFASTY Compares a protein sequence to a DNA sequence or DNA sequence library. The DNA sequence is translated in three forward and three reverse frames, and the protein query sequence is compared to each of the six derived protein sequences. • FASTF/TFASTF Compares an ordered peptide mixture • FASTS/TFASTS Compares set of short peptide fragments, as would be obtained from mass-spec.

  7. FASTA@EBI

  8. Output: Hit list

  9. Alignment of query to a hit

  10. Exercise • Align two sequences by FASTA and SSEARCH • http://gel.ym.edu.tw/~wxluo/lec5/eco_hslv.fasta • http://gel.ym.edu.tw/~wxluo/lec5/vch_hslv.fasta • http://www.ebi.ac.uk/fasta33/

  11. Useful links • http://www.ebi.ac.uk/2can/home.html • http://fasta.genome.jp/ • http://blast.genome.jp/ • http://www.ebi.ac.uk/fasta33/

  12. FASTA method

  13. BLAST • http://www.ncbi.nih.gov/BLAST

  14. BLAST programs

  15. Running standalone NCBI BLAST • formatdb –i input.fasta –o T –p T • blastall –p blastp –d db –i query.fasta –o blast.result

  16. References • Altschul, S.F., Gish, W., Miller, W., Myers, E.W. & Lipman, D.J. (1990) "Basic local alignment search tool." J. Mol. Biol. 215:403-410. • Altschul, S.F., Madden, T.L., Schäffer, A.A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D.J. (1997) "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs." Nucleic Acids Res. 25:3389-3402 • http://www.ncbi.nlm.nih.gov/blast/producttable.shtml

  17. Exercise • Blast two sequences against sequence database • http://gel.ym.edu.tw/~wxluo/lec5/eco_hslv.fasta • http://gel.ym.edu.tw/~wxluo/lec5/vch_hslv.fasta • http://www.ebi.ac.uk/fasta33/ • http://bioinfo.nchc.org.tw/

More Related