1 / 13

Peter Rice and Mahmut Uludag

Peter Rice and Mahmut Uludag. EMBOSS as an Efficient DAS Annotation Source. Peter Rice, EBI (pmr@ebi.ac.uk) Mahmut Uludag, EBI (uludag@ebi.ac.uk) 10th March 2009. European Molecular Biology Open Software Suite 1996: Started at Sanger Centre 2000: Release 1.0.0 and moved to HGMP

elpida
Download Presentation

Peter Rice and Mahmut Uludag

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Peter Rice and Mahmut Uludag EMBOSS as an Efficient DAS Annotation Source Peter Rice, EBI (pmr@ebi.ac.uk) Mahmut Uludag, EBI (uludag@ebi.ac.uk) 10th March 2009

  2. European Molecular Biology Open Software Suite 1996: Started at Sanger Centre 2000: Release 1.0.0 and moved to HGMP 2005: Moved to EBI (HGMP closed) 2008: Release 6.0.0 http://emboss.sourceforge.net EMBOSS: History

  3. Open source package Sequence analysis 200 applications 100 third-party applications Reads 40 sequence formats Writes 40 sequence formats Reads 6 feature formats Writes 10 feature formats EMBOSS: Status

  4. Over 100 interfaces / packages containing EMBOSS Command line Web interfaces GUIs SOAP Web services (EMBRACE) Taverna workflows Galaxy EMBOSS: Interfaces

  5. EMBOSS produces annotations in DASGFF format Protein sequence referencing using Uniprot protein identifiers Nucleotide sequence referencing using Ensembl gene identifiers MyDAS based annotation server Executes EMBOSS programs based on the incoming requests Overview

  6. pepcoil; predicted coiled coil regions in protein sequences patmatmotifs; motifs from the PROSITE database helixturnhelix; nucleic acid-binding motifs in protein sequences garnier; predicted protein secondary structures using GOR method sigcleave; predicted signal cleavage sites in protein sequences digest; protein proteolytic enzyme or reagent cleavage sites antigenic; predicted antigenic regions in protein sequences Protein sequence annotation,EMBOSS programs used so far

  7. equicktandem, tandem; tandem repeats in nucleotide sequences silent; restriction enzyme sites in a nucleotide sequence which can be inserted (mutated) without changing the translation jaspscan; transcription factor binding sites from the JASPAR database marscan; matrix/scaffold recognition (MRS) signatures in DNA sequences restrict; restriction enzyme cleavage sites in nucleotide sequences tcode; protein-coding regions identified using Fickett TESTCODE statistic Nucleotide sequence annotation,EMBOSS programs used so far

  8. 26 EMBOSS programs producing graphical outputs Possibly using stylesheet support in Ensembl & DAS 13 EMBOSS alignment programs DAS 1.53E has alignment extension Other EMBOSS programsthat can be used for annotation

  9. Dasty2; for protein annotations Good in displaying individual features Useful links for further exploration Links to ontology terms used Links to original DAS responses Ensembl; for gene and protein annotations Displays features in genomic context Possible to use DAS resources that not in the registry Test clients used

  10. Example Dasty screen:

  11. Example Ensembl screen:

  12. Need to register on dasregistry.org Experimental DAS server available at http://wwwdev.ebi.ac.uk/soaplab/das DAS servers as data sources Common coordinate systems Work in progress

  13. The EMBOSS Team Peter Rice Alan Bleasby Jon Ison Mahmut Uludag

More Related