
A Proteomics Toolkit: UniProt, InterPro and IntAct Databases at the EBI


Presentation Transcript


  1. A Proteomics Toolkit: UniProt, InterPro and IntAct Databases at the EBI

  2. Hinxton, U.K.

  3. European Bioinformatics Institute (http://www.ebi.ac.uk/) • Created as part of the EMBL in 1992 • To house the EMBL Nucleotide Sequence Data Library, established in 1980 • Today, 3 databases accept primary nucleotide data: EMBL at the EBI (EMBL), GenBank at the NCBI (NIH), DDBJ at the CIB (NIG)

  4. European Bioinformatics Institute (http://www.ebi.ac.uk/) EMBL-EBI maintains the world’s most comprehensive range of molecular databases

  5. • Nucleotide Sequence Database • Automatic Annotation of Genomes • Alternative Transcript Diversity • ArrayExpress • Alternative Splicing Database • Protein Sequence Database • Molecular Structure Database • Database of Protein Families and Domains • Chemical Entities of Biological Interest • Enzyme Database • Protein Interaction Database • Gene Ontology • Database of Biological Processes

  6. http://www.ebi.ac.uk/services/

  7. Roles of Public Domain Databases • To provide stable, long-term sources of basic information • To respond over the long term to the needs of the community • To act as repositories for published information • To bridge the gap between multiple data sources

  8. Protein Databases • UniProt: Database of Protein Sequences • InterPro: Database of Protein Families and Domains • IntAct: Database of Protein Interactions

  9. UniProt • A central repository of protein sequence and function • World's most comprehensive catalogue of information on proteins • Based on the original work of PIR, Swiss-Prot and TrEMBL • Funded mainly by NIH

  10. [Diagram] Protein sequencing (Met-Gln-Pro-Glu-Glu-Gly-Thr-Gly-Trp-Leu-Leu-Glu-Val-Gln-Gln-Met-Gly-Arg-Gly-Arg-Cys-Val-Gly-Pro-Ser-Leu-Gln-Glu-Trp-Arg-) • Swiss-Prot: annotation • Nucleotide sequencing (CGCTGTGATAGCGCTGATCGTGATGCGTATGCAGGTCGT CGCGCCTGTACGCTGAACGCTCGTGACGTGTAGTGCGCG) • EMBL

  11. [Diagram labels] UniProt • TrEMBL: translated EMBL + annotation • PIR PSD: annotation • Swiss-Prot • EMBL (EBI) • Nucleotide sequencing (CGCTGTGATAGCGCTGATCGTGATGCGTATGCAGGTCGT CGCGCCTGTACGCTGAACGCTCGTGACGTGTAGTGCGCG)
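
The "translated EMBL" step in the diagram can be illustrated with a small sketch. It assumes the standard genetic code and a single forward reading frame starting at the first base; the real TrEMBL pipeline works from annotated coding sequences, which is not reproduced here. The function name `translate` is hypothetical.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon.

    Trailing bases that do not fill a codon are ignored. Ambiguity
    codes such as 'N' are not handled and raise KeyError.
    """
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First nucleotide line from the diagram, read in frame 1 (an assumption).
fragment = "CGCTGTGATAGCGCTGATCGTGATGCGTATGCAGGTCGT"
protein = translate(fragment)
```

One-letter codes are used for the output, so `translate("ATGCAGCCGGAA")` gives the one-letter form of Met-Gln-Pro-Glu.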

  12. UniProt Consortium

  13. UniProt: 3 Components • UniProt Knowledgebase (UniProt) • UniProt Reference Clusters (UniRef) • UniProt Archive (UniParc)

  14. UniProt: 3 Components • UniProt Knowledgebase (UniProt): central repository for annotated protein sequences • UniProt Reference Clusters (UniRef) • UniProt Archive (UniParc)

  15. UniProt: 3 Components • UniProt Knowledgebase (UniProt): central repository for annotated protein sequences; Swiss-Prot: non-redundant, manually annotated; TrEMBL: redundant, automatically annotated • UniProt Reference Clusters (UniRef) • UniProt Archive (UniParc)

  16. UniProt: 3 Components • UniProt Knowledgebase (UniProt): central repository for annotated protein sequences; Swiss-Prot: non-redundant, manually annotated; TrEMBL: redundant, automatically annotated • UniProt Reference Clusters (UniRef): combines related sequences to speed up searching • UniProt Archive (UniParc)

  17. UniProt: 3 Components • UniProt Knowledgebase (UniProt): central repository for annotated protein sequences; Swiss-Prot: non-redundant, manually annotated; TrEMBL: redundant, automatically annotated • UniProt Reference Clusters (UniRef): combines related sequences to speed up searching; UniRef100, UniRef90, UniRef50 • UniProt Archive (UniParc)

  18. UniProt: 3 Components • UniProt Knowledgebase (UniProt): central repository for annotated protein sequences; Swiss-Prot: non-redundant, manually annotated; TrEMBL: redundant, automatically annotated • UniProt Reference Clusters (UniRef): combines related sequences to speed up searching; UniRef100, UniRef90, UniRef50 • UniProt Archive (UniParc): comprehensive repository for the history of sequences
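
As a rough illustration of what "combining related sequences" means at the strictest level, the sketch below groups entries whose sequences are identical, in the spirit of UniRef100. This is only an assumption-laden toy: the record identifiers and the function `cluster_identical` are hypothetical, and the similarity-based clustering behind UniRef90 and UniRef50 is not shown.

```python
from collections import defaultdict


def cluster_identical(records: dict[str, str]) -> list[list[str]]:
    """Group accession-like keys whose sequences are exactly identical.

    Comparison is case-insensitive. Clusters are returned in order of
    first appearance, with members sorted alphabetically.
    """
    groups: dict[str, list[str]] = defaultdict(list)
    for accession, sequence in records.items():
        groups[sequence.upper()].append(accession)
    return [sorted(members) for members in groups.values()]


# Hypothetical records: the identifiers and sequences are placeholders.
records = {
    "ID_A": "MQPEEG",
    "ID_B": "mqpeeg",   # same sequence as ID_A, different case
    "ID_C": "MQPEEGT",  # one residue longer, so a separate cluster
}
clusters = cluster_identical(records)
```

Searching one representative per cluster instead of every member is what makes a clustered set faster to search than the full database.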

  19. Databases cross-referenced in UniProt (UniProt Explicit Links) • 2D-gel Electrophoresis: ANU-2DPAGE, Aarhus/Ghent-2DPAGE, COMPLUYEAST-2DPAGE, ECO2DPAGE, HSC-2DPAGE, MAIZE-2DPAGE, OGP, PHCI-2DPAGE, PMMA-2DPAGE, Rat-heart-2DPAGE, Siena-2DPAGE, SWISS-2DPAGE • Sequence: EMBL/GenBank/DDBJ, PIR • Organism-Specific: AGD, dbSNP, DictyBase, EcoGene, EchoBASE, FlyBase, GeneDB_Spombe, GeneFarm, Genew, Gramene, HIV, H-InvDB, LegioList, Leproma, ListiList, MaizeDB, MGD, MypuList, OMIM, PhotoList, Reactome, RGD, SagaList, SGD, StyGene, SubtiList, TAIR, TIGR, TubercuList, WormBase, WormPep, ZFIN • Domains, Sites, Families: Gene3D, HAMAP, InterPro, PANTHER, Pfam, PIRSF, PRINTS, ProDom, PROSITE, SMART, TIGRFAM • Miscellaneous: Ensembl, GermOnline, Gene Ontology, MEROPS • PTM: GlycoSuiteDB, PhosSite • Structure: HSSP, PDB, MSD • Molecular Interaction: IntAct, TRANSFAC

  20. http://www.ebi.ac.uk/services/

  21. Searching UniProt (http://www.ebi.uniprot.org/index.shtml) • Search tools include: • Text Search • Power Search • BLAST, FASTA and MPsrch • Links to extra search services (including SRS)

  22. Text-based searching (http://www.ebi.uniprot.org/index.shtml) • Logical operators ‘&’ (and), ‘|’ (or) • Wildcards and numerical operators not allowed • Text Search: keyword queries • Power Search: can search for specific entry lines • Warehouse Search: link query to other databases
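
To make the ‘&’ and ‘|’ operators concrete, here is a minimal sketch of how such a keyword query could be evaluated against an entry description. It is not the UniProt search engine: the function `matches` is hypothetical, terms are matched as case-insensitive substrings, and ‘&’ is assumed to bind more tightly than ‘|’.

```python
def matches(query: str, text: str) -> bool:
    """Return True if `text` satisfies a query built from '&' and '|'.

    The query is split into '|' alternatives; each alternative is a
    set of '&'-joined terms that must all occur in the text.
    """
    haystack = text.lower()
    for clause in query.split("|"):
        terms = [t.strip().lower() for t in clause.split("&") if t.strip()]
        if terms and all(term in haystack for term in terms):
            return True
    return False


# Entry description taken from the transcript's example entry title.
description = "ubiquitin-protein ligase E3 mdm2"

both_terms = matches("ubiquitin & ligase", description)
either_term = matches("kinase | mdm2", description)
missing_term = matches("kinase & mdm2", description)
```

Wildcards and numeric comparisons are deliberately left out, mirroring the restriction stated on the slide.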

  23. Text Search Results • Each linked to the UniProt entry

  24. Sequence-based searching • BLAST, FASTA, MPsrch

  25. Sequence Search Results • View alignments • Identity score • UniProt entry

  26. Manipulate multiple data sets

  27. Build complex data sets • Use Venn diagrams to combine, intersect, or subtract multiple data sets
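
The combine, intersect and subtract operations correspond directly to set union, intersection and difference. The sketch below shows this with Python sets; the two result sets and the accession-like strings in them are illustrative placeholders, not actual search output.

```python
# Two hypothetical result sets of accession-like strings (placeholders).
text_hits = {"Q00987", "P04637", "P38936"}
sequence_hits = {"Q00987", "P23804"}

combined = text_hits | sequence_hits    # combine: entries in either set
shared = text_hits & sequence_hits      # intersect: entries in both sets
text_only = text_hits - sequence_hits   # subtract: text hits not found by sequence

summary = {
    "combined": len(combined),
    "shared": len(shared),
    "text_only": len(text_only),
}
```

Chaining these operators is enough to express the more complex data sets built from several searches.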

  28. UniProt/Swiss-Prot entry for human ubiquitin-protein ligase E3 mdm2

  29. Merged entries: • Remove redundancy • Can still be searched • Some literature search engines pull synonyms from UniProt for more complete searching

  30. IntAct Database

  31. • Summary of nucleotide data upon which entry is originally based • Structural data associated with entry protein

  32. IntAct Database

  33. IntAct Database • All the interactions with entry protein

  34. IntAct Database

  35. IntAct Database

  36. IntAct Database

  37. • Literature citation used for curation • Taxonomic reference • Experimental information • Experimental name • Experimental technique: co-immunoprecipitation • Links to interacting protein • Interaction information

  38. IntAct Database • Displays interactions graphically

  39. • View all GO interactions involving MDM2 • View all 7 interactions involving MDM2

  40. • Expand graph to see network surrounding one protein • Expand graph to see entire network • View all InterPro entries associated with MDM2
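
Expanding the graph around one protein can be pictured as a breadth-first walk over binary interactions. The sketch below is one way to do that under stated assumptions: the interaction list is invented for illustration (only the MDM2 and p53 pairing is mentioned in the slides, written here as TP53), and `build_graph` and `neighbourhood` are hypothetical helper names, not part of IntAct.

```python
from collections import defaultdict, deque


def build_graph(pairs):
    """Build an undirected adjacency map from (protein, protein) pairs."""
    graph = defaultdict(set)
    for a, b in pairs:
        graph[a].add(b)
        graph[b].add(a)
    return graph


def neighbourhood(graph, start, depth):
    """Return all proteins within `depth` interaction steps of `start`."""
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        node, dist = queue.popleft()
        if dist == depth:
            continue  # do not expand beyond the requested depth
        for neighbour in graph.get(node, ()):
            if neighbour not in seen:
                seen.add(neighbour)
                queue.append((neighbour, dist + 1))
    return seen


# Hypothetical interactions; PROT_A, PROT_B and PROT_C are placeholders.
pairs = [
    ("MDM2", "TP53"),
    ("MDM2", "PROT_A"),
    ("TP53", "PROT_B"),
    ("PROT_B", "PROT_C"),
]
graph = build_graph(pairs)
direct = neighbourhood(graph, "MDM2", 1)     # network surrounding one protein
whole = neighbourhood(graph, "MDM2", 10)     # large depth reaches the whole component
```

A depth of 1 corresponds to the direct partners of the protein, while a sufficiently large depth returns every protein connected to it.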

  41. • View all proteins in a network associated with a specific GO term • View interactions associated with both MDM2 and p53
