170 likes | 192 Views
Explore an efficient algorithm for mining frequent subgraphs in biological networks, focusing on metabolic pathways and protein interaction networks. The presentation covers definitions, algorithm details, results, and insightful comments on graph isomorphism problems.
E N D
An efficient algorithm for detecting frequent subgraphs in biological networks Class presentation for CPSC 689-604 Authors: Mehmet Kouturk, Ananth Grama and Wojciech Szpankowski Presented by: Songjian Lu Professor: Jianer Chen
Contents • Introduction • Metabolic Pathways in detail • Mining metabolic pathways • Algorithm • Result • Some comments
Introduction-1 • Metabolic pathways • Model to a directed graph • Node—representing enzymes • Edge—representing the product of one enzyme is consumed by a reaction catalyzed by another enzyme enzyme enzyme enzyme enzyme
Introduction-2 • Protein interaction network • Node—representing Protein • Edge—representing interaction between proteins • Pairwise interactions—getting by two-hybrid experiments • Multi-way interactions—getting by mass spectrometry experiments • Database BIND(http://www.blueprint.org/bind) • Database DIP(http://dip.doe-mbi.ucla.edu/) protein protein protein
Metabolic pathway detail-1 • DEFINITION:A metabolic pathway P(M,Z,R) is a collection of metabolites M, enzymes Z, and reactions R, where each reaction rR is associated with a set of enzymes Z(r)Z, a set of substrates S(r)M, and a set of products T(r) M. S(r) Z(r) T(r)
Metabolic pathway detail-2 • DEFINITION:Given metabolic pathway P(M,Z,R), the associated directed graph G(V,E) of P is constructed as follows:for any enzyme zi Z, there is a node vi V. There is an edge from vi to vj, i.e. (vi,vj) E if and only if r1,r2 R, such that zi Z(r1), zj Z(r2) and T(r1)S(r2) .
Mining Metabolic pathways-1 • DEFINITION:Given a collection of graphs G1,G2,…,Gn and support threshold , the Maximal Frequent Subgraph Discovery problem is one of finding all maximal connected subgraphs that are contained in at least n of the input graphs.
Comment-1 • Graph isomorphism problem is very hard • Given two graphs, if they are isomorphic? • Given two graphs, if one graph is isomorphic to a subgraph of another graph?
Comment-2 • Given two graph G1, G2, if there exists a induced subgraph of k vertices in G1 and a induced subgraph of k vertices in G2, such that these two subgraph are isomorphic?(w[1] hard) • Given two graph G1, G2, if there exists a subgraph of k edges in G1 and a subgraph of k edges in G2, such that these two subgraphs are isomorphic?(w[1] hard) • Given two graph G1, G2, if there exists a subtree of k vertices in G1 and a subtree of k vertices in G2, such that these two subtrees are isomorphic?(FPT) • Given two graph G1, G2, if there exists a subpath of k vertices in G1 and a subpath of k vertices in G2, such that these two subpaths are isomorphic?(FPT)
Comment-3 • Given two graph G1, G2, if there exists a induced subgraph of k vertices in G1 and a induced subgraph of k vertices in G2, such that these two subgraph are isomorphic?(w[1] hard) G1 G2 G v1 v1 v1 v1 v1 v1 v1 v1 v1 v1 v1 v1
Homework • Problem 1: Given G1=(V1,E1),G2=(V2,E2), and integer k, if there exists a subgraph G1’ of k edges in G1, a subgraph G2’ of k edges in G2 such that there is an isomorphic mapping from G1’ to G2’. Prove this problem is NP-complete. • Problem2:Given G=(V,E), IV, and integer k, if there exists a subtree T of k vertices, such that all leaves are in I. Prove this problem is NP-complete.