1 / 29

RECOMB SATELLITE MEETING NEW-YORK, NOVEMBER 2010

RECOMB SATELLITE MEETING NEW-YORK, NOVEMBER 2010. GENIE – GEne Network Inference with Ensemble of trees. Van Anh Huynh-Thu Department of Electrical Engineering and Computer Science, Systems and Modeling, University of Liege, Belgium. Inference of GRNs.

laird
Download Presentation

RECOMB SATELLITE MEETING NEW-YORK, NOVEMBER 2010

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. RECOMB SATELLITE MEETINGNEW-YORK, NOVEMBER 2010

  2. GENIE – GEne Network Inference with Ensemble of trees Van Anh Huynh-Thu Department of Electrical Engineering and Computer Science, Systems and Modeling, University of Liege, Belgium

  3. Inference of GRNs • Gene regulatory networks (GRNs) are behind the scene players in gene expression • How do we determine the regulators of each gene? • Input: • Gene expression data in different conditions/time points • A subset of the genes that contains all the regulators (without GENIE accuracy plummets)

  4. Underlying Model • Every reverse engineering tool assumes an underlying model • GENIE assume that the GRN is a Boolean network • Therefore, the regulation of each gene is a Boolean function

  5. GENIE Strategy Outline • Not to make strong assumptions about the possible regulatory interactions (e.g. a strong assumption is linearity) • Treat time-series as static experiments • Solve the problem for each gene separately, and combine the results • The final output is a ranking of potential interactions in descending confidence

  6. GENIE workflow

  7. Tree-based Ensemble Methods • A regulation function is a binary tree – at each node a binary test according to a different regulator is performed • The prediction is at the leaf • For each gene, randomly select a set of samples and produce a tree from each one (the root is the single gene that splits K random conditions of the target best, and so on) • Rank the regulators according to their importance in the trees

  8. Ranking of regulators #S is the number of samples that reach the node N #St (Sf) is the number of samples with output true (false) Var() is the variance of the output In order to avoid bias towards highly variable genes, the expression values are first normalized to unit variance

  9. Best performer in DREAM5 network inference

  10. The Genetic Landscape of the Cell Charles Boone University of Toronto, Donnelly Center

  11. Synthetic Genetic Arrays • Single mutant strand (query gene) is crossed with all other single mutants • Double mutants are selected • Currently done for budding yeast, e.coli and s.pombe No growth

  12. Genetic Interactions • Positive interaction: The double knockout is more viable than would be expected by the separate contributions of the single knockouts • Negative interaction: The double knockout is less viable than would be expected by the separate contributions of the single knockouts • They crossed ~1700 yeast single mutants with ~3,800 single mutants, and after filtering failures they got ~5.4 million double mutants

  13. Yeast Interaction Map Edges are interactions that pass cutoff threshold (170,000) Proximity in the layout is according to similarity in interaction profiles Colored sets = GO enrichment

  14. Proximity between clusters and related functions Proximate clusters Both require cytoskeleton genes

  15. Zoom in on pathway Required for polarization and growth Cell division Red – Negative Green - Positive Translation Budding Interactions between pathways and complexes were often monochromatic

  16. Positive vs. negative interactions No interaction Negative interactions are ~two times more prominent than positive

  17. Degree distribution Hubs are less numerous Severe fitness defects in single mutants correlate with degree

  18. Gene duplicates interact less

  19. Correlation between degree and gene properties Black - PPI # morphological phenotypes # chemical perturbations unstable structure

  20. Genetic interactions between cellular processes Cell cycle is more buffered?

  21. Hubs in the chemical interaction networks match hubs in GI network Single mutant + chemical = chemical interaction Hydroxyurea blocks DNA synthesis Erodoxin (new) similar to protein Folding-related gene DNA repair

  22. Discovering Master Regulators of Alcohol Addiction William Shin Center for Computational Biology and Bioinformatics Columbia University

  23. Rat Model of Alcohol Addiction Control Alcohol Self Administration Alcohol Vapor Treatment (Chronic alcohol addiction) No Alcohol Vapor Non Dependent Dependent Control

  24. 100 * 75 Alcohol responding (0.5 hr) 50 Baseline Non-dependent 25 (exposed to air) Dependent (exposed to alcohol vapor) 0 Rat model of alcohol addiction Alcohol self-administration (lever pressing) Induction of alcohol-dependence Alcohol Intake during early withdrawal

  25. Identification of TF-target interactions • Rat Brain regions were sliced and used as microarray samples • 92 samples from Dependent, Non-Dependent, Control Rats across 8 regions that are known as sites-of-action for of addictive drugs. • Applied ARACNE to this data • Information-theory based (MI) • Tests triplets of genes for indirect interactions • 130,000 TF-target interactions in total

  26. Screening of false positives Targets of TF1 TF1 TF1 shadows TF2: TF2 appears enriched only because it shares common targets with TF1 Targets of TF2 TF2 The master regulators are enriched TFs not shadowed by any other

  27. Masters regulators in the Accumbens shell

  28. Activity profile at different brain regions

  29. siRNA validation has 50-75% success rate Not all targets were tested yet

More Related