710 likes | 911 Views
Biochemistry, computing in biology. Course outline. Recombination. Recombination and crossover. Recombination and crossover. Recombination and crossover. If no exchange of genes ( i.e. phenotypic marker) occurs, recombination event can not be detected. Recombination and crossover.
E N D
Recombination and crossover If no exchange of genes (i.e. phenotypic marker) occurs, recombination event can not be detected
literature • Genome Gymnastics: Unique Modes of DNA Evolution and Processing in Ciliates. David M. Prescott, Nature Reviews Genetics • Computational power of gene rearrangement. Lila Kari and Laura Landweber, DIMACS series in discreet mathematics and theoretical computer science
The ciliate • Very ancient ( ~ 2 . 109 years ago) • Very rich group ( ~ 10000 genetically different organisms) • Very important from the evolutionary point of view
The ciliate • DNA molecules in micronucleus are very long (hundreds of kilo bps) • DNA molecules in macronucleus are gene-size, short (average ~ 2000 bps)
The ciliate tree Baldauf et al. 2000. Science 290:972.
Urostyla grandis Holosticha kessleri Bar: 50 mm Bar: 100 mm Eschaneustyla sp. Scrambled Genes Found S. lemnae O. trifallax O. nova Uroleptus sp. Bar: 25 mm Bar: 100 mm
Nuclei • Micronucleus the small nucleus containing a single copy of the genome that is used for sexual reproduction • Macronucleus the large nucleus that carries up to several hundred copies of the genome and controls metabolism and asexual reproduction
Lifecycle of a ciliate Micronucleus Macronucleus Cutting, splicing, elimination, reordering, and amplification of DNA Prescott, 2000
The ciliate, reproduction MIC MAC Meiosis and Nuclear Exchange Cell Pairing Nuclear Fusion and Duplication of the Zygotic Nucleus Macronuclear Development and Nuclear Degeneration Polytenization Chromatid breakage De novo telomere formation Modified from Larry Klobutcher & Carolyn Jahn Ann. Review Microbiology, 2002
The ciliate Astounding feats of ‘DNA computing’ are routine in this ‘simple’ single -celled organism— a protozoan. In initial micronucleus, DNA is‘junky’ and scrambled, but…. ….it reassembles itself in proper sequence by means of computer-like acrobatics (unscrambling, throwing out genetic ‘junk’)—in macronucleus
The complexity of spirotrich biology Telomere Pointers MAC MIC MDS: macronuclear destined sequences IES: internal eliminated segments
The complexity of gene scrambling • Intervening non-coding DNA regions (IES: internal eliminated segments) interrupt protein-coding sequences (MDS macronuclear destined sequences) • IESs are removed during macronuclear development • MDSs are unscrambled Prescott, 2000
Scramble genes -TBP, actin I, DNA pol -TBP Prescott et al., 1998 Actin I Oxytricha nova Hogan et al., 2001 DNA polymerase Landweber et al., 2000
Degree of scrambling in -TBP Prescott et al, 1998
Unscrambling of actin I Hogan et al, 2001
Degree of scrambling in DNA pol Landweber et al, 2000
DNA folding and recombination DNA pol DNA pol : Hairpin loop Prescott, 2000
Recombination -TBP Prescott et al, 1998
Tracing evolutionary scrambling • Isolate the micronuclear and macronuclear forms of the -TBP gene • Compare the micronuclear and macronuclear gene structures (MDS and IESs) to determine whether the gene is scrambled • Compare homologous MDSs and scrambling patterns in various stichotrich species (earlier diverging species vs later diverging species) • Trace a parsimonious evolutionary scrambling pathway
Comparisons of scrambling complexity Oxytrichidaeand Paraurostyla weissei Uroleptus sp.
The evolution of recombination Paraurostyla weissei Uroleptus sp. Stylonychia mytilus 100 100 Oxytricha nova 100 Oxytricha trifallax
Evolutionary scrambling pathway Holosticha sp. S. mytilus O. nova O. trifallax Uroleptus sp. P. weissei
Ciliate computing • The process of gene unscrambling in hypotrichous ciliates represents one of nature’s ingenious solutions to the computational problem of gene assembly. • With some essential genes fragmented in as many as 50 pieces, these organisms rely on a set of sequence and structural clues to detangle their coding regions. • For example, pointer sequences present at the junctions between coding and non-coding sequences permit reassembly of the functional copy. As the process of gene unscrambling appears to follow a precise algorithm or set of algorithms, the question remains: what is the actual problem being solved?
The problem in the cell • Genomic Copies of some Protein-coding genes are obscured by intervening non-protein-coding DNA sequence elements (internally eliminated sequences, IES) • Protein-coding sequences (macronuclear destined sequences, MDS) are present in a permuted order, and must be rearranged.
Assumption • By clever structural alignment…, the cell decides which sequences are IES and MDS, as well as which are guides. • After this decision, the process is simply sorting, O(n). • Decision process unknown, but amounts to finding the correct path. Most Costly.
Ciliate computing • there is some as yet undiscovered “oracle”mechanism within the cell, • or the cell simulates non-determinism • the former solution lacks biological credibility and the latter implies exponential time and space explosion. • What we want is a deterministic algorithm for applying the inter- and intra-molecular recombination operations to descramble an arbitrary gene.
Ciliate computing The first proposed step in gene unscrambling—alignment or combinatorial pattern matching—may involve searches through several possible matches, via either intra-molecular or intermolecular strand associations. This part could be similar to Adleman’s (1994) DNA solution of a directed Hamiltonian path problem.
Ciliate computing The second step—homologous recombination at aligned repeats—involves the choice of whether to retain the coding or the non-coding segment between each pair of recombination junctions. This decision process could even be equivalent to solving an n-bit instance of a satisfiability problem, where n is the number of scrambled segments.
Ciliate computing We use our knowledge of the first step to develop a model for the guided homologous recombinations and prove that such a model has the computational power of a Turing machine, the accepted formal model of computation. This indicates that, in principle, these unicellular organisms may have the capacity to perform at least any computation carried out by an electronic computer.
Ciliate computing, the naïve model • Assume the cell simply reconstructs the genes by matching up pointers. • Just one problem... pointer sequences are not unique. In fact, may have multiplicities greater than 13. • The proposed solution to this was that the cell would simply try every possible combination of pointers until it found the right two.
How the cell computes • Relies on short repeat sequences to act as guides in homologous recombination events • Splints analogous to edges in Adleman • One example represents solution of 50 city HP (50 pieces reordered)
Formal model • Guided recombination system