SlideShare ist ein Scribd-Unternehmen logo
1 von 88
Downloaden Sie, um offline zu lesen
Networks in genomics and bioinformatics: from
                      phylogeny to Twitter

                                ISCB2012
                              July 12, 2012

                            Jonathan A. Eisen
                      University of California, Davis
                           @phylogenomics




Friday, July 13, 12
Networks in genomics and bioinformatics: from
                      phylogeny to Twitter

                                ISCB2012
                              July 12, 2012

                            Jonathan A. Eisen
                      University of California, Davis
                           @phylogenomics




Friday, July 13, 12
A meandering path and lessons “learned”

                                  ISCB2012
                                July 12, 2012

                              Jonathan A. Eisen
                        University of California, Davis
                             @phylogenomics




Friday, July 13, 12
Friday, July 13, 12
Social Networking in Science




Friday, July 13, 12
Bacterial evolve




Friday, July 13, 12
Friday, July 13, 12
Phylogenomics of Novelty




Friday, July 13, 12
Phylogenomics of Novelty




Friday, July 13, 12
Phylogenomics of Novelty



        Origin of New
        Functions and
          Processes




Friday, July 13, 12
Phylogenomics of Novelty



        Origin of New
        Functions and
          Processes

      •New genes
      •Changes in old genes
      •Changes in pathways




Friday, July 13, 12
Phylogenomics of Novelty



        Origin of New
        Functions and
          Processes

      •New genes
      •Changes in old genes
      •Changes in pathways




Friday, July 13, 12
Phylogenomics of Novelty



        Origin of New                      Genome
        Functions and                      Dynamics
          Processes

      •New genes
      •Changes in old genes
      •Changes in pathways




Friday, July 13, 12
Phylogenomics of Novelty



        Origin of New                      Genome
        Functions and                      Dynamics
          Processes
                                          •Evolvability
      •New genes                          •Repair and
      •Changes in old genes               recombination processes
      •Changes in pathways                •Intragenomic variation




Friday, July 13, 12
Phylogenomics of Novelty



        Origin of New                      Genome
        Functions and                      Dynamics
          Processes
                                          •Evolvability
      •New genes                          •Repair and
      •Changes in old genes               recombination processes
      •Changes in pathways                •Intragenomic variation




Friday, July 13, 12
Phylogenomics of Novelty



        Origin of New                              Genome
        Functions and                              Dynamics
          Processes
                                                  •Evolvability
      •New genes                                  •Repair and
      •Changes in old genes                       recombination processes
      •Changes in pathways                        •Intragenomic variation




                              Species Evolution




Friday, July 13, 12
Phylogenomics of Novelty



        Origin of New                                             Genome
        Functions and                                             Dynamics
          Processes
                                                                 •Evolvability
      •New genes                                                 •Repair and
      •Changes in old genes                                      recombination processes
      •Changes in pathways                                       •Intragenomic variation




                                  Species Evolution
                              •Phylogenetic history
                              •Vertical vs. horizontal descent
                              •Needed to track gain/loss of
                              processes, infer convergence
Friday, July 13, 12
Undergrad Lesson 1:
                      Be prepared for random events


      • Gould’s class b/c planned on not majoring
        in Biology
      • RMBL via backpacking trip
      • Geology library job w/ Nabokov collection
        b/c went to wrong building
      • Discovering Colleen Cavanaugh’s lab via
        street encounter



Friday, July 13, 12
Undergrad Lesson 2:
                       Phylogeny Matters

      • “MacClade”
      • Phylogenetic ecology
      • Phylotyping




Friday, July 13, 12
Phylogeny Matters




                                          Eisen et
                                          al. 1992
Friday, July 13, 12
Grad school lesson I:
                      find right people to work with

      • Went to work on butterfly population biology
        and phylogeny
      • Advisor and I did not see eye to eye
      • Despite great subject for me (combined
        phylogeny, molecular evolution, RMBL, etc),
        chose not to join lab
      • Did many rotations …
      • Picked final lab in part b/c advisor was right
        match


Friday, July 13, 12
Grad school lesson II:
                      never too late to change

      • Wanted to combine DNA repair studies and
        molecular evolution
      • I: Thymineless death
      • II: Adaptive mutation
      • III: Repair in archaea




Friday, July 13, 12
Friday, July 13, 12
Grad school lesson II:
                      never too late to change

      • Wanted to combine DNA repair studies and
        molecular evolution
      • I: Thymineless death
      • II: Adaptive mutation
      • III: Repair in archaea
      • IV: Bioinformatics and genome analysis …




Friday, July 13, 12
Grad school lesson III:
                      Get others to do your work



          • Interested in RecA structure function
            relationships
          • Using phylogeny to look for correlated
            substitutions in RecA structure, like
            done with rRNA
          • But not enough sequences …




Friday, July 13, 12
Friday, July 13, 12
Shotgun Sequencing Allows Use of Alternative
                           Anchors (e.g., RecA)




                                                     Venter et al., 2004
Friday, July 13, 12
Grad school lesson IV:
                         Stealing is good



          • Phylogenetic perspective in
            bioinformatics missing




Friday, July 13, 12
“Nothing in biology makes sense
               except in the light of evolution.”

                      T. H. Dobzhansky (1973)




Friday, July 13, 12
Evolutionary Perspective and
                          Comparative Biology

     • Comparative biology is the analysis of
       differences and similarities between
       species.

     • An evolutionary perspective is useful in
       such studies because this allows one to
       focus not just on the levels and degrees of
       similarity or difference but on how and why
       similarities and differences came to be.


Friday, July 13, 12
Phylogenomics



      • Lots of sequences being produced with no
        functions associated with them
      • Much debate in community about how to
        predict functions




Friday, July 13, 12
Predicting Function



          • Identification of motifs
          • Homology/similarity based methods
                 •    Highest hit
                 •    Top hits
                 •    Clusters of orthologous groups
                 •    HMM models
                 •    Structural threading and modeling
                 •    Evolutionary reconstructions




Friday, July 13, 12
Phylogeny Matters




                                          Eisen et
                                          al. 1992
Friday, July 13, 12
Evolutionary Functional Prediction
                                       EXAMPLE A                                   METHOD                          EXAMPLE B

                                               2A                         CHOOSE GENE(S) OF INTEREST                        5


                                            3A                                                                          1 3 4
                                                 2B                                                                 2
                                                                             IDENTIFY HOMOLOGS                             5
                                       1A 2A 1B 3B                                                                       6



                                                                              ALIGN SEQUENCES

                              1A      2A    3A 1B        2B      3B                                      1    2         3       4   5   6



                                                                            CALCULATE GENE TREE


                                                       Duplication?


                             1A       2A 3A 1B          2B      3B                                       1    2         3       4   5   6



                                                                              OVERLAY KNOWN
                                                                            FUNCTIONS ONTO TREE

                                                       Duplication?


                                                                                                        1      2        3       4   5   6
                             1A       2A 3A 1B          2B      3B



                                                                            INFER LIKELY FUNCTION
                                                                            OF GENE(S) OF INTEREST
                                                                                                       Ambiguous

                                                       Duplication?



                          Species 1        Species 2          Species 3
                           1A 1B            2A 2B              3A 3B                                     1    2         3       4   5   6


                                                                              ACTUAL EVOLUTION
                                                                          (ASSUMED TO BE UNKNOWN)

                                                                                                                                            Based on Eisen,
                                                       Duplication
                                                                                                                                            1998 Genome
                                                                                                                                            Res 8: 163-167.
Friday, July 13, 12
Similarity ≠ Relatedness




Friday, July 13, 12
Evolutionary Rate Variation




Friday, July 13, 12
Phylogenetic Prediction of
                              Function

      • Many powerful and automated similarity based
        methods for assigning genes to protein families
             • COGs
             • PFAM HMM searches
      • Some limitations of similarity based methods can be
        overcome by phylogenetic approaches
      • Automated methods now available
             • Sean Eddy
             • Steven Brenner
             • Kimmen Sjölander
      • But …


Friday, July 13, 12
Grad school lesson V:
                      Teaching helps you learn




Friday, July 13, 12
Grad school lesson VI:
                      There are no career rules




Friday, July 13, 12
Career Lesson I:
                      Build on what you know

      •    Phylogenetic approaches to genomics
      •    Genomics of endosymbionts
      •    Genomic studies of communities
      •    Analysis of DNA repair genes in genome
           sequences
      •    Phylogenomics of halophilic archaea
      •    GEBA
      •    Phylogenetic metagenomics
      •    ...

Friday, July 13, 12
Career Lesson II:
                      Don’t Only Use What You Know




Friday, July 13, 12
What We Don’t Know Can Hurt Us




Friday, July 13, 12
D. radiodurans genome




Friday, July 13, 12
DNA Repair Genes in D. radiodurans


       Process                      Genes in D. radiodur a n s

       Nucleotide Excision Repair   UvrABCD, UvrA2
       Base Excision Repair         AlkA, Ung, Ung2, GT, MutM, MutY-Nths,
                                    MPG
       AP Endonuclease              Xth
       Mismatch Excision Repair     MutS, MutL
       Recombination
        Initiation                  RecFJNRQ, SbcCD, RecD
        Recombinase                 RecA
        Migration and resolution    RuvABC, RecG
       Replication                  PolA, PolC, PolX, phage Pol
       Ligation                     DnlJ
       dNTP pools, cleanup          MutTs, RRase
       Other                        LexA, RadA, HepA, UVDE, MutS2


Friday, July 13, 12
Problem ...



      • List of DNA repair gene homologs in
        D. radiodurans genome is not
        significantly different from other
        bacterial genomes of the similar size




Friday, July 13, 12
Repair Studies in Different Species
                  (via Medline searches as of 1998)
                       Humans          7028
                       E. coli         3926
                       S. cerevisiae   988
                       Drosophila      387
                       B. subtilits    284
                       S. pombe        116
                       Xenopus         56
                       C. elegans      25
                       A. thaliana     20
                       Methanogens     16
                       Haloferax       5
                       Giardia         0




Friday, July 13, 12
Proteobacteria
                      TM6
                      OS-K
                                                ~40 Phyla of
                      Acidobacteria
                      Termite Group
                      OP8
                                                Bacteria
                      Nitrospira
                      Bacteroides
                      Chlorobi
                      Fibrobacteres
                      Marine GroupA
                      WS3
                      Gemmimonas
                      Firmicutes
                      Fusobacteria
                      Actinobacteria
                      OP9
                      Cyanobacteria
                      Synergistes
                      Deferribacteres
                      Chrysiogenetes
                      NKB19
                      Verrucomicrobia
                      Chlamydia
                      OP3
                      Planctomycetes
                      Spriochaetes                  0.1
                      Coprothmermobacter
                      OP10
                      Thermomicrobia
                      Chloroflexi
                      TM7
                      Deinococcus-Thermus
                      Dictyoglomus
                      Aquificae                Tree based on
                      Thermudesulfobacteria
                      Thermotogae             Hugenholtz (2002)
                      OP1                     with some
                      OP11                    modifications.
Friday, July 13, 12
Proteobacteria
                      TM6
                      OS-K
                      Acidobacteria             Most DNA
                      Termite Group
                      OP8
                      Nitrospira                metabolism
                      Bacteroides
                      Chlorobi
                      Fibrobacteres
                                                studies in
                      Marine GroupA
                      WS3
                      Gemmimonas
                                                two Phyla
                      Firmicutes
                      Fusobacteria
                      Actinobacteria
                      OP9
                      Cyanobacteria
                      Synergistes
                      Deferribacteres
                      Chrysiogenetes
                      NKB19
                      Verrucomicrobia
                      Chlamydia
                      OP3
                      Planctomycetes
                      Spriochaetes                  0.1
                      Coprothmermobacter
                      OP10
                      Thermomicrobia
                      Chloroflexi
                      TM7
                      Deinococcus-Thermus
                      Dictyoglomus
                      Aquificae                Tree based on
                      Thermudesulfobacteria
                      Thermotogae             Hugenholtz (2002)
                      OP1                     with some
                      OP11                    modifications.
Friday, July 13, 12
Proteobacteria
                      TM6
                      OS-K
                      Acidobacteria             Deinococcus
                      Termite Group
                      OP8
                      Nitrospira                is very distant
                      Bacteroides
                      Chlorobi
                      Fibrobacteres
                                                from well
                      Marine GroupA
                      WS3
                      Gemmimonas
                                                studied
                      Firmicutes
                      Fusobacteria              groups
                      Actinobacteria
                      OP9
                      Cyanobacteria
                      Synergistes
                      Deferribacteres
                      Chrysiogenetes
                      NKB19
                      Verrucomicrobia
                      Chlamydia
                      OP3
                      Planctomycetes
                      Spriochaetes                  0.1
                      Coprothmermobacter
                      OP10
                      Thermomicrobia
                      Chloroflexi
                      TM7
                      Deinococcus-Thermus
                      Dictyoglomus
                      Aquificae                Tree based on
                      Thermudesulfobacteria
                      Thermotogae             Hugenholtz (2002)
                      OP1                     with some
                      OP11                    modifications.
Friday, July 13, 12
Gain and Loss of Repair Genes                     BACTERIA                                                                 ARCHAEA                          EUKARYOTES




                                                         Helpy




                                                                                                                           Trepa
                     Ecoli




                                                                                                                                                                                                Human
                                                                                          Mycge


                                                                                                   Mycpn
                                                                    Bacsu




                                                                                                                                   Synsp
                                                                                                               Borbu




                                                                                                                                                                     Metth
                                        Neigo




                                                                                                                                                                                     Yeast
                                                                                                                                                          Arcfu
                                                                               Strpy




                                                                                                                                              Metjn
                             Haein                                           -Ogt
                               -PhrI                                                                                                                  -AlkA -Nfo       -AlkA
                                                -Ogt                   -PhrI -AlkA                                 -Ogt   -Ung                        -Xth             -Rad25
                               -AlkA            -AlkA                        -Nfo
                               -Nfo                                                                                -RecFRQN                           -Rad25?
              R
              + us                              -TagI                        -RecQ                                 -RuvC                                                                       +P53
           UmuD
            +                  -Vsr             -Nfo                         -SbcD?                                                                                                           dRecQ
                               -SbcCD                                                                              -Dut                                                          +Rad7
           +Nei?                                -Rec                         -Lon                                                                                                            dRad23
                               -LexA                                                                               -SMS                                                         +CCE1
          +RecE                                 -SbcCD                       -LexA                                                                                                           +MAG?
         tRecT?                -UmuC            -LexA             +Spr tTagI ?                                         tRad25
                                                                 t3MG
                                                                                                  -PhrI                                                  -PhrII
                                                          -PhrI                                   -Ogt                   -PhrI                           -Ogg      tUvrABCD
                  Ada
                  +                                       -PhrII                                  -AlkA                                -Ogt
                 MutH
                 +                                                                                                       -PhrII?
                                                          -AlkA                                   -Xth                   -AlkA         -Ung
                 SbcB
                  +                                       -Fpg                                    -MutLS                               -Nfo
                                                                                                                         -Fpg
                                                          -Nfo                                    -RecFJORQN             -Nfo          -Dut
                                                          -MutLS                                  -Mfd                   -RecO         -Lon                       -PhrI
                                                          -RecFORQ                                -SbcCD                 -LexA                                    -Ung?
                                     -PhrII               -SbcCD                                  -RecG                  -UmuC                                    -MutLS
                                                          -LexA                                   -Dut                                                            -RecQ?
                           + sr
                           V                              -UmuC                                   -PriA                                                           -Dut
                       RecBCD?
                         +                                -TagI+RecT                              -LexA                                                           -UmuC
                                                                                                  -SMS
                                                                                                  -MutT                                                                               RFAs
                                                                                                                                                                                       +
                                                                                       -PhrII                                                                                      +TFIIH
                                                                                       -RuvC                                                                           +Rad4,10,14,16,23,26
                                                                                                                                                                                       CSA
                                                                                                                                                                                        +
                                                                                                                                                                               Rad52,53,54
                                                                                                                                                                                    +
                                     +TagI?                                                                                        dPhr                                       DNA-PK, Ku
                                                                                                                                                                                   +
                                                                                                                                                                                      SNF2
                                                                                                                                                                                       d
                                                                                    TagI?
                                                                                     +                                                                                              dMutS
                                                                                    +Fpg                                                                                            dMutL
                                                                                UvrABCD
                                                                                  +                                                                                                 dRecA
                                                                                     Mfd
                                                                                      +
                                                                               RecFJNOR
                                                                                  +                                                                                                                      Ung?
                                                                                                                                                                                                         +
                                                                                 RuvABC
                                                                                   +                                                                                                                     SSB,
                                                                                                                                                                                                          +
                                                                                  +RecG                                                                                Rad1
                                                                                                                                                                        +                               +Dut?
                                                                                     LigI
                                                                                      +                                                                               +Rad2                                     from mitochondria
                                                                                    LexA
                                                                                     +                                                                              +Rad25?
                                                                                      SSB
                                                                                      +                                                                                 Ogg
                                                                                                                                                                        +
                                                                                   +PriA                                                                               LigII
                                                                                                                                                                        +
                                                                                   +Dut?



                                                                                                                      PhrI, PhrII
                                                                                                                         +
                                                                                                                            +Ogt
                                                                                                           +Ung, AlkA, MutY-Nth
                                                                                                                          +AlkA
                                                                                                                      +Xth, Nfo?
                                                                                                                       +MutLS?
                                                                                                                        +SbcCD
                                                                                                                         +RecA
                                                                                                                        +UmuC
                                                                                                                         +MutT
                                                                                                                           +Lon
                                                                                                                                                                                          Eisen and Hanawalt, 1999 Mut
                                                                                                                  dMutSI/MutSII
                                                                                                                     dRecA/SMS                                                            Res 435: 171-213
                                                                                                                     dPhrI/PhrII

Friday, July 13, 12
Solution - Experiments




Friday, July 13, 12
What We Don’t Know Can Hurt Us




Friday, July 13, 12
As of 2002            Proteobacteria
                      TM6
                      OS-K
                                              • At least 40
                                                phyla of
                      Acidobacteria
                      Termite Group
                      OP8
                      Nitrospira
                      Bacteroides
                                                bacteria
                      Chlorobi
                      Fibrobacteres
                      Marine GroupA
                      WS3
                      Gemmimonas
                      Firmicutes
                      Fusobacteria
                      Actinobacteria
                      OP9
                      Cyanobacteria
                      Synergistes
                      Deferribacteres
                      Chrysiogenetes
                      NKB19
                      Verrucomicrobia
                      Chlamydia
                      OP3
                      Planctomycetes
                      Spriochaetes
                      Coprothmermobacter
                      OP10
                      Thermomicrobia
                      Chloroflexi
                      TM7
                      Deinococcus-Thermus
                      Dictyoglomus
                      Aquificae
                      Thermudesulfobacteria
                      Thermotogae
                      OP1                       Based on Hugenholtz,
                      OP11                      2002
Friday, July 13, 12
As of 2002            Proteobacteria
                      TM6
                      OS-K
                                              • At least 40
                      Acidobacteria
                      Termite Group             phyla of
                      OP8
                      Nitrospira
                      Bacteroides
                                                bacteria
                      Chlorobi
                      Fibrobacteres
                      Marine GroupA
                                              • Most genomes
                      WS3
                      Gemmimonas                from three
                      Firmicutes
                      Fusobacteria              phyla
                      Actinobacteria
                      OP9
                      Cyanobacteria
                      Synergistes
                      Deferribacteres
                      Chrysiogenetes
                      NKB19
                      Verrucomicrobia
                      Chlamydia
                      OP3
                      Planctomycetes
                      Spriochaetes
                      Coprothmermobacter
                      OP10
                      Thermomicrobia
                      Chloroflexi
                      TM7
                      Deinococcus-Thermus
                      Dictyoglomus
                      Aquificae
                      Thermudesulfobacteria
                      Thermotogae
                      OP1                       Based on Hugenholtz,
                      OP11                      2002
Friday, July 13, 12
As of 2002            Proteobacteria
                      TM6
                      OS-K
                                              • At least 40
                      Acidobacteria
                      Termite Group             phyla of
                      OP8
                      Nitrospira
                      Bacteroides
                                                bacteria
                      Chlorobi
                      Fibrobacteres
                      Marine GroupA
                                              • Most genomes
                      WS3
                      Gemmimonas                from three
                      Firmicutes
                      Fusobacteria              phyla
                      Actinobacteria
                      OP9
                      Cyanobacteria
                                              • Some studies
                      Synergistes
                      Deferribacteres
                      Chrysiogenetes
                                                in other phyla
                      NKB19
                      Verrucomicrobia
                      Chlamydia
                      OP3
                      Planctomycetes
                      Spriochaetes
                      Coprothmermobacter
                      OP10
                      Thermomicrobia
                      Chloroflexi
                      TM7
                      Deinococcus-Thermus
                      Dictyoglomus
                      Aquificae
                      Thermudesulfobacteria
                      Thermotogae
                      OP1                       Based on Hugenholtz,
                      OP11                      2002
Friday, July 13, 12
As of 2002            Proteobacteria
                      TM6
                      OS-K
                                              • At least 40
                      Acidobacteria
                      Termite Group             phyla of
                      OP8
                      Nitrospira
                      Bacteroides
                                                bacteria
                      Chlorobi
                      Fibrobacteres
                      Marine GroupA
                                              • Most genomes
                      WS3
                      Gemmimonas                from three
                      Firmicutes
                      Fusobacteria              phyla
                      Actinobacteria
                      OP9
                      Cyanobacteria
                                              • Some other
                      Synergistes
                      Deferribacteres
                      Chrysiogenetes
                                                phyla are only
                      NKB19
                      Verrucomicrobia           sparsely
                      Chlamydia
                      OP3
                      Planctomycetes
                                                sampled
                      Spriochaetes
                      Coprothmermobacter      • Same trend in
                      OP10
                      Thermomicrobia
                      Chloroflexi
                                                Eukaryotes
                      TM7
                      Deinococcus-Thermus
                      Dictyoglomus
                      Aquificae
                      Thermudesulfobacteria
                      Thermotogae
                      OP1                       Based on Hugenholtz,
                      OP11                      2002
Friday, July 13, 12
As of 2002            Proteobacteria
                      TM6
                      OS-K
                                              • At least 40
                      Acidobacteria
                      Termite Group             phyla of
                      OP8
                      Nitrospira
                      Bacteroides
                                                bacteria
                      Chlorobi
                      Fibrobacteres
                      Marine GroupA
                                              • Most genomes
                      WS3
                      Gemmimonas                from three
                      Firmicutes
                      Fusobacteria              phyla
                      Actinobacteria
                      OP9
                      Cyanobacteria
                                              • Some other
                      Synergistes
                      Deferribacteres
                      Chrysiogenetes
                                                phyla are only
                      NKB19
                      Verrucomicrobia           sparsely
                      Chlamydia
                      OP3
                      Planctomycetes
                                                sampled
                      Spriochaetes
                      Coprothmermobacter      • Same trend in
                      OP10
                      Thermomicrobia
                      Chloroflexi
                                                Viruses
                      TM7
                      Deinococcus-Thermus
                      Dictyoglomus
                      Aquificae
                      Thermudesulfobacteria
                      Thermotogae
                      OP1                       Based on Hugenholtz,
                      OP11                      2002
Friday, July 13, 12
Friday, July 13, 12
GEBA




                      http://www.jgi.doe.gov/programs/GEBA/pilot.html

Friday, July 13, 12
rRNA Tree of Life
                      Bacteria




                                                               Archaea




                       Eukaryotes
                       Figure from Barton, Eisen et al. “Evolution”,
                                   CSHL Press. 2007.
                         Based on tree from Pace 1997 Science
                                     276:734-740

Friday, July 13, 12
PD: Genomes




From Wu
et al. 2009
Nature
462,
1056-1060


Friday, July 13, 12
PD: Genomes + GEBA




From Wu
et al. 2009
Nature
462,
1056-1060


Friday, July 13, 12
PD: Isolates




                            From Wu et al. 2009 Nature 462, 1056-1060
Friday, July 13, 12
rRNA Tree of Life
                      Bacteria




                                                                Archaea
                                                                       ??????




                       Eukaryotes
                      Figure from Barton, Eisen et al. “Evolution”,
                                  CSHL Press. 2007.                   Wu et al. (2011) PLoS ONE 6(3):
                                                                      e18011. doi:10.1371/
                        Based on tree from Pace 1997 Science          journal.pone.0018011
                                    276:734-740

Friday, July 13, 12
????

                             Phage




                             Phage

                             ????




                      Thaumarchaeot




Friday, July 13, 12
GEBA uncultured
    Number of SAGs from Candidate Phyla




                                                                406
                                                    1
                                             OD1

                                                   OP1

                                                         OP3

                                                               SAR
    Site   A: Hydrothermal vent               4      1    -     -
    Site   B: Gold Mine                       6     13    2     -
    Site   C: Tropical gyres (Mesopelagic)    -      -    -     2
    Site   D: Tropical gyres (Photic zone)    1      -    -     -




 Sample collections at 4 additional sites are underway.




                                                                                Phil Hugenholtz




                                                                           56


Friday, July 13, 12
Uncharacterized genes




Friday, July 13, 12
Non homology functional



      • Many genes have homologs in other
        species but no homologs have ever been
        studied experimentally
      • Non-homology methods can make
        functional predictions for these




Friday, July 13, 12
Phylogenetic profiling basis



      • Microbial genes are lost rapidly when not
        maintained by selection
      • Genes can be acquired by lateral transfer
      • Frequently gain and loss occurs for entire
        pathways/processes
      • Thus might be able to use correlated
        presence/absence information to identify
        genes with similar functions



Friday, July 13, 12
Non-Homology Predictions:
                        Phylogenetic Profiling



           • Step 1: Search all genes in
             organisms of interest against
             all other genomes

           • Ask: Yes or No, is each gene
             found in each other species

           • Cluster genes by distribution
             patterns (profiles)


Friday, July 13, 12
Carboxydothermus
                      hydrogenoformans

   • Isolated from a Russian hotspring
   • Thermophile (grows at 80°C)
   • Anaerobic
   • Grows very efficiently on CO (Carbon
     Monoxide)
   • Produces hydrogen gas
   • Low GC Gram positive (Firmicute)
   • Genome Determined (Wu et al. 2005
     PLoS Genetics 1: e65. )




Friday, July 13, 12
Homologs of Sporulation Genes




                                                Wu et al. 2005
                                                PLoS Genetics 1:
                                                e65.
Friday, July 13, 12
Carboxydothermus sporulates




                            Wu et al. 2005 PLoS Genetics 1: e65.
Friday, July 13, 12
Wu et al. 2005 PLoS Genetics 1: e65.
Friday, July 13, 12
PG Profiling Works Better Using
                                 Orthology




Friday, July 13, 12
PG Profiling Works Better Using
                          Independent Contrasts




Friday, July 13, 12
Career Lesson III:
                      Networks Matter




Friday, July 13, 12
Protein Family Rarefaction Curves

      • Take data set of multiple complete
        genomes
      • Identify all protein families using MCL
      • Plot # of genomes vs. # of protein families




Friday, July 13, 12
Wu et al. 2009 Nature 462, 1056-1060

Friday, July 13, 12
Wu et al. 2009 Nature 462, 1056-1060

Friday, July 13, 12
Wu et al. 2009 Nature 462, 1056-1060

Friday, July 13, 12
Wu et al. 2009 Nature 462, 1056-1060

Friday, July 13, 12
Wu et al. 2009 Nature 462, 1056-1060

Friday, July 13, 12
Synapomorphies exist




Wu et al. 2009 Nature 462, 1056-1060

Friday, July 13, 12
Metagenomics




Friday, July 13, 12
Binning challenge




Friday, July 13, 12
B
                      A




                                    C




                          Sharpton et al. submitted

Friday, July 13, 12
Career Lesson IV:
                      Openness Helps




Friday, July 13, 12

Weitere ähnliche Inhalte

Andere mochten auch

Intro to data visualization
Intro to data visualizationIntro to data visualization
Intro to data visualizationJan Aerts
 
Chamberlain PhD Thesis
Chamberlain PhD ThesisChamberlain PhD Thesis
Chamberlain PhD Thesisschamber
 
VIZBI 2014 - Visualizing Genomic Variation
VIZBI 2014 - Visualizing Genomic VariationVIZBI 2014 - Visualizing Genomic Variation
VIZBI 2014 - Visualizing Genomic VariationJan Aerts
 
Jonathan Eisen: Phylogenetic approaches to the analysis of genomes and metage...
Jonathan Eisen: Phylogenetic approaches to the analysis of genomes and metage...Jonathan Eisen: Phylogenetic approaches to the analysis of genomes and metage...
Jonathan Eisen: Phylogenetic approaches to the analysis of genomes and metage...Jonathan Eisen
 
Tetrahymena genome project 2003 presentation by Jonathan Eisen
Tetrahymena genome project 2003 presentation by Jonathan EisenTetrahymena genome project 2003 presentation by Jonathan Eisen
Tetrahymena genome project 2003 presentation by Jonathan EisenJonathan Eisen
 
The neurobiological nature of free will
The neurobiological nature of free willThe neurobiological nature of free will
The neurobiological nature of free willBjörn Brembs
 
Evolution of the RecA Protein: from Systematics to Structure 1995 talk for CA...
Evolution of the RecA Protein: from Systematics to Structure 1995 talk for CA...Evolution of the RecA Protein: from Systematics to Structure 1995 talk for CA...
Evolution of the RecA Protein: from Systematics to Structure 1995 talk for CA...Jonathan Eisen
 
E Talevich - Biopython project-update
E Talevich - Biopython project-updateE Talevich - Biopython project-update
E Talevich - Biopython project-updateJan Aerts
 
Humanizing bioinformatics
Humanizing bioinformaticsHumanizing bioinformatics
Humanizing bioinformaticsJan Aerts
 
Intel Theater Presentation - SC11
Intel Theater Presentation - SC11Intel Theater Presentation - SC11
Intel Theater Presentation - SC11Deepak Singh
 
A brief description of the Chemical Rediscovery Survey and Open Chemistry in ...
A brief description of the Chemical Rediscovery Survey and Open Chemistry in ...A brief description of the Chemical Rediscovery Survey and Open Chemistry in ...
A brief description of the Chemical Rediscovery Survey and Open Chemistry in ...Jean-Claude Bradley
 
Jonathan Eisen @phylogenomics talk for #LAMG12
Jonathan Eisen @phylogenomics talk for #LAMG12Jonathan Eisen @phylogenomics talk for #LAMG12
Jonathan Eisen @phylogenomics talk for #LAMG12Jonathan Eisen
 
Evolution of gene family size change in fungi
Evolution of gene family size change in fungiEvolution of gene family size change in fungi
Evolution of gene family size change in fungiJason Stajich
 
The Sam Adams talk
The Sam Adams talkThe Sam Adams talk
The Sam Adams talkRoderic Page
 
Fungal ITS meeting presentation
Fungal ITS meeting presentationFungal ITS meeting presentation
Fungal ITS meeting presentationHolly Bik
 
Using Social Media in Research
Using Social Media in ResearchUsing Social Media in Research
Using Social Media in ResearchHolly Bik
 
Perl for Phyloinformatics
Perl for PhyloinformaticsPerl for Phyloinformatics
Perl for PhyloinformaticsRutger Vos
 
yw jakartarb20101031
yw jakartarb20101031yw jakartarb20101031
yw jakartarb20101031Yannick Wurm
 

Andere mochten auch (20)

Intro to data visualization
Intro to data visualizationIntro to data visualization
Intro to data visualization
 
Chamberlain PhD Thesis
Chamberlain PhD ThesisChamberlain PhD Thesis
Chamberlain PhD Thesis
 
VIZBI 2014 - Visualizing Genomic Variation
VIZBI 2014 - Visualizing Genomic VariationVIZBI 2014 - Visualizing Genomic Variation
VIZBI 2014 - Visualizing Genomic Variation
 
Jonathan Eisen: Phylogenetic approaches to the analysis of genomes and metage...
Jonathan Eisen: Phylogenetic approaches to the analysis of genomes and metage...Jonathan Eisen: Phylogenetic approaches to the analysis of genomes and metage...
Jonathan Eisen: Phylogenetic approaches to the analysis of genomes and metage...
 
Tetrahymena genome project 2003 presentation by Jonathan Eisen
Tetrahymena genome project 2003 presentation by Jonathan EisenTetrahymena genome project 2003 presentation by Jonathan Eisen
Tetrahymena genome project 2003 presentation by Jonathan Eisen
 
The neurobiological nature of free will
The neurobiological nature of free willThe neurobiological nature of free will
The neurobiological nature of free will
 
Evolution of the RecA Protein: from Systematics to Structure 1995 talk for CA...
Evolution of the RecA Protein: from Systematics to Structure 1995 talk for CA...Evolution of the RecA Protein: from Systematics to Structure 1995 talk for CA...
Evolution of the RecA Protein: from Systematics to Structure 1995 talk for CA...
 
ORCID Principles
ORCID PrinciplesORCID Principles
ORCID Principles
 
E Talevich - Biopython project-update
E Talevich - Biopython project-updateE Talevich - Biopython project-update
E Talevich - Biopython project-update
 
Humanizing bioinformatics
Humanizing bioinformaticsHumanizing bioinformatics
Humanizing bioinformatics
 
Intel Theater Presentation - SC11
Intel Theater Presentation - SC11Intel Theater Presentation - SC11
Intel Theater Presentation - SC11
 
A brief description of the Chemical Rediscovery Survey and Open Chemistry in ...
A brief description of the Chemical Rediscovery Survey and Open Chemistry in ...A brief description of the Chemical Rediscovery Survey and Open Chemistry in ...
A brief description of the Chemical Rediscovery Survey and Open Chemistry in ...
 
Jonathan Eisen @phylogenomics talk for #LAMG12
Jonathan Eisen @phylogenomics talk for #LAMG12Jonathan Eisen @phylogenomics talk for #LAMG12
Jonathan Eisen @phylogenomics talk for #LAMG12
 
Evolution of gene family size change in fungi
Evolution of gene family size change in fungiEvolution of gene family size change in fungi
Evolution of gene family size change in fungi
 
The Sam Adams talk
The Sam Adams talkThe Sam Adams talk
The Sam Adams talk
 
ESA 2012 talk
ESA 2012 talkESA 2012 talk
ESA 2012 talk
 
Fungal ITS meeting presentation
Fungal ITS meeting presentationFungal ITS meeting presentation
Fungal ITS meeting presentation
 
Using Social Media in Research
Using Social Media in ResearchUsing Social Media in Research
Using Social Media in Research
 
Perl for Phyloinformatics
Perl for PhyloinformaticsPerl for Phyloinformatics
Perl for Phyloinformatics
 
yw jakartarb20101031
yw jakartarb20101031yw jakartarb20101031
yw jakartarb20101031
 

Ähnlich wie Jonathan Eisen talk for #SCS2012 at #ISMB "Networks in genomics and bioinformatics: from phylogeny to Twitter"

Gene expression
Gene expressionGene expression
Gene expressionchavarisa
 
Natural selection - an introduction
Natural selection - an introduction Natural selection - an introduction
Natural selection - an introduction Stephanie Beck
 
Course Design GuideSCI230 Version 71Course Design Gui.docx
Course Design GuideSCI230 Version 71Course Design Gui.docxCourse Design GuideSCI230 Version 71Course Design Gui.docx
Course Design GuideSCI230 Version 71Course Design Gui.docxfaithxdunce63732
 
Evolutionary Genetics by: Kim Jim F. Raborar, RN, MAEd(ue)
Evolutionary Genetics by: Kim Jim F. Raborar, RN, MAEd(ue)Evolutionary Genetics by: Kim Jim F. Raborar, RN, MAEd(ue)
Evolutionary Genetics by: Kim Jim F. Raborar, RN, MAEd(ue)Kim Jim Raborar
 
So many different kinds of mistakes
So many different kinds of mistakesSo many different kinds of mistakes
So many different kinds of mistakesLiliana Davalos
 
Genetic and Evolutionary Roots of Behavior
Genetic and Evolutionary Roots of BehaviorGenetic and Evolutionary Roots of Behavior
Genetic and Evolutionary Roots of BehaviorMeghan Fraley
 
Student Teaching Work Sample
Student Teaching Work SampleStudent Teaching Work Sample
Student Teaching Work Samplegtickerhoof
 
Chapter 3 human development
Chapter 3 human developmentChapter 3 human development
Chapter 3 human developmentAnne Baroy
 
Cloning Endangered Species
Cloning Endangered SpeciesCloning Endangered Species
Cloning Endangered SpeciesMorganScience
 
Plegable biologia molecular-Manuela Colorado
Plegable biologia molecular-Manuela ColoradoPlegable biologia molecular-Manuela Colorado
Plegable biologia molecular-Manuela Coloradomanu_colorado
 
Marine Host-Microbiome Interactions: Challenges and Opportunities
Marine Host-Microbiome Interactions: Challenges and OpportunitiesMarine Host-Microbiome Interactions: Challenges and Opportunities
Marine Host-Microbiome Interactions: Challenges and OpportunitiesJonathan Eisen
 

Ähnlich wie Jonathan Eisen talk for #SCS2012 at #ISMB "Networks in genomics and bioinformatics: from phylogeny to Twitter" (16)

Gene expression
Gene expressionGene expression
Gene expression
 
Genetic books slides for net.
Genetic books slides for net.Genetic books slides for net.
Genetic books slides for net.
 
Subtle animated2
Subtle animated2Subtle animated2
Subtle animated2
 
Natural selection - an introduction
Natural selection - an introduction Natural selection - an introduction
Natural selection - an introduction
 
Course Design GuideSCI230 Version 71Course Design Gui.docx
Course Design GuideSCI230 Version 71Course Design Gui.docxCourse Design GuideSCI230 Version 71Course Design Gui.docx
Course Design GuideSCI230 Version 71Course Design Gui.docx
 
Genetic reaserch
Genetic reaserch Genetic reaserch
Genetic reaserch
 
Bio chemist interview
Bio chemist interviewBio chemist interview
Bio chemist interview
 
Presentation2
Presentation2Presentation2
Presentation2
 
Evolutionary Genetics by: Kim Jim F. Raborar, RN, MAEd(ue)
Evolutionary Genetics by: Kim Jim F. Raborar, RN, MAEd(ue)Evolutionary Genetics by: Kim Jim F. Raborar, RN, MAEd(ue)
Evolutionary Genetics by: Kim Jim F. Raborar, RN, MAEd(ue)
 
So many different kinds of mistakes
So many different kinds of mistakesSo many different kinds of mistakes
So many different kinds of mistakes
 
Genetic and Evolutionary Roots of Behavior
Genetic and Evolutionary Roots of BehaviorGenetic and Evolutionary Roots of Behavior
Genetic and Evolutionary Roots of Behavior
 
Student Teaching Work Sample
Student Teaching Work SampleStudent Teaching Work Sample
Student Teaching Work Sample
 
Chapter 3 human development
Chapter 3 human developmentChapter 3 human development
Chapter 3 human development
 
Cloning Endangered Species
Cloning Endangered SpeciesCloning Endangered Species
Cloning Endangered Species
 
Plegable biologia molecular-Manuela Colorado
Plegable biologia molecular-Manuela ColoradoPlegable biologia molecular-Manuela Colorado
Plegable biologia molecular-Manuela Colorado
 
Marine Host-Microbiome Interactions: Challenges and Opportunities
Marine Host-Microbiome Interactions: Challenges and OpportunitiesMarine Host-Microbiome Interactions: Challenges and Opportunities
Marine Host-Microbiome Interactions: Challenges and Opportunities
 

Mehr von Jonathan Eisen

Eisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdfEisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdfJonathan Eisen
 
Phylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of MicrobesPhylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of MicrobesJonathan Eisen
 
Talk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meetingTalk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meetingJonathan Eisen
 
Thoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current ActionsThoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current ActionsJonathan Eisen
 
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...Jonathan Eisen
 
A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2Jonathan Eisen
 
EVE198 Summer Session Class 4
EVE198 Summer Session Class 4EVE198 Summer Session Class 4
EVE198 Summer Session Class 4Jonathan Eisen
 
EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1 EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1 Jonathan Eisen
 
EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines Jonathan Eisen
 
EVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 IntroductionEVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 IntroductionJonathan Eisen
 
EVE198 Spring2021 Class2
EVE198 Spring2021 Class2EVE198 Spring2021 Class2
EVE198 Spring2021 Class2Jonathan Eisen
 
EVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 VaccinesEVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 VaccinesJonathan Eisen
 
EVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA DetectionEVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA DetectionJonathan Eisen
 
EVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 IntroductionEVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 IntroductionJonathan Eisen
 
EVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID TestingEVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID TestingJonathan Eisen
 
EVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID VaccinesEVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID VaccinesJonathan Eisen
 
EVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID TransmissionEVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID TransmissionJonathan Eisen
 
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 VaccinesEVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 VaccinesJonathan Eisen
 
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and TestingEVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and TestingJonathan Eisen
 
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 IntroductionEVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 IntroductionJonathan Eisen
 

Mehr von Jonathan Eisen (20)

Eisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdfEisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdf
 
Phylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of MicrobesPhylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of Microbes
 
Talk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meetingTalk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meeting
 
Thoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current ActionsThoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current Actions
 
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
 
A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2
 
EVE198 Summer Session Class 4
EVE198 Summer Session Class 4EVE198 Summer Session Class 4
EVE198 Summer Session Class 4
 
EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1 EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1
 
EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines
 
EVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 IntroductionEVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 Introduction
 
EVE198 Spring2021 Class2
EVE198 Spring2021 Class2EVE198 Spring2021 Class2
EVE198 Spring2021 Class2
 
EVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 VaccinesEVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 Vaccines
 
EVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA DetectionEVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA Detection
 
EVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 IntroductionEVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 Introduction
 
EVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID TestingEVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID Testing
 
EVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID VaccinesEVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID Vaccines
 
EVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID TransmissionEVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID Transmission
 
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 VaccinesEVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
 
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and TestingEVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
 
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 IntroductionEVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
 

Kürzlich hochgeladen

Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSJoshuaGantuangco2
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Celine George
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...Postal Advocate Inc.
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A Beña
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfphamnguyenenglishnb
 

Kürzlich hochgeladen (20)

Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTSGRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
GRADE 4 - SUMMATIVE TEST QUARTER 4 ALL SUBJECTS
 
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
Incoming and Outgoing Shipments in 3 STEPS Using Odoo 17
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
USPS® Forced Meter Migration - How to Know if Your Postage Meter Will Soon be...
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdfAMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
AMERICAN LANGUAGE HUB_Level2_Student'sBook_Answerkey.pdf
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 

Jonathan Eisen talk for #SCS2012 at #ISMB "Networks in genomics and bioinformatics: from phylogeny to Twitter"

  • 1. Networks in genomics and bioinformatics: from phylogeny to Twitter ISCB2012 July 12, 2012 Jonathan A. Eisen University of California, Davis @phylogenomics Friday, July 13, 12
  • 2. Networks in genomics and bioinformatics: from phylogeny to Twitter ISCB2012 July 12, 2012 Jonathan A. Eisen University of California, Davis @phylogenomics Friday, July 13, 12
  • 3. A meandering path and lessons “learned” ISCB2012 July 12, 2012 Jonathan A. Eisen University of California, Davis @phylogenomics Friday, July 13, 12
  • 5. Social Networking in Science Friday, July 13, 12
  • 10. Phylogenomics of Novelty Origin of New Functions and Processes Friday, July 13, 12
  • 11. Phylogenomics of Novelty Origin of New Functions and Processes •New genes •Changes in old genes •Changes in pathways Friday, July 13, 12
  • 12. Phylogenomics of Novelty Origin of New Functions and Processes •New genes •Changes in old genes •Changes in pathways Friday, July 13, 12
  • 13. Phylogenomics of Novelty Origin of New Genome Functions and Dynamics Processes •New genes •Changes in old genes •Changes in pathways Friday, July 13, 12
  • 14. Phylogenomics of Novelty Origin of New Genome Functions and Dynamics Processes •Evolvability •New genes •Repair and •Changes in old genes recombination processes •Changes in pathways •Intragenomic variation Friday, July 13, 12
  • 15. Phylogenomics of Novelty Origin of New Genome Functions and Dynamics Processes •Evolvability •New genes •Repair and •Changes in old genes recombination processes •Changes in pathways •Intragenomic variation Friday, July 13, 12
  • 16. Phylogenomics of Novelty Origin of New Genome Functions and Dynamics Processes •Evolvability •New genes •Repair and •Changes in old genes recombination processes •Changes in pathways •Intragenomic variation Species Evolution Friday, July 13, 12
  • 17. Phylogenomics of Novelty Origin of New Genome Functions and Dynamics Processes •Evolvability •New genes •Repair and •Changes in old genes recombination processes •Changes in pathways •Intragenomic variation Species Evolution •Phylogenetic history •Vertical vs. horizontal descent •Needed to track gain/loss of processes, infer convergence Friday, July 13, 12
  • 18. Undergrad Lesson 1: Be prepared for random events • Gould’s class b/c planned on not majoring in Biology • RMBL via backpacking trip • Geology library job w/ Nabokov collection b/c went to wrong building • Discovering Colleen Cavanaugh’s lab via street encounter Friday, July 13, 12
  • 19. Undergrad Lesson 2: Phylogeny Matters • “MacClade” • Phylogenetic ecology • Phylotyping Friday, July 13, 12
  • 20. Phylogeny Matters Eisen et al. 1992 Friday, July 13, 12
  • 21. Grad school lesson I: find right people to work with • Went to work on butterfly population biology and phylogeny • Advisor and I did not see eye to eye • Despite great subject for me (combined phylogeny, molecular evolution, RMBL, etc), chose not to join lab • Did many rotations … • Picked final lab in part b/c advisor was right match Friday, July 13, 12
  • 22. Grad school lesson II: never too late to change • Wanted to combine DNA repair studies and molecular evolution • I: Thymineless death • II: Adaptive mutation • III: Repair in archaea Friday, July 13, 12
  • 24. Grad school lesson II: never too late to change • Wanted to combine DNA repair studies and molecular evolution • I: Thymineless death • II: Adaptive mutation • III: Repair in archaea • IV: Bioinformatics and genome analysis … Friday, July 13, 12
  • 25. Grad school lesson III: Get others to do your work • Interested in RecA structure function relationships • Using phylogeny to look for correlated substitutions in RecA structure, like done with rRNA • But not enough sequences … Friday, July 13, 12
  • 27. Shotgun Sequencing Allows Use of Alternative Anchors (e.g., RecA) Venter et al., 2004 Friday, July 13, 12
  • 28. Grad school lesson IV: Stealing is good • Phylogenetic perspective in bioinformatics missing Friday, July 13, 12
  • 29. “Nothing in biology makes sense except in the light of evolution.” T. H. Dobzhansky (1973) Friday, July 13, 12
  • 30. Evolutionary Perspective and Comparative Biology • Comparative biology is the analysis of differences and similarities between species. • An evolutionary perspective is useful in such studies because this allows one to focus not just on the levels and degrees of similarity or difference but on how and why similarities and differences came to be. Friday, July 13, 12
  • 31. Phylogenomics • Lots of sequences being produced with no functions associated with them • Much debate in community about how to predict functions Friday, July 13, 12
  • 32. Predicting Function • Identification of motifs • Homology/similarity based methods • Highest hit • Top hits • Clusters of orthologous groups • HMM models • Structural threading and modeling • Evolutionary reconstructions Friday, July 13, 12
  • 33. Phylogeny Matters Eisen et al. 1992 Friday, July 13, 12
  • 34. Evolutionary Functional Prediction EXAMPLE A METHOD EXAMPLE B 2A CHOOSE GENE(S) OF INTEREST 5 3A 1 3 4 2B 2 IDENTIFY HOMOLOGS 5 1A 2A 1B 3B 6 ALIGN SEQUENCES 1A 2A 3A 1B 2B 3B 1 2 3 4 5 6 CALCULATE GENE TREE Duplication? 1A 2A 3A 1B 2B 3B 1 2 3 4 5 6 OVERLAY KNOWN FUNCTIONS ONTO TREE Duplication? 1 2 3 4 5 6 1A 2A 3A 1B 2B 3B INFER LIKELY FUNCTION OF GENE(S) OF INTEREST Ambiguous Duplication? Species 1 Species 2 Species 3 1A 1B 2A 2B 3A 3B 1 2 3 4 5 6 ACTUAL EVOLUTION (ASSUMED TO BE UNKNOWN) Based on Eisen, Duplication 1998 Genome Res 8: 163-167. Friday, July 13, 12
  • 37. Phylogenetic Prediction of Function • Many powerful and automated similarity based methods for assigning genes to protein families • COGs • PFAM HMM searches • Some limitations of similarity based methods can be overcome by phylogenetic approaches • Automated methods now available • Sean Eddy • Steven Brenner • Kimmen Sjölander • But … Friday, July 13, 12
  • 38. Grad school lesson V: Teaching helps you learn Friday, July 13, 12
  • 39. Grad school lesson VI: There are no career rules Friday, July 13, 12
  • 40. Career Lesson I: Build on what you know • Phylogenetic approaches to genomics • Genomics of endosymbionts • Genomic studies of communities • Analysis of DNA repair genes in genome sequences • Phylogenomics of halophilic archaea • GEBA • Phylogenetic metagenomics • ... Friday, July 13, 12
  • 41. Career Lesson II: Don’t Only Use What You Know Friday, July 13, 12
  • 42. What We Don’t Know Can Hurt Us Friday, July 13, 12
  • 44. DNA Repair Genes in D. radiodurans Process Genes in D. radiodur a n s Nucleotide Excision Repair UvrABCD, UvrA2 Base Excision Repair AlkA, Ung, Ung2, GT, MutM, MutY-Nths, MPG AP Endonuclease Xth Mismatch Excision Repair MutS, MutL Recombination Initiation RecFJNRQ, SbcCD, RecD Recombinase RecA Migration and resolution RuvABC, RecG Replication PolA, PolC, PolX, phage Pol Ligation DnlJ dNTP pools, cleanup MutTs, RRase Other LexA, RadA, HepA, UVDE, MutS2 Friday, July 13, 12
  • 45. Problem ... • List of DNA repair gene homologs in D. radiodurans genome is not significantly different from other bacterial genomes of the similar size Friday, July 13, 12
  • 46. Repair Studies in Different Species (via Medline searches as of 1998) Humans 7028 E. coli 3926 S. cerevisiae 988 Drosophila 387 B. subtilits 284 S. pombe 116 Xenopus 56 C. elegans 25 A. thaliana 20 Methanogens 16 Haloferax 5 Giardia 0 Friday, July 13, 12
  • 47. Proteobacteria TM6 OS-K ~40 Phyla of Acidobacteria Termite Group OP8 Bacteria Nitrospira Bacteroides Chlorobi Fibrobacteres Marine GroupA WS3 Gemmimonas Firmicutes Fusobacteria Actinobacteria OP9 Cyanobacteria Synergistes Deferribacteres Chrysiogenetes NKB19 Verrucomicrobia Chlamydia OP3 Planctomycetes Spriochaetes 0.1 Coprothmermobacter OP10 Thermomicrobia Chloroflexi TM7 Deinococcus-Thermus Dictyoglomus Aquificae Tree based on Thermudesulfobacteria Thermotogae Hugenholtz (2002) OP1 with some OP11 modifications. Friday, July 13, 12
  • 48. Proteobacteria TM6 OS-K Acidobacteria Most DNA Termite Group OP8 Nitrospira metabolism Bacteroides Chlorobi Fibrobacteres studies in Marine GroupA WS3 Gemmimonas two Phyla Firmicutes Fusobacteria Actinobacteria OP9 Cyanobacteria Synergistes Deferribacteres Chrysiogenetes NKB19 Verrucomicrobia Chlamydia OP3 Planctomycetes Spriochaetes 0.1 Coprothmermobacter OP10 Thermomicrobia Chloroflexi TM7 Deinococcus-Thermus Dictyoglomus Aquificae Tree based on Thermudesulfobacteria Thermotogae Hugenholtz (2002) OP1 with some OP11 modifications. Friday, July 13, 12
  • 49. Proteobacteria TM6 OS-K Acidobacteria Deinococcus Termite Group OP8 Nitrospira is very distant Bacteroides Chlorobi Fibrobacteres from well Marine GroupA WS3 Gemmimonas studied Firmicutes Fusobacteria groups Actinobacteria OP9 Cyanobacteria Synergistes Deferribacteres Chrysiogenetes NKB19 Verrucomicrobia Chlamydia OP3 Planctomycetes Spriochaetes 0.1 Coprothmermobacter OP10 Thermomicrobia Chloroflexi TM7 Deinococcus-Thermus Dictyoglomus Aquificae Tree based on Thermudesulfobacteria Thermotogae Hugenholtz (2002) OP1 with some OP11 modifications. Friday, July 13, 12
  • 50. Gain and Loss of Repair Genes BACTERIA ARCHAEA EUKARYOTES Helpy Trepa Ecoli Human Mycge Mycpn Bacsu Synsp Borbu Metth Neigo Yeast Arcfu Strpy Metjn Haein -Ogt -PhrI -AlkA -Nfo -AlkA -Ogt -PhrI -AlkA -Ogt -Ung -Xth -Rad25 -AlkA -AlkA -Nfo -Nfo -RecFRQN -Rad25? R + us -TagI -RecQ -RuvC +P53 UmuD + -Vsr -Nfo -SbcD? dRecQ -SbcCD -Dut +Rad7 +Nei? -Rec -Lon dRad23 -LexA -SMS +CCE1 +RecE -SbcCD -LexA +MAG? tRecT? -UmuC -LexA +Spr tTagI ? tRad25 t3MG -PhrI -PhrII -PhrI -Ogt -PhrI -Ogg tUvrABCD Ada + -PhrII -AlkA -Ogt MutH + -PhrII? -AlkA -Xth -AlkA -Ung SbcB + -Fpg -MutLS -Nfo -Fpg -Nfo -RecFJORQN -Nfo -Dut -MutLS -Mfd -RecO -Lon -PhrI -RecFORQ -SbcCD -LexA -Ung? -PhrII -SbcCD -RecG -UmuC -MutLS -LexA -Dut -RecQ? + sr V -UmuC -PriA -Dut RecBCD? + -TagI+RecT -LexA -UmuC -SMS -MutT RFAs + -PhrII +TFIIH -RuvC +Rad4,10,14,16,23,26 CSA + Rad52,53,54 + +TagI? dPhr DNA-PK, Ku + SNF2 d TagI? + dMutS +Fpg dMutL UvrABCD + dRecA Mfd + RecFJNOR + Ung? + RuvABC + SSB, + +RecG Rad1 + +Dut? LigI + +Rad2 from mitochondria LexA + +Rad25? SSB + Ogg + +PriA LigII + +Dut? PhrI, PhrII + +Ogt +Ung, AlkA, MutY-Nth +AlkA +Xth, Nfo? +MutLS? +SbcCD +RecA +UmuC +MutT +Lon Eisen and Hanawalt, 1999 Mut dMutSI/MutSII dRecA/SMS Res 435: 171-213 dPhrI/PhrII Friday, July 13, 12
  • 52. What We Don’t Know Can Hurt Us Friday, July 13, 12
  • 53. As of 2002 Proteobacteria TM6 OS-K • At least 40 phyla of Acidobacteria Termite Group OP8 Nitrospira Bacteroides bacteria Chlorobi Fibrobacteres Marine GroupA WS3 Gemmimonas Firmicutes Fusobacteria Actinobacteria OP9 Cyanobacteria Synergistes Deferribacteres Chrysiogenetes NKB19 Verrucomicrobia Chlamydia OP3 Planctomycetes Spriochaetes Coprothmermobacter OP10 Thermomicrobia Chloroflexi TM7 Deinococcus-Thermus Dictyoglomus Aquificae Thermudesulfobacteria Thermotogae OP1 Based on Hugenholtz, OP11 2002 Friday, July 13, 12
  • 54. As of 2002 Proteobacteria TM6 OS-K • At least 40 Acidobacteria Termite Group phyla of OP8 Nitrospira Bacteroides bacteria Chlorobi Fibrobacteres Marine GroupA • Most genomes WS3 Gemmimonas from three Firmicutes Fusobacteria phyla Actinobacteria OP9 Cyanobacteria Synergistes Deferribacteres Chrysiogenetes NKB19 Verrucomicrobia Chlamydia OP3 Planctomycetes Spriochaetes Coprothmermobacter OP10 Thermomicrobia Chloroflexi TM7 Deinococcus-Thermus Dictyoglomus Aquificae Thermudesulfobacteria Thermotogae OP1 Based on Hugenholtz, OP11 2002 Friday, July 13, 12
  • 55. As of 2002 Proteobacteria TM6 OS-K • At least 40 Acidobacteria Termite Group phyla of OP8 Nitrospira Bacteroides bacteria Chlorobi Fibrobacteres Marine GroupA • Most genomes WS3 Gemmimonas from three Firmicutes Fusobacteria phyla Actinobacteria OP9 Cyanobacteria • Some studies Synergistes Deferribacteres Chrysiogenetes in other phyla NKB19 Verrucomicrobia Chlamydia OP3 Planctomycetes Spriochaetes Coprothmermobacter OP10 Thermomicrobia Chloroflexi TM7 Deinococcus-Thermus Dictyoglomus Aquificae Thermudesulfobacteria Thermotogae OP1 Based on Hugenholtz, OP11 2002 Friday, July 13, 12
  • 56. As of 2002 Proteobacteria TM6 OS-K • At least 40 Acidobacteria Termite Group phyla of OP8 Nitrospira Bacteroides bacteria Chlorobi Fibrobacteres Marine GroupA • Most genomes WS3 Gemmimonas from three Firmicutes Fusobacteria phyla Actinobacteria OP9 Cyanobacteria • Some other Synergistes Deferribacteres Chrysiogenetes phyla are only NKB19 Verrucomicrobia sparsely Chlamydia OP3 Planctomycetes sampled Spriochaetes Coprothmermobacter • Same trend in OP10 Thermomicrobia Chloroflexi Eukaryotes TM7 Deinococcus-Thermus Dictyoglomus Aquificae Thermudesulfobacteria Thermotogae OP1 Based on Hugenholtz, OP11 2002 Friday, July 13, 12
  • 57. As of 2002 Proteobacteria TM6 OS-K • At least 40 Acidobacteria Termite Group phyla of OP8 Nitrospira Bacteroides bacteria Chlorobi Fibrobacteres Marine GroupA • Most genomes WS3 Gemmimonas from three Firmicutes Fusobacteria phyla Actinobacteria OP9 Cyanobacteria • Some other Synergistes Deferribacteres Chrysiogenetes phyla are only NKB19 Verrucomicrobia sparsely Chlamydia OP3 Planctomycetes sampled Spriochaetes Coprothmermobacter • Same trend in OP10 Thermomicrobia Chloroflexi Viruses TM7 Deinococcus-Thermus Dictyoglomus Aquificae Thermudesulfobacteria Thermotogae OP1 Based on Hugenholtz, OP11 2002 Friday, July 13, 12
  • 59. GEBA http://www.jgi.doe.gov/programs/GEBA/pilot.html Friday, July 13, 12
  • 60. rRNA Tree of Life Bacteria Archaea Eukaryotes Figure from Barton, Eisen et al. “Evolution”, CSHL Press. 2007. Based on tree from Pace 1997 Science 276:734-740 Friday, July 13, 12
  • 61. PD: Genomes From Wu et al. 2009 Nature 462, 1056-1060 Friday, July 13, 12
  • 62. PD: Genomes + GEBA From Wu et al. 2009 Nature 462, 1056-1060 Friday, July 13, 12
  • 63. PD: Isolates From Wu et al. 2009 Nature 462, 1056-1060 Friday, July 13, 12
  • 64. rRNA Tree of Life Bacteria Archaea ?????? Eukaryotes Figure from Barton, Eisen et al. “Evolution”, CSHL Press. 2007. Wu et al. (2011) PLoS ONE 6(3): e18011. doi:10.1371/ Based on tree from Pace 1997 Science journal.pone.0018011 276:734-740 Friday, July 13, 12
  • 65. ???? Phage Phage ???? Thaumarchaeot Friday, July 13, 12
  • 66. GEBA uncultured Number of SAGs from Candidate Phyla 406 1 OD1 OP1 OP3 SAR Site A: Hydrothermal vent 4 1 - - Site B: Gold Mine 6 13 2 - Site C: Tropical gyres (Mesopelagic) - - - 2 Site D: Tropical gyres (Photic zone) 1 - - - Sample collections at 4 additional sites are underway. Phil Hugenholtz 56 Friday, July 13, 12
  • 68. Non homology functional • Many genes have homologs in other species but no homologs have ever been studied experimentally • Non-homology methods can make functional predictions for these Friday, July 13, 12
  • 69. Phylogenetic profiling basis • Microbial genes are lost rapidly when not maintained by selection • Genes can be acquired by lateral transfer • Frequently gain and loss occurs for entire pathways/processes • Thus might be able to use correlated presence/absence information to identify genes with similar functions Friday, July 13, 12
  • 70. Non-Homology Predictions: Phylogenetic Profiling • Step 1: Search all genes in organisms of interest against all other genomes • Ask: Yes or No, is each gene found in each other species • Cluster genes by distribution patterns (profiles) Friday, July 13, 12
  • 71. Carboxydothermus hydrogenoformans • Isolated from a Russian hotspring • Thermophile (grows at 80°C) • Anaerobic • Grows very efficiently on CO (Carbon Monoxide) • Produces hydrogen gas • Low GC Gram positive (Firmicute) • Genome Determined (Wu et al. 2005 PLoS Genetics 1: e65. ) Friday, July 13, 12
  • 72. Homologs of Sporulation Genes Wu et al. 2005 PLoS Genetics 1: e65. Friday, July 13, 12
  • 73. Carboxydothermus sporulates Wu et al. 2005 PLoS Genetics 1: e65. Friday, July 13, 12
  • 74. Wu et al. 2005 PLoS Genetics 1: e65. Friday, July 13, 12
  • 75. PG Profiling Works Better Using Orthology Friday, July 13, 12
  • 76. PG Profiling Works Better Using Independent Contrasts Friday, July 13, 12
  • 77. Career Lesson III: Networks Matter Friday, July 13, 12
  • 78. Protein Family Rarefaction Curves • Take data set of multiple complete genomes • Identify all protein families using MCL • Plot # of genomes vs. # of protein families Friday, July 13, 12
  • 79. Wu et al. 2009 Nature 462, 1056-1060 Friday, July 13, 12
  • 80. Wu et al. 2009 Nature 462, 1056-1060 Friday, July 13, 12
  • 81. Wu et al. 2009 Nature 462, 1056-1060 Friday, July 13, 12
  • 82. Wu et al. 2009 Nature 462, 1056-1060 Friday, July 13, 12
  • 83. Wu et al. 2009 Nature 462, 1056-1060 Friday, July 13, 12
  • 84. Synapomorphies exist Wu et al. 2009 Nature 462, 1056-1060 Friday, July 13, 12
  • 87. B A C Sharpton et al. submitted Friday, July 13, 12
  • 88. Career Lesson IV: Openness Helps Friday, July 13, 12