+ All Categories
Home > Documents > The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution...

The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution...

Date post: 08-Jul-2020
Category:
Upload: others
View: 0 times
Download: 0 times
Share this document with a friend
46
The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker – Peter Pfaffelhuber Albert-Ludwigs Universit¨ at Freiburg April 01, 2013 F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 1 / 28
Transcript
Page 1: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

The evolution of bacterial genomes under horizontalgene transfer

Franz Baumdicker – Peter Pfaffelhuber

Albert-Ludwigs Universitat Freiburg

April 01, 2013

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 1 / 28

Page 2: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

1 introduction

2 The Infinitely Many Genes Model

3 ancestral gene transfer graph

4 main results

5 outlook

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 2 / 28

Page 3: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

Introduction

The distributed genome hypothesis

The set of genes in a population of bacteria is distributed over allindividuals that belong to the specific taxon.

individuals of the samepopulation do not havethe same set of genes

no organism contains thefull complement of genesof the species

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 3 / 28

Page 4: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

previous work

Extrapolation:

coregenome: a function fitted to thenumber of genes common to nindividuals converges to some numberc for n→∞pangenome: if a function fitted tothe total number of genes in nindividuals

I goes to infinity:open pangenome

I saturates at some finite level:closed pangenome

Kittichotirat et al. 2011, Kettler et al. 2007

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 4 / 28

Page 5: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

Modelling genomic diversity

Goal:Describe the diversity of distributed genomes in bacterial populations

base the model on the underlying biological mechanismsI random reproduction - genealogyI gain of genesI loss of genesI horizontal gene transfer within the population

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 5 / 28

Page 6: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

Horizontal Gene Transfer in bacteria

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 6 / 28

Page 7: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

Horizontal Gene Transfer in bacteria

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 7 / 28

Page 8: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

Horizontal Gene Transfer in bacteria

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 8 / 28

Page 9: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

genes and genomes

the available gene pool is a set of genes I = [0, 1].

the genome of individual i contains genes Gi ⊆ I

Gi is called dispensable genome of individual i .

in addition every individual has a set of c genes absolutely necessaryto survive, the core genome.

these genes must be passed from ancestor to offspring.

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 9 / 28

Page 10: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

infinitely many sites model

pairs resample at rate 1

mutations accumulate at rate θalong the lineages

one line

a secondline

one line

a secondline

x

x

xx

x

x

xxx xx xxx xxxx x

mutation dynamics borrowed from

Phylogenetic Trees based on gene content Daniel H. Huson,Mike Steel

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 10 / 28

Page 11: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

infinitely many sites model

genealogy is given byKingman’s coalescent

pairs of lineages coalesceat rate 1

mutations accumulate at rate θalong the lineages of theKingman tree

a secondline

one line

a secondline

mutation dynamics borrowed from

Phylogenetic Trees based on gene content Daniel H. Huson,Mike Steel

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 10 / 28

Page 12: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

infinitely many genes model

pairs resample at rate 1

gene gains occur at rate θ2 along

the lineages

each gene is lost at rate ρ2

a secondline

one line

a secondline

N5N1

N2 N3

N4H2

H1

H1

{1} {2} {4} ∅ {1, 3}

mutation dynamics borrowed from

Phylogenetic Trees based on gene content Daniel H. Huson,Mike Steel

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 10 / 28

Page 13: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

infinitely many genes model

genealogy is given byKingman’s coalescent

pairs of lineages coalesceat rate 1

genes are gained at rate θ2

each gene is lost at rate ρ2

each gene is transfered at rate γ2

from a unknown line

a transfered gene is a copy.

donor and acceptor both carrythe gene

mutation dynamics borrowed from

Phylogenetic Trees based on gene content Daniel H. Huson,Mike Steel

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 10 / 28

Page 14: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

infinitely many genes model with HGT

pairs resample at rate 1

gene gains occur at rate θ2 along

the lineages

each gene is lost at rate ρ2

a present gene is transfered atrate γ

2 to a random individual

a transfered gene is a copy.

donor and acceptor both carrythe gene

each gene is transfered at rate γ2

from a unknown line

a transfered gene is a copy.

donor and acceptor both carrythe gene

N5N1

N2 N3

N4H2

H1

H1

51

1

3

{1, 5} {2, 5} {4} {3} {1, 3}

mutation dynamics borrowed from

Phylogenetic Trees based on gene content Daniel H. Huson,Mike Steel

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 10 / 28

Page 15: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

infinitely many genes model with HGT

genealogy is given byKingman’s coalescent

pairs of lineages coalesceat rate 1

genes are gained at rate θ2

each gene is lost at rate ρ2

each gene is transfered at rate γ2

from a unknown line

a transfered gene is a copy.

donor and acceptor both carrythe gene

mutation dynamics borrowed from

Phylogenetic Trees based on gene content Daniel H. Huson,Mike Steel

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 10 / 28

Page 16: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

infinitely many genes model with HGT

genealogy is given byKingman’s coalescent

pairs of lineages coalesceat rate 1

genes are gained at rate θ2

each gene is lost at rate ρ2

each gene is transfered at rate γ2

from a unknown line

a transfered gene is a copy.

donor and acceptor both carrythe gene

mutation dynamics borrowed from

Phylogenetic Trees based on gene content Daniel H. Huson,Mike Steel

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 10 / 28

Page 17: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

The ancestral gene transfer graph for a single gene

each pair of lines coalesces at rate 1,

each line disappears at rate ρ/2I the gene was lost

each line splits in two lines at rate γ/2

I the gene was horizontally transferredfrom another individual

I the gene can now have two differentorigins

A4

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 11 / 28

Page 18: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

The ancestral gene transfer graph for a single gene

each pair of lines coalesces at rate 1,

each line disappears at rate ρ/2I the gene was lost

each line splits in two lines at rate γ/2

I the gene was horizontally transferredfrom another individual

I the gene can now have two differentorigins

A4

H

x

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 11 / 28

Page 19: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

The ancestral gene transfer graph for a single gene

each pair of lines coalesces at rate 1,

each line disappears at rate ρ/2I the gene was lost

each line splits in two lines at rate γ/2

I the gene was horizontally transferredfrom another individual

I the gene can now have two differentorigins

A4

H

xx

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 11 / 28

Page 20: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

The ancestral gene transfer graph for infinitely many genes

start with the clonal genealogy of the sample A(0)n .

construct the genealogy of the first gene A(1)n

I loss events at rate ρ/2I additional splitting events at rate γ/2I add coalescence events for each new line

Iteratively, construct A(k+1)n

I keep all lines in ∪ki=0A(i)n

I add splitting, loss and coalescence events.

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 12 / 28

Page 21: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

The ancestral gene transfer graph for infinitely many genes

A(0)4

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 12 / 28

Page 22: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

The ancestral gene transfer graph for infinitely many genes

start with the clonal genealogy of the sample A(0)n .

construct the genealogy of the first gene A(1)n

I loss events at rate ρ/2I additional splitting events at rate γ/2I add coalescence events for each new line

Iteratively, construct A(k+1)n

I keep all lines in ∪ki=0A(i)n

I add splitting, loss and coalescence events.

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 12 / 28

Page 23: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

The ancestral gene transfer graph for infinitely many genes

A(0)4 A(1)

4

•1

•1

1

1

•1

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 12 / 28

Page 24: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

The ancestral gene transfer graph for infinitely many genes

start with the clonal genealogy of the sample A(0)n .

construct the genealogy of the first gene A(1)n

I loss events at rate ρ/2I additional splitting events at rate γ/2I add coalescence events for each new line

Iteratively, construct A(k+1)n

I keep all lines in ∪ki=0A(i)n

I add splitting, loss and coalescence events.

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 12 / 28

Page 25: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

The ancestral gene transfer graph for infinitely many genes

A(0)4 A(1)

4

•1

•1

1

1

•1

A(2)4

•2

•22

•2

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 12 / 28

Page 26: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

gene gains in the AGTG

Consider the events (Tm,Um)m=1,2,... of a Poisson point process on[0,∞)× [0, 1] with intensity measure 1

2θ dt du.

If Tk ≤ L(A(k)n ), pick a point uniformly at random on A(k)

n , where thegene Uk was gained.

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 13 / 28

Page 27: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

weak convergence

Gene distributions from Moran model and AGTG coincide

Let (GN1 , ...,G(N)n ) be the genes of individual 1, . . . , n in the previously

described moran model of size N.And let (G1, ...,Gn) be the gene distribution read off from the AGTG.Then,

(GN1 , ...,G(N)n )

N→∞−−−−→ (G1, ...,Gn)

• •0 1

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 14 / 28

Page 28: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

weak convergence

Gene distributions from Moran model and AGTG coincide

Let (GN1 , ...,G(N)n ) be the genes of individual 1, . . . , n in the previously

described moran model of size N.And let (G1, ...,Gn) be the gene distribution read off from the AGTG.Then,

(GN1 , ...,G(N)n )

N→∞−−−−→ (G1, ...,Gn)

A(1)n

•1

•1

1

1

•1

A(2)n•2

•22

•2

• •0 1

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 14 / 28

Page 29: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

weak convergence

Gene distributions from Moran model and AGTG coincide

Let (GN1 , ...,G(N)n ) be the genes of individual 1, . . . , n in the previously

described moran model of size N.And let (G1, ...,Gn) be the gene distribution read off from the AGTG.Then,

(GN1 , ...,G(N)n )

N→∞−−−−→ (G1, ...,Gn)

A(3)n

A(1)n

•1

•1

1

1

•1

A(4)n

A(2)n•2

•22

•2

A(5)n

• •0 1

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 14 / 28

Page 30: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

single individual – average number of genes

|Gi |: number of genes in individual i

E[|Gi |] =θ

ρ+θ

ρ

∞∑m=1

γm

(1 + ρ)m+ c

with (a)b := a(a + 1) · · · (a + b − 1).

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 15 / 28

Page 31: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

single individual – average number of genes

without HGT (γ = 0)

following one line backwards in timeI losses occur at rate ρ

2

I each line produces a new lineat rate γ

2I each pair of lines coalesces at rate 1

expected length of unlost line is 2ρ

text

E[|Gi |] = θ22ρ = θ

ρ

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 16 / 28

Page 32: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

single individual – average number of genes

with HGT (γ > 0)

following one line backwards in timeI each line dies at rate ρ

2I each line produces a new line

at rate γ2

I each pair of lines coalesces at rate 1

Lm: length of the ancestral genetransfer graph started with m lines

E[|Gi |] = θ2E[L1]

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 16 / 28

Page 33: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

average number of genes – birth-death processes

The length Lm of an AGTG started with m linesequalsthe time to absorption for a birth-death process started in m withbirth-rate λi = 1

iiγ2 = γ

2

death-rate µi = 1i

(iρ2 + i(i−1)

2

)= ρ+i−1

2

Thus,

E[|Gi |] =θ

2E[L1] =

θ

2

∞∑i=1

pi =θ

2

∞∑i=1

λ1λ2 · · ·λi−1µ1µ2 · · ·µi

ρ

(1 +

∞∑i=1

γ i

(ρ+ 1)i

).

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 17 / 28

Page 34: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

expected pangenome size – birth-death processes

Use same idea to compute the expected number of genes in n individuals(pangenome size)

E

[∣∣∣ n⋃i=1

Gi∣∣∣] =

θ

2E[Ln] =

θ

2

∞∑i=1

pi +n−1∑r=1

(r∏

k=1

µkλk

) ∞∑j=r+1

pj

= θ

n−1∑k=0

1

k + ρ

(1 +

∞∑m=1

γm

(i + ρ)m

)

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 18 / 28

Page 35: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

the gene frequency spectrum

The gene frequency spectrum is given by G(n)1 , ...,G

(n)n , where

G(n)k := |{u ∈ I : u ∈ Gi for exactly k different i}|.

E[G(n)k ] =

θ

k

n · · · (n − k + 1)

(n − 1 + ρ) · · · (n − k + ρ)

(1 +

∞∑m=1

(k)mγm

(n + ρ)mm!

)with (a)b := a(a + 1) · · · (a + b − 1).

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 19 / 28

Page 36: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

diffusion theory and the gene frequency spectrum

Let (Xt) be the frequency of a gene at time t.Then, (Xt)t≥0 is a diffusion process, which follows the SDE

dX = −ρ2

Xdt +γ

2X (1− X )dt +

√X (1− X )dW .

The number of genes in frequency x is Poisson with mean

g(x)dx := θeγx

x(1− x)1−ρdx .

and

E[G(n)k ] =

(n

k

)∫ 1

0g(x)xk(1− x)n−kdx

k

n · · · (n − k + 1)

(n − 1 + ρ) · · · (n − k + ρ)

(1 +

∞∑m=1

(k)mγm

(n + ρ)mm!

)

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 20 / 28

Page 37: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

2 4 6 8 10

0.0

0.1

0.2

0.3

0.4

number of individuals, k

E[G

k]/E

[G]

●●

● ● ● ●

● γ=0γ=5γ=10

The expected gene frequency spectrum is highly dependent of γ.For high values of γ, most genes are in high frequency, leading to a closedpangenome.

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 21 / 28

Page 38: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

higher moments

The frequencies of two genes depend on each other.

can not apply 1-dim diffusion methods to get higher moments

generation

gene

freq

uenc

y

0 500 1000 1500 20000

0.3

0.5

0.7

1

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 22 / 28

Page 39: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

variance – approximations in the AGTG

Var [|Gi |] =θ

ρ

(1 +

γ

1 + ρ

)+O(γ2)

V[|G1|] =

∫ 1

0V[G1(dx)] +

∫ 1

0

∫ 1

01x 6=yCOV[G1(dx),G1(dy)]

V[|G1(dx)|] =θ

2E[L(A1)]dx +O(dx2) =

θ

2E[L1]dx +O(dx2)

COV[|G1(dx)|, |G1(dy)|] =θ2

4COV[L(A1), L(A2)]dx dy

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 23 / 28

Page 40: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

variance – approximations in the AGTG

Var [|Gi |] =θ

ρ

(1 +

γ

1 + ρ+

γ2

(1 + ρ)(2 + ρ)+

γ2θ

(1 + ρ)2(3 + 2ρ)(2 + 7ρ+ 6ρ2)

)+O(γ3)

• •

V[|G1|] =

∫ 1

0V[G1(dx)] +

∫ 1

0

∫ 1

01x 6=yCOV[G1(dx),G1(dy)]

V[|G1(dx)|] =θ

2E[L(A1)]dx +O(dx2) =

θ

2E[L1]dx +O(dx2)

COV[|G1(dx)|, |G1(dy)|] =θ2

4COV[L(A1), L(A2)]dx dy

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 23 / 28

Page 41: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

software IMaGe

IMaGe

estimatedgenealogicaltree genefrequencyspectrum

statistical test for hypotheses of neutral evolutionparameter estimates

estimated pangenome sizeexpected no. of new genes in the next individual

. . .

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 24 / 28

Page 42: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

outlook – estimating γ

given the observed gene frequencyspectrum it is difficult to estimateθ, ρ, γ and c solely based on the mean genefrequency spectrum

for γ = 0 IMaGe uses an a priori tree

for γ > 0 each gene has its owngenealogy

need a new statistic besides the gfswhich is sensible to γ

● ●●

● ●

2 4 6 8 10

010

2030

40

number of individuals, k

gene

freq

uenc

y G

k(10)

● ● ●

● ●

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 25 / 28

Page 43: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

pairs of incongruent genes

The number of incongruent pairs of genes is given by

P := 1n(n−1)(n−2)(n−3)

n∑i ,j ,k,l=1

Aij ,kl · Aik,jl

where

Aij ,kl := |(Gi ∩ Gj) \ (Gk ∪ Gl)|, 1 ≤ i , j , k , l ≤ n.

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 26 / 28

Page 44: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

pairs of incongruent genes

The average number of incongruent pairs of genes without HGT is givenby

E[P] =θ2ρ

4

18 + 117ρ2 + 203ρ2

4 + 105ρ3

8

(1 + ρ2)2(1 + 2ρ2)(1 + 4ρ2)(3 + 4ρ2)(3 + 5ρ2)(6 + 5ρ2)(6 + 7ρ2)

E[Aij ,kl ] =1(42

)E[G(4)2 ] =

θ

(3 + ρ)(2 + ρ)

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 26 / 28

Page 45: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

outlook

test for HGT (γ > 0) in the infinitely many genes model, based onthe number of incongruent pairs.

joint distribution of gene frequency and mutations in thecorresponding gene sequence

other possible extensions of the IMG model:

I selection, structured populations, changing population size

apply the model to other bacteria:I E. Coli, green sulfer bacteria, epidemic strains, gut bacteria, soil

bacteria

...

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 27 / 28

Page 46: The evolution of bacterial genomes under horizontal …anr-manege/Aussois2013/Baum...The evolution of bacterial genomes under horizontal gene transfer Franz Baumdicker { Peter Pfa

Thank you for your attention!

The infinitely many genes modelBaumdicker, F., W. R. Hess, and P. Pfaffelhuber (2010).The diversity of a distributed genome in bacterial populations.Ann. Appl. Probab. 20 (5).

model applied to cyanobacterial pangenome, estimates, IMaGeBaumdicker, F., W. R. Hess, and P. Pfaffelhuber (2012).The infinitely many genes model for the distributed genome of bacteria.Genome Biol Evol Vol. 4, 443-456.

ancestral gene transfer graphBaumdicker, F. and P. PfaffelhuberThe infinitely many genes model with horizontal gene transferarXiv:1301.6547 [math.PR], in review

F. Baumdicker (University of Freiburg) evolution of genomes under HGT April 01, 2013 28 / 28


Recommended