The Wright-Fisher model

Of population models and genealogies

Per Unneberg

Models of populations

Wright-Fisher model

Model of populations that describes genealogical relationships of genes (chromosomes) in a population under the following assumptions (Hein et al., 2005):

  • discrete and non-overlapping generations
  • haploid individuals or two subpopulations (males and females)
  • constant population size
  • all individuals are equally fit
  • population has no geographical or social structure
  • no recombination

Wright-Fisher model

Model of populations that describes genealogical relationships of genes (chromosomes) in a population under the following assumptions (Hein et al., 2005):

  • discrete and non-overlapping generations
  • haploid individuals or two subpopulations (males and females)
  • constant population size
  • all individuals are equally fit
  • population has no geographical or social structure
  • no recombination

Algorithm

  1. Setup starting population at time zero

Wright-Fisher model

Model of populations that describes genealogical relationships of genes (chromosomes) in a population under the following assumptions (Hein et al., 2005):

  • discrete and non-overlapping generations
  • haploid individuals or two subpopulations (males and females)
  • constant population size
  • all individuals are equally fit
  • population has no geographical or social structure
  • no recombination

Algorithm

  1. Setup starting population at time zero
  2. Add offspring (same size) at time one

Wright-Fisher model

Model of populations that describes genealogical relationships of genes (chromosomes) in a population under the following assumptions (Hein et al., 2005):

  • discrete and non-overlapping generations
  • haploid individuals or two subpopulations (males and females)
  • constant population size
  • all individuals are equally fit
  • population has no geographical or social structure
  • no recombination

Algorithm

  1. Setup starting population at time zero
  2. Add offspring (same size) at time one
  3. Select parents to offspring at random

Wright-Fisher model

Model of populations that describes genealogical relationships of genes (chromosomes) in a population under the following assumptions (Hein et al., 2005):

  • discrete and non-overlapping generations
  • haploid individuals or two subpopulations (males and females)
  • constant population size
  • all individuals are equally fit
  • population has no geographical or social structure
  • no recombination

Algorithm

  1. Setup starting population at time zero
  2. Add offspring (same size) at time one
  3. Select parents to offspring at random

Wright-Fisher model

Model of populations that describes genealogical relationships of genes (chromosomes) in a population under the following assumptions (Hein et al., 2005):

  • discrete and non-overlapping generations
  • haploid individuals or two subpopulations (males and females)
  • constant population size
  • all individuals are equally fit
  • population has no geographical or social structure
  • no recombination

Algorithm

  1. Setup starting population at time zero
  2. Add offspring (same size) at time one
  3. Select parents to offspring at random

Wright-Fisher model

Model of populations that describes genealogical relationships of genes (chromosomes) in a population under the following assumptions (Hein et al., 2005):

  • discrete and non-overlapping generations
  • haploid individuals or two subpopulations (males and females)
  • constant population size
  • all individuals are equally fit
  • population has no geographical or social structure
  • no recombination

Algorithm

  1. Setup starting population at time zero
  2. Add offspring (same size) at time one
  3. Select parents to offspring at random

Wright-Fisher model

Model of populations that describes genealogical relationships of genes (chromosomes) in a population under the following assumptions (Hein et al., 2005):

  • discrete and non-overlapping generations
  • haploid individuals or two subpopulations (males and females)
  • constant population size
  • all individuals are equally fit
  • population has no geographical or social structure
  • no recombination

Algorithm

  1. Setup starting population at time zero
  2. Add offspring (same size) at time one
  3. Select parents to offspring at random

Wright-Fisher model

Model of populations that describes genealogical relationships of genes (chromosomes) in a population under the following assumptions (Hein et al., 2005):

  • discrete and non-overlapping generations
  • haploid individuals or two subpopulations (males and females)
  • constant population size
  • all individuals are equally fit
  • population has no geographical or social structure
  • no recombination

Algorithm

  1. Setup starting population at time zero
  2. Add offspring (same size) at time one
  3. Select parents to offspring at random

Wright-Fisher model

Model of populations that describes genealogical relationships of genes (chromosomes) in a population under the following assumptions (Hein et al., 2005):

  • discrete and non-overlapping generations
  • haploid individuals or two subpopulations (males and females)
  • constant population size
  • all individuals are equally fit
  • population has no geographical or social structure
  • no recombination

Algorithm

  1. Setup starting population at time zero
  2. Add offspring (same size) at time one
  3. Select parents to offspring at random

Wright-Fisher model

Figure 1: Wright-Fisher model

Wright-Fisher model

Figure 2: WF model indicating time direction from past (top) to present (bottom).
Figure 3: WF model tracing the genealogies of all extant chromosomes
Figure 4: WF model tracing the genealogies of three extant chromosomes
Observations
  • most lineages are lost over time
  • looking backwards in time genes eventually coalesce at a common ancestor
  • looking backwards in time sampled genes can be described by a genealogy

The Wright-Fisher sampling model

Let’s formalise the sampling process of the Wright-Fisher model1. We assume

  1. a single locus in a haploid population of size 2N (or diploid of size N when random mating)
  2. no mutation and selection
  3. discrete generations

Each generation we sample 2N new chromosomes from the previous generation. The probability of choosing a chromosome v is 1/2N (coin flip with probability of success 1/2N). Since the trials are independent, and we perform 2N trials, the number of offspring k of a given chromosome v is binomially distributed \mathrm{Bin}(m, p), with parameters m=2N and probability of success p=\frac{1}{2N}.

Properties of Wright-Fisher sampling

The expected number of offspring is one

Poisson approximation for large N

P(v=k) \approx \frac{1}{k!}e^{-k}

Prob(pick same parent) = 1/2N

Time for two sequences to coalesce \sim 1/2N

Bibliography

Hein, J., Schierup, M. H., & Wiuf, C. (2005). Gene genealogies, variation and evolution: A primer in coalescent theory. Oxford University Press. https://books.google.se/books?id=CCmLNAEACAAJ
Hein, J., Schierup, M., & Wiuf, C. (2004). Gene genealogies, variation and evolution. A primer in coalescent theory. In Systematic Biology - SYST BIOL (Vol. 54).
Hermisson, J. (2017). Mathematical population genetics. https://www.mabs.at/fileadmin/user_upload/p_mabs/Lecture_Notes_2017