Going forward: discuss how linkage generates LD and recombination breaks it down
Miller (2020), Fig. 5.12.3
One (at least) crossover in meiosis I per chromosome! But: rates vary between loci (hotspots), sex chromosomes vs autosomes, and in some species, recombination only occurs in one sex (e.g., D.melanogaster).
d - physical distance in bp
Definiton: genetic distance between two points is x centiMorgans(cM) if the average number of crossovers between points x/100 per meiosis
For short distances, genetic distance in cM \approx \mathrm{Pr}(\text{crossover})
Definition: recombination rate r relates genetic distance to base pair distance; commonly measured in cM/Mb
Example: in human average is 1.2 cM per Mb - the probability of crossover is a 1.2%
Given only information about SNP allele frequencies p_A and p_B, what would guess be for p_{AB}?
If independent, then p_{AB} = p_Ap_B, else p_{AB} \neq p_Ap_B. We measure the deviation D
D = p_{AB} - p_Ap_B
and say that there is linkage equilibrium if D=0!
Unfortunate property of D: its magnitude depends on allele frequencies!
\begin{align*} D^\prime = \frac{D}{D_\mathrm{max}} & = \frac{D}{\min(p_Ap_b,p_ap_B)}, \quad\mathrm{for\ }D>0\\ & = \frac{D}{\min(p_Ap_B,p_ap_b)}, \quad\mathrm{for\ }D<0 \end{align*}
|D^\prime| < 1 implies there must have been recombination
r^2 = \frac{D^2}{p_Ap_ap_Bp_b}
Even for free recombination (r=0.5) LD decay takes time. Comparison: HWE which takes one generation.
Due to lack of data LD didn’t become important until the end of the 70’s
Most loci are in close linkage with a variable site which can be used as a marker to study the inheritance of a trait of interest.
Reduced variation close to a site under selection due to linkage
Strong LD in large region indication of young allele.
Ideal window size ~ distance at which LD between markers approaches background levels
Linkage disequilibrium