TL;DR — The 30-second answer
Linkage disequilibrium (LD) describes non-random associations between alleles at different loci. In linkage equilibrium vs disequilibrium, allele combinations occur as expected or depart from expectation. In linkage disequilibrium analysis, we quantify this with r² vs D'. Use r² to judge how well one variant predicts another. Use D' to infer historical recombination. LD powers GWAS tagging, imputation, and quality control. It also reveals population structure and past demography.
Before You Dive In
This guide explains linkage disequilibrium (LD) in plain language and shows how it shapes GWAS, QC, and fine-mapping decisions. You'll learn the difference between linkage equilibrium vs disequilibrium, when to use r² vs D', and the practical steps to design, compute, visualise, and interpret LD for research-use projects. By the end, you can apply linkage disequilibrium analysis directly to tag SNP selection, clumping/pruning, imputation, and reporting.
LD exists when alleles at two positions co-occur more or less often than chance. Imagine two biallelic SNPs: A/a and B/b. If chromosomes carrying A are more likely to carry B than b, the locus pair shows positive LD. If A tends to pair with b, LD is negative. When observed haplotype counts match the product of single-locus frequencies, the pair is in linkage equilibrium.
A tiny toy example helps. Suppose in 1,000 chromosomes we see:
Allele frequencies are A=0.5 and B=0.5. Under equilibrium, each haplotype should be 0.25 (250 counts). We see an excess of AB and ab. That is LD.
LD is not only about physical proximity. Recombination breaks LD over time, but drift, bottlenecks, selection, and admixture can also create or maintain LD among distant sites. Genotyping error can mimic LD. Good LD work always starts with sound quality control.
LD is a workhorse behind many common analyses.
Bottom line: LD, used well, saves cost, boosts power, and prevents misleading results.
Two related metrics answer different questions.
At-a-glance guide
| Aspect | r² | D' |
| Best for | Tagging, power | Recombination history |
| Sensitive to MAF | Yes | Less so |
| Interpretation | 0.2 low, 0.5 moderate, ≥0.8 strong for tagging | ≥0.9 often "complete" LD given allele counts |
| Pitfall | Underestimates linkage for rare variants | Inflated by rare alleles and small samples |
Practical tip. For GWAS and pruning, use r² thresholds. For block boundaries or hotspot detection, consider D' with MAF filters and confidence bounds.
Several processes change LD over space and time.
Marginal effects for the interactions of LD (r2) estimated from pool-seq. (Lucek K. & Willi Y. (2021) PLOS Genetics)
Understanding these forces helps you design analyses and interpret exceptions, such as the extended LD around the MHC region.
LD typically decays with physical distance. In many species, r² drops sharply over a few kilobases, then approaches a low background level. The exact curve depends on recombination rate, effective population size, and demography.
Patterns of LD for Orthologous Genomic Segments of Approximately 5 Mb in Rat, Human, and Mouse. (Guryev V. et al. (2006) PLOS Genetics)
Haplotype blocks are segments where recombination has been historically limited, yielding clusters of variants in high LD. Within blocks, a small set of tag SNPs can capture most variation. Between blocks, recombination has been frequent.
Why this matters:
Effect of MAF on the nature of LD and its decay in the mini core collection. (Otyama P.I. et al. (2019) BMC Genomics)
Good study design avoids headaches later. Plan these items early.
Cohort size and balance
Marker density and coverage
Allele frequency filters
Hardy–Weinberg equilibrium and call quality
Relatedness and duplicates
Phasing choices
Documentation
Purpose. A practical, decision-first path from raw genotypes to LD-driven outputs—no commands required.
Inputs. VCF/PLINK files, sample metadata (ancestry, batches, relatedness), reference build.
Workflow.
Key parameters. Window 200–1,000 kb; r² = 0.1–0.2 (pruning), ≥0.8 (strong tagging); apply MAF filters for D'; analyse per-ancestry; exclude long-range LD regions (e.g., MHC); phasing optional for r², helpful for D'.
Outputs. Pruned/clumped variant lists, block BEDs, LD matrices; LD heatmaps, decay curves, locus figures; a 1–2 page summary explaining choices and next steps.
Tool selection guide (when to use what)
| Tool | Primary use | Strengths | Caveats | Typical outputs |
| PLINK | r² matrices, pruning, clumping | Fast, standard in GWAS workflows | Limited interactive plotting | r² tables, pruned/clumped variant lists |
| VCFtools | LD stats from VCF windows | Simple, VCF-native | Less feature-rich than PLINK | Pairwise LD summaries |
| scikit-allel (Python) | Flexible LD calc, matrices | Programmable, easy custom filters | Requires Python skills | LD matrices for plotting |
| Haploview | Blocks, D' heatmaps | Classic block visualisation | Legacy UI; export then post-process | PNG/SVG heatmaps, block definitions |
| LocusZoom-style tools | Locus plots with LD | Great for manuscripts | Needs summary stats + LD source | Locus figures with r² colouring |
Interpretation ties statistics to decisions.
Clumping vs pruning
Credible sets and conditional tests
Trans-ethnic fine-mapping
Reporting
A short checklist prevents most LD headaches.
A research team planned a GWAS for a quantitative trait. They began with a dense pilot panel and linkage disequilibrium analysis across 500,000 common variants. Using r² ≥ 0.8 within 250 kb windows, they selected tag SNPs that captured most local correlation. The final array had ~35% fewer markers than the draft list. In power simulations based on the pilot LD matrices, >95% of associations detectable by the dense panel were still detectable. This reduced lab costs and data storage, while maintaining statistical performance. The team then applied stricter pruning for PRS construction and looser clumping for discovery analyses. The combination delivered robust signals and efficient follow-up.
The lesson is simple. Calibrate r² thresholds to your trait architecture and sample size. Test choices on a pilot before you scale.
What is LD in simple terms?
It is a measure of how often certain allele pairs appear together more or less than chance.
When should I use r² vs D'?
Use r² for tagging and power. Use D' to infer recombination or define blocks, with MAF filters.
What is "high LD"?
For tagging, many groups call r² ≥ 0.8 high. Always state thresholds and context.
How do I compute LD from a VCF?
Filter for quality and MAF. Then run PLINK --r2 in windows. Inspect matrices and plots.
Why does LD differ across ancestries?
Demography and recombination histories differ. Populations with larger effective size often show faster LD decay.
Does phasing affect LD estimates?
Phasing improves haplotypes and D' accuracy. Many r² calculations work on unphased genotypes.
How large should my sample be?
Larger samples stabilise LD, especially for low-frequency alleles. Power calculations should guide final numbers.
Can I compare LD across species or panels?
Yes, with care. Match MAF bins, sample sizes, filters, and window settings.
What regions need special handling?
The MHC and known long-range LD regions. Check species-specific lists before clumping or pruning.
You do not need to guess your thresholds alone. We help teams design LD-aware GWAS and fine-mapping plans, set transparent QC, and deliver clear, citable reports. Discuss your cohort, array or WGS plan, and the downstream analyses you want to enable. We align LD steps with your trait models, ancestry makeup, and regulatory reporting needs for research.
Start your project: Linkage Disequilibrium Analysis Service
Note: Services are for research use only. We do not provide clinical or personal testing.
Related reading:
References