This guide provides a defensible, decision-ready workflow for IBD/ROH interpretation and integrates smoothly with our Genetic Diversity service and Population Structure Analysis workflows.
TL;DR — Read IBD and ROH Like a Decision Tree
If long ROH (e.g., >5–10 Mb) dominate an individual's genome, you're looking at recent inbreeding; quantify F_ROH and update mating plans. A burden of short ROH (≈0.5–1 Mb) signals older drift or a historically small effective population size (Ne); track trajectories, not just snapshots. Widespread IBD segments between pairs point to recent shared ancestry; feed those segments to IBDNe to estimate recent Ne (≈5–50 generations) and set recovery targets. Recurrent ROH islands across individuals mark putative selection or recombination deserts—interrogate before acting. Together, IBD and ROH convert pedigrees and genotypes into actionable levers for genetic management in herds, breeds, and conservation programs.
Pedigrees summarize expected relatedness; the genome records what actually happened. Runs of homozygosity (ROH) arise when both chromosomal copies descend from a shared ancestor, producing long homozygous tracts. The number and length of ROH reveal recent versus ancient common ancestry, while the genome-wide proportion F_ROH captures realized inbreeding that pedigrees often miss—especially in incomplete or shallow pedigree systems common in livestock and wildlife. In practice, F_ROH correlates more closely with fitness outcomes than pedigree-based F, making it the more reliable early-warning signal for inbreeding depression and suboptimal crosses.
For conservation programs balancing demographic recovery with local adaptation, the stakes are high: management that ignores realized inbreeding risks lower survival, fertility, and resilience. Genomic metrics enable you to detect, quantify, and date inbreeding, so breeding plans and translocations can be tuned with evidence rather than guesswork.
F_ROH is the fraction of the autosome in ROH per individual. When tracked alongside fertility, growth, or health outcomes, it clarifies which animals carry the highest realized inbreeding and which matings are most prudent. F_ROH is routinely more informative than pedigree-based estimates in herds with missing or shallow records.
ROH length classes are time-aware indicators: long ROH reflect recent common ancestors (a few generations), whereas short ROH reflect older drift from a historically small Ne. This distinction informs whether managers should adjust near-term matings or expand founders/immigrants to lift long-term diversity.
Identity-by-descent (IBD) segments shared between individuals carry information about recent Ne. With IBDNe, you can infer Ne(t) over roughly the last 5–50 generations, quantify whether a population is declining, stable, or recovering, and benchmark the effect of management interventions.
ROH islands—regions repeatedly homozygous across many individuals—can mark selection, drift, or recombination deserts. Cross-check islands against recombination maps and trait associations before acting; livestock syntheses show islands in known targets such as MSTN and other production-related loci. Treat these as candidates for validation, not definitive proof of selection.
ROH calling on sequence and arrays. For whole-genome sequencing or exomes, BCFtools/RoH uses a hidden Markov model (HMM) to detect autozygous tracts, handling genotype likelihoods gracefully. For arrays or rapid screening, PLINK provides fast sliding-window ROH detection with transparent parameters (--homozyg-*). To partition autozygosity by age classes, RZooRoH models HBD (homozygosity-by-descent) classes, profiling recent and ancient contributions within individuals.
IBD detection for pairs. Refined IBD shows improved power and accuracy for phased data and is a robust source of segment calls for IBDNe. Together they deliver interpretable recent effective population size trajectories.
Ancient or low-coverage samples. for archaeological or sparse modern data, hapROH leverages reference haplotypes to call long ROH at very low coverage, enabling parental-relatedness inference through time.
Detecting runs of homozygosity using a reference panel. (Ringbauer H. et al. (2021) Nature Communications).
This stage feeds directly into our Genetic Diversity service, where QC thresholds and parameter templates are standardized so recurring projects remain comparable across seasons and sites.
When F_ROH is high and dominated by long ROH, prioritize near-term mating changes. When F_ROH is high but dominated by short ROH, treat it as a legacy of historically small Ne and consider founder expansion or genetic rescue rather than just reshuffling current pairings.
Total length and number of ROH for three size classes including Small (0.5 to 1 Mb), Medium (1 to 5 Mb) and Large (>5 Mb). (Liu C. et al. (2022) PLOS ONE).
Relatedness that slips through QC inflates ROH in offspring and masks true population structure—a problem for both breeding value estimation and conservation triage.
For multi-origin herds or reintroduced populations, estimate ancestry-specific Ne to catch uneven recovery that a pooled analysis could mask.
Incidence plots of SNPs in ROH for six sheep populations. (Gorssen W. et al. (2021) Genetics Selection Evolution).
These deliverables are included in our genetic diversity analysis so you can re-run the same analysis year-over-year and demonstrate improvement quantitatively.
Meaning: recent inbreeding (close matings or small current Ne).
Action: update mating design, broaden crosses, and monitor F_ROH in the next generation. In conservation contexts, plan genetic rescue to inject diversity.
Meaning: legacy of historically small Ne with little current inbreeding.
Action: protect effective size—avoid severe skews in breeder contributions, maintain census size, and consider immigration over within-line reshuffling.
Meaning: population contraction over the last 5–50 generations.
Action: set numeric Ne targets, plan founder recruitment, and re-estimate Ne every cycle to validate improvement.
Plots of GAM models, with the variable to be predicted "AV index" and the predictors. (Blondeau Da Silva S. et al. (2024) Heredity).
Meaning: probable selection (e.g., production traits in livestock).
Action: moderate selection intensity or leverage alternative lines to retain diversity around targeted loci; cross-check with recombination maps to avoid mistaking recombination deserts for selection.
Scatter plot of total homozygosity per individual against four corrected carcass traits. (Zhao X. et al. (2021) BMC Genomics)
Meaning: standard callers may fail.
Action: use hapROH to recover long ROH; interpret results within a time-aware framework (e.g., by archaeological period or release cohorts).
Methods (example wording):
"We performed ROH calling with BCFtools/RoH (sequence) and PLINK (arrays) using species-appropriate thresholds; for age-class partitioning we applied RZooRoH. We detected pairwise IBD in phased data with Refined IBD, and inferred recent effective population size using IBDNe from filtered segments. We computed F_ROH per individual, summarized ROH length bins, estimated Ne(t) with confidence intervals, and identified ROH islands by overlapping ROH across individuals. Software versions, recombination maps, and parameter sets are documented in Supplementary Methods."
Figures:
Tables: per-sample F_ROH, pairwise IBD summaries, IBDNe outputs, island coordinates and gene overlaps.
Start with published ranges (e.g., 0.5–1 Mb = older drift, 1–5 Mb = intermediate, >5–10 Mb = recent) and adjust for marker density and species-specific recombination. Keep bins consistent over time and explain them in Methods.
Yes. Across simulations and real populations, marker-based measures capture realized IBD more accurately, especially when pedigrees are shallow or incomplete. For management, F_ROH is typically the more predictive lever.
IBDNe infers Ne over about 5–50 generations from the distribution of long IBD segments. It is well suited to the timeframe of modern breeding and conservation actions.
Use BCFtools/RoH (HMM) for sequence/exome; PLINK for quick array-based scans; RZooRoH to decompose HBD into age classes; Refined IBD + IBDNe to add recent Ne context. Document versions, maps, and thresholds for reproducibility.
Yes. hapROH reliably detects long ROH from low-coverage data using a modern reference panel and has been applied to thousands of ancient genomes to infer parental relatedness.
IBD and ROH are practical genomic instruments for protecting population health. By combining F_ROH, ROH length spectra, pairwise IBD, IBDNe, and ROH islands, you can separate recent inbreeding from legacy drift, quantify recent Ne, and target interventions where they matter. The playbook is reproducible and audit-ready: choose the right caller (BCFtools/RoH, PLINK, RZooRoH), detect and filter IBD with Refined IBD, infer Ne(t) with IBDNe, and ship a clear Methods/reporting pack.
If you want these insights operationalized with consistent deliverables, our genetic diversity analysis packages ROH/IBD calling → F_ROH & IBDNe → reviewer-ready reporting, and integrates with population structure analysis for stratification control. We'll calibrate thresholds by species and platform, establish ROH length bins, set Ne targets, and hand you a repeatable SOP you can run every season.
Related reading:
References