Archaic Introgression Analysis Service
Overview
Archaic introgression analysis is a specialized genomic research approach. It focuses on detecting and interpreting genetic material. This genetic material is inherited from archaic hominins, like Neanderthals and Denisovans, into modern human genomes. These introgressed segments still exist in present-day populations. They have contributed to adaptive traits, disease susceptibility, and human phenotypic diversity. Our service uses advanced computational and genomic tools. It maps archaic introgression, assesses its functional and evolutionary impacts, and offers insights into its role in shaping modern human biology.
Our Archaic Introgression Analysis Service Offers:
- High-Precision Introgression Detection:
Utilize deep whole-genome sequencing data and machine learning-based methods (e.g., S* and Sprime). These methods help identify archaic-derived haplotypes with high confidence. This is possible even in regions of low complexity or recent gene flow. We use reference panels of archaic and modern human genomes. They help us distinguish introgressed segments from ancestral variation.
- Comparative Genomics and Ancestry Mapping:
Employ comparative genomics. It is across diverse modern human populations and archaic hominins. This helps us trace the geographic and demographic distribution of introgressed regions. We annotate lineage-specific adaptations. For example, high-altitude adaptation in Tibetans is linked to Denisovan EPAS1. We also identify regions subject to positive selection.
- Functional Impact and Disease Association Analysis:
Predict the biological effects of introgressed variants. We use in silico tools (e.g., RegulomeDB, DeepSEA) for this. We integrate functional genomic datasets (e.g., eQTLs, chromatin accessibility). This helps us assess their role in gene regulation, protein function, or disease risk. We highlight variants linked to immune responses, metabolic traits, and neurological disorders.
- Evolutionary Dynamics and Population History Reconstruction:
Analyze patterns of introgression frequency, linkage disequilibrium, and haplotype structure. This helps us infer the timing and strength of selection acting on archaic alleles. We use coalescent simulations and demographic modeling. They help us reconstruct population history events, such as admixture pulses and purifying selection against maladaptive variants.
- Customized Visualization and Reporting:
Provide interactive, publication-ready visualizations of introgressed regions, including Manhattan plots, haplotype networks, and selection scans. Our reports summarize key findings, evolutionary interpretations, and potential implications for health and ancestry, tailored to research or clinical applications.
Archaic Introgression Analysis is a pioneering genomic methodology dedicated to detecting and interpreting the legacy of interbreeding events between anatomically modern humans and archaic hominins, such as Neanderthals and Denisovans. These ancient gene flow episodes introduced archaic-derived alleles into the modern human genome, which have since been shaped by natural selection, genetic drift, and demographic processes. By systematically characterizing archaic introgressed segments—including their frequency, distribution, and functional impacts—researchers can elucidate how these ancestral genetic variants influenced adaptive traits (e.g., immune responses, high-altitude adaptation, skin pigmentation), susceptibility to diseases, and the evolutionary trajectory of human populations. This approach not only sheds light on the mosaic nature of the human genome but also provides critical insights into the dynamic interplay between hybridization, adaptation, and survival across deep evolutionary timescales.
How to Measure
1. Sample Collection and Preparation
Sample Types:
- Human: Blood, saliva, buccal swabs, or ancient DNA (aDNA) from fossils/bones.
- Archaic Hominins: Neanderthal/Denisovan remains (e.g., teeth, long bones) for high-quality aDNA extraction.
- Environmental/Microbial: Soil or sediment samples from archaic habitat sites (for metagenomic reconstruction of extinct microbiomes).
Genotyping/Sequencing Technologies for Archaic Introgression Detection:
Table 1: Key Methods and Their Characteristics
| Technology |
Application Scenario |
Key Advantages |
| Whole-Genome Sequencing (WGS) |
High-resolution archaic ancestry mapping |
Detects introgressed tracts with high accuracy; identifies novel/rare variants. |
| Targeted Enrichment Sequencing |
Focused analysis of candidate regions |
Cost-effective for large cohorts; prioritizes known adaptive loci (e.g., EPAS1). |
| Array-Based Genotyping |
Genome-wide screening for archaic alleles |
High throughput; pre-designed probes for known archaic SNPs (e.g., Human Origins Array). |
| Read-Depth Analysis (NGS-based) |
Low-coverage CNV/introgression calling |
Scalable for ancient DNA; minimizes sequencing costs. |
| Haplotype-Based Methods |
Phasing archaic tracts in modern genomes |
Resolves ancestry of long haplotypes (e.g., S-STARR for Neanderthal segments). |
Data Quality Control (QC):
- Sample-Level: Exclude samples with low DNA integrity (e.g., high fragmentation in aDNA), contamination (>2% non-human DNA), or insufficient coverage (<1× for aDNA).
- Data-Level: Filter out low-confidence calls (e.g., introgressed tracts <10 kb or <5 supporting reads; SNPs with missingness >5%).
2. Statistical Analysis Workflow for Archaic Introgression
Introgression Detection and Segmentation:
- Tools:
- S-STARR (for archaic haplotype phasing).
- ArchaicSeeker 2.0 (machine learning-based detection).
- Sprime (identifies adaptive introgression under selection).
- Approach: Use HMMs or likelihood-ratio tests to distinguish archaic vs. modern human ancestry. Define introgressed tracts by breakpoint accuracy (>95% confidence).
Population-Level Analysis:
- Frequency Estimation: Calculate archaic allele frequencies across populations (e.g., higher in East Asians vs. Africans for Denisovan ancestry).
- Selection Tests:
- iHS (integrated haplotype score) for ongoing selection.
- PBS (population branch statistic) to identify rapid frequency shifts.
- Admixture Graphs: Model gene flow between archaic and modern populations (e.g., using TreeMix or ADMIXTOOLS).
Functional Annotation:
- Gene Overlap: Prioritize introgressed variants in coding regions, promoters, or enhancers (e.g., HYAL2 for Neanderthal-derived immune adaptation).
- Pathway Enrichment: Test for overrepresentation in disease-related pathways (e.g., TLR1 variants linked to autoimmune disorders).
- Regulatory Impact: Predict effects on transcription factor binding (e.g., using RegulomeDB or DeepSEA).
3. Visualization and Reporting of Archaic Introgression
Visualization Tools:
- Circos Plots: Display introgressed tracts across chromosomes (e.g., Neanderthal segments in non-African genomes).
- Manhattan Plots: Highlight genomic regions under selection (e.g., EPAS1 in Tibetans for high-altitude adaptation).
- Haplotype Networks: Compare archaic and modern haplotypes (e.g., BNC2 for skin pigmentation).
- IGV: Manually inspect read alignments to confirm introgression breakpoints.
Significance Thresholds:
- Tract Length: ≥10 kb (for confident archaic ancestry assignment).
- Statistical Significance:
- p < 1×10⁻⁸ (Bonferroni-corrected for genome-wide scans).
- q < 0.05 (FDR-corrected for multiple testing in pathway analysis).
Reporting Guidelines:
- Include metadata (e.g., ancestry, sequencing depth, contamination estimates).
- Report both raw and phased introgressed tracts with confidence scores.
- Discuss functional implications (e.g., "Denisovan-derived TBX15 variant associated with fat distribution in Oceanians").
Figure 1: Archaic introgression Analysis
What Can We do
Archaic Haplotype Screening: We use reference-quality archaic genomes, like Neanderthal and Denisovan genomes. We use tools such as Sprime and ArchaicSeeker 2.0 to identify introgressed segments in modern human populations. We focus on regions with high divergence from African-ancestry genomes. This helps us prioritize adaptive variants.
Functional Annotation of Introgressed Regions: We integrate epigenomic datasets, like ATAC-seq and ChIP-seq. We also use gene expression profiles, like GTEx. This helps us assess how archaic alleles influence regulatory networks, tissue-specific expression, and phenotypic outcomes. For example, EPAS1 affects Tibetan high-altitude adaptation.
Selective Sweep Detection: We apply long-range haplotype tests, such as iHS and Rsb. We also use machine learning frameworks, like SelScan and DeepSweep. This helps us distinguish archaic alleles under positive selection from neutral background introgression. We can link them to traits like immune responses (TLR1/6/10 cluster) or viral resistance.
Population-Specific Introgression Mapping: We use local ancestry deconvolution tools, like RFMix and LAIT. This helps us dissect archaic contributions across diverse populations, such as Oceanians and East Asians. We can reveal geographic patterns of adaptive retention or purging of archaic variants.
Phenotype-Genotype Association Studies: We conduct GWAS and PheWAS in cohorts with archaic ancestry, like the UK Biobank and Biobank Japan. This helps us test correlations between introgressed alleles and complex traits, such as lipid metabolism and neurological disorders. We control for confounding due to population stratification.
Our Advantages

Beyond Neutral Survival: While many archaic alleles were likely eliminated by purifying selection, our approach prioritizes functionally relevant introgressed segments by integrating genome-wide association studies (GWAS), expression quantitative trait loci (eQTL) mapping, and deep learning-based predictive models (e.g., DeepSEA, Basenji). This enables us to identify archaic variants that influence modern human traits, such as lipid metabolism, immune response, and neural development, even at low frequencies.

Population-Specific Context: Archaic introgression patterns vary dramatically across human populations due to historical migration, admixture, and local adaptation. We leverage high-coverage ancient genomes (e.g., Neanderthal, Denisovan, and early modern humans) alongside global diversity panels (1000 Genomes, Simons Genome Diversity Project) to reconstruct fine-scale introgression maps, distinguishing between shared and population-private archaic haplotypes and their selective pressures.

Structural Variant Sensitivity: Traditional methods often overlook archaic-derived structural variants (SVs), such as inversions, duplications, and retrotransposon insertions, which can have profound functional consequences. Our pipeline integrates long-read sequencing (PacBio, Nanopore) and optical mapping to detect and validate archaic SVs, revealing their roles in genomic architecture, gene regulation, and disease susceptibility in modern populations.

Epigenomic and 3D Genome Integration: Many archaic alleles exert their effects through non-coding regulatory elements (e.g., enhancers, CTCF-binding sites) that shape chromatin architecture. By combining archaic introgression data with epigenomic profiles (ATAC-seq, ChIP-seq, DNA methylation) and Hi-C chromatin interaction maps, we predict how archaic variants rewire gene regulatory networks, offering mechanistic insights into their contributions to human-specific phenotypes.
Applications
1. Human Evolutionary Genetics
Archaic introgression analysis plays a pivotal role in human evolutionary genetics by identifying genomic regions inherited from Neanderthals, Denisovans, or other archaic hominins. These regions provide insights into adaptive traits such as immune responses, lipid metabolism, and high-altitude adaptation, revealing how ancient gene flow contributed to the survival and diversification of modern human populations.
2. Disease Susceptibility Studies
By mapping archaic introgressed alleles in contemporary human genomes, researchers can investigate their associations with disease risk. For example, certain Neanderthal-derived variants have been linked to autoimmune disorders, neurological conditions, and viral resistance, offering clues about the evolutionary trade-offs between adaptation and health vulnerabilities.
3. Population Genomics and Demographic History
Archaic introgression analysis helps reconstruct ancient population interactions and migration patterns. By quantifying the extent and distribution of archaic ancestry across global populations, scientists can infer the timing, geographic scope, and selective pressures shaping modern human genetic diversity, enhancing our understanding of human prehistory.
4. Functional Genomics and Gene Regulation
Studying archaic introgressed regions enables the exploration of gene regulatory evolution. Comparative analyses of archaic and modern human regulatory elements reveal how these variants influence gene expression patterns, potentially explaining phenotypic differences between archaic and modern humans and their role in adaptive evolution.
Demo
Figure 2: Fine-scale genetic structure of the modern Japanese and its three ancestry origins. (Liu, 2024)
Archaic introgression contributed to shape the adaptive modulation of angiogenesis and cardiovascular traits in human high-altitude populations from the Himalayas
Journal: Elife
Published: 2024
The scientific consensus holds that modern humans admixed with extinct hominins, leaving traces in genomes. Some introgressed archaic alleles, linked to metabolism and environmental responses, likely underwent adaptive introgression, facilitating human adaptation to diverse settings, yet the full extent of this influence remains to be explored.
To evaluate Tibetan WGS data's representativeness and ancestry, we merged it with East-Asian genotyping data, performed QC, and assessed shared ancestry via IBD. After LD-pruning and filtering, PCA and ADMIXTURE (K=2-12) were conducted, with results visualized using R software.
To further shortlist the most robust candidate genes involved in AI events, we applied the LASSI algorithm to phased Tibetan WGS data with the aim of searching for genomic signatures ascribable to the action of natural selection.
This enabled us to confirm the strong selective events occurred at the EPAS1 and EGLN1 genes, as previously reported by multiple studies conducted on high-altitude Himalayan populations, as well as to corroborate adaptive evolution of some of the genes pointed out by both VolcanoFinder and Signet analyses. In fact, several chromosomal intervals associated to these loci presented values of the computed T statistic that fall within the top 5% of the related distribution
Figure 3: Representation of genetic distances between modern and archaic haplotypes.
FAQs
Why is Archaic Introgression Analysis important for understanding modern human health and adaptation?
Archaic introgression analysis is crucial because it reveals how ancient interbreeding events between modern humans and archaic hominins (e.g., Neanderthals, Denisovans) contributed to our genetic diversity. Many archaic-derived alleles have been linked to adaptive traits, such as enhanced immune responses to pathogens, metabolic adjustments to new diets, and tolerance to extreme environments. However, some introgressed variants also increase susceptibility to diseases like autoimmune disorders, allergies, or neurological conditions, highlighting the complex interplay between adaptation and health risks in human evolution.
How does Archaic Introgression Analysis differ from standard genome-wide association studies (GWAS)?
While GWAS identifies genetic variants associated with traits or diseases in modern populations, archaic introgression analysis specifically focuses on tracing the origins of these variants to archaic hominins. It examines genomic regions with unusually high divergence from African populations (where archaic ancestry is minimal) or uses statistical methods to detect segments inherited from Neanderthals or Denisovans. This approach provides evolutionary context, revealing whether a variant was beneficial in ancient environments but may now contribute to disease in modern lifestyles.
References
- Liu X, Koyama S, Tomizuka K, Takata S, Ishikawa Y, Ito S, Kosugi S, Suzuki K, Hikino K, Koido M, Koike Y, Horikoshi M, Gakuhari T, Ikegawa S, Matsuda K, Momozawa Y, Ito K, Kamatani Y, Terao C. Decoding triancestral origins, archaic introgression, and natural selection in the Japanese population by whole-genome sequencing. Sci Adv. 2024 Apr 19;10(16):eadi8419. https://doi.org/10.1126/sciadv.adi8419
- Ferraretti G, Abondio P, Alberti M, Dezi A, Sherpa PT, Cocco P, Tiriticco M, Di Marcello M, Gnecchi-Ruscone GA, Natali L, Corcelli A, Marinelli G, Peluzzi D, Sarno S, Sazzini M. Archaic introgression contributed to shape the adaptive modulation of angiogenesis and cardiovascular traits in human high-altitude populations from the Himalayas. Elife. 2024 Nov 8;12:RP89815. https://doi.org/10.7554/eLife.89815
* Designed for biological research and industrial applications, not intended
for individual clinical or medical purposes.