Identification of DMC, DMR, and DMG in DNA Methylation Analysis

DNA methylation, a chemical modification of DNA, can influence gene expression without altering the DNA sequence. It involves the covalent attachment of a methyl group to the 5-carbon position of cytosine, most commonly within genomic CpG dinucleotides, through the action of DNA methyltransferases. This process has been found to induce changes in chromatin structure, DNA conformation, DNA stability, and DNA-protein interactions, thereby regulating gene expression.

For data mining, DNA methylation analysis typically follows three steps:

  • Genome-wide methylation changes analysis: This includes assessing average methylation level changes, changes in methylation level distribution, performing dimensionality reduction analysis, cluster analysis, correlation analysis, and other related techniques.
  • Methylation difference level analysis: This step involves identifying specific differential genes through DMC/DMR/DMG (differentially methylated CpGs/regions/genes) identification, locating DMC/DMR on genomic elements, conducting TF (transcription factor) binding analysis on DMC/DMR, implementing analysis strategies for time-series methylation data, and performing functional analysis of DMG.
  • Methylation genomics & transcriptomics association analysis: In this phase, the overall association between methylation and expression is assessed at the metagene level, the correspondence between DMGs (differentially methylated genes) and DEGs (differentially expressed genes) is examined, and association networks are constructed.

How, then, is differential methylation analysis carried out to identify DMCs, DMRs, and DMGs?

Differential Methylation Site/Region Analysis - DMC/DMR Analysis

(1) DMC/DMR Identification

In this step, the main goal is to identify specific sites (DMCs) and regions (DMRs) in the genome where differential methylation occurs between different experimental groups. This can involve comparing methylation levels between disease and control samples, treated and untreated samples, or different time points in a time-course experiment.

  • DMC (Differentially Methylated CpG) Identification: Statistical tests are applied to individual CpG sites to determine whether methylation levels differ significantly between experimental groups. CpG sites showing statistically significant differences are labeled as DMCs.
  • DMR (Differentially Methylated Region) Identification: DMCs that are in close proximity to each other are grouped into DMRs. Various algorithms and statistical methods are used to identify regions of the genome with coordinated changes in methylation.
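The two steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method of any particular tool: it uses a two-sided Fisher's exact test on pooled read counts as the per-site test, a hypothetical input layout of `(position, meth_a, unmeth_a, meth_b, unmeth_b)`, and illustrative thresholds (`alpha`, `min_diff`, `max_gap`, `min_sites`). Real pipelines also correct for multiple testing and usually model replicate variability.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Rows are groups; columns are methylated / unmethylated read counts.
    """
    row1, row2, col1 = a + b, c + d, a + c
    denom = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x methylated reads in group 1.
        return comb(row1, x) * comb(row2, col1 - x) / denom

    p_obs = prob(a)
    lo, hi = max(0, col1 - row2), min(row1, col1)
    # Sum the probabilities of all tables no more likely than the observed one.
    p = sum(prob(x) for x in range(lo, hi + 1) if prob(x) <= p_obs * (1 + 1e-9))
    return min(1.0, p)


def call_dmcs(sites, alpha=0.05, min_diff=0.2):
    """Return (position, methylation difference, p-value) for each called DMC."""
    dmcs = []
    for pos, ma, ua, mb, ub in sites:
        if ma + ua == 0 or mb + ub == 0:
            continue  # no coverage in one group, cannot be tested
        diff = mb / (mb + ub) - ma / (ma + ua)
        p = fisher_exact_two_sided(ma, ua, mb, ub)
        if p < alpha and abs(diff) >= min_diff:
            dmcs.append((pos, diff, p))
    return dmcs


def merge_dmcs(dmcs, max_gap=300, min_sites=3):
    """Group neighbouring DMCs into candidate DMRs: (start, end, n_sites)."""
    regions, current = [], []
    for dmc in sorted(dmcs):
        if current and dmc[0] - current[-1][0] > max_gap:
            if len(current) >= min_sites:
                regions.append((current[0][0], current[-1][0], len(current)))
            current = []
        current.append(dmc)
    if len(current) >= min_sites:
        regions.append((current[0][0], current[-1][0], len(current)))
    return regions
```

Grouping by distance alone is the simplest possible region caller; it ignores whether neighbouring DMCs change in the same direction, which dedicated DMR tools do take into account.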

(2) DMC/DMR Transcription Factor Binding Analysis (TF Binding Motif)

The focus in this step is to investigate whether DMCs/DMRs overlap with known transcription factor (TF) binding sites, particularly in regulatory regions like promoters and enhancers.

TF Binding Motif Analysis: Bioinformatics tools and databases are utilized to predict TF binding motifs at DMCs/DMRs. This information can provide insights into the potential regulatory factors involved in the differential methylation.
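At its core, checking whether DMRs coincide with known TF binding sites is an interval-overlap problem. The sketch below assumes half-open coordinates and a hypothetical in-memory layout (tuples rather than BED files); the function names are made up for illustration. It does not perform motif prediction itself, which would require a motif database and a scanning tool.

```python
def overlaps(a_start, a_end, b_start, b_end):
    """True if two half-open intervals [start, end) share at least one base."""
    return a_start < b_end and b_start < a_end


def tf_hits_per_dmr(dmrs, tf_sites):
    """Map each DMR to the sorted names of TFs whose sites overlap it.

    dmrs:     list of (chrom, start, end)
    tf_sites: list of (chrom, start, end, tf_name)
    """
    hits = {}
    for chrom, start, end in dmrs:
        names = {
            tf
            for c, s, e, tf in tf_sites
            if c == chrom and overlaps(start, end, s, e)
        }
        hits[(chrom, start, end)] = sorted(names)
    return hits
```

This brute-force scan compares every DMR with every site, which is fine for small inputs; genome-scale data would normally use sorted intervals or an interval tree.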

(3) Analysis Strategy of Temporal Methylation Data (Time Course)

If the experimental design includes multiple time points or time-course data, the analysis strategy is adapted to compare changes in methylation over time.

  • Comparison Between Neighboring Time Points: Differences in methylation profiles between adjacent time points are assessed to identify dynamic changes.
  • Direct Screening of Time-Course Related DMCs and DMRs: Specific DMCs/DMRs that show consistent changes across time points are identified.
  • Linear Model/Linear Mixed Model: Statistical models, such as linear regression or linear mixed models, are used to account for confounding factors (e.g., sex) and assess time-related changes in methylation.
  • Co-Methylation Pattern Analysis (Stage-Specific Cluster Screening): Clustering methods, such as WGCNA, MEGENA, and mfuzz, can be applied to identify groups of co-methylated sites or regions that exhibit similar patterns across different stages or time points.
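Two of the strategies above, comparison between neighboring time points and a linear model over time, can be illustrated for a single CpG site. This is a minimal sketch using ordinary least squares with time as the only predictor; the function names are hypothetical, and adjusting for covariates such as sex or fitting a mixed model would require a statistics package.

```python
def adjacent_deltas(values):
    """Methylation change between each pair of neighbouring time points."""
    return [later - earlier for earlier, later in zip(values, values[1:])]


def ols_trend(times, values):
    """Fit values = intercept + slope * time; return (slope, intercept, r_squared)."""
    n = len(times)
    mean_t = sum(times) / n
    mean_v = sum(values) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("at least two distinct time points are required")
    sxy = sum((t - mean_t) * (v - mean_v) for t, v in zip(times, values))
    slope = sxy / sxx
    intercept = mean_v - slope * mean_t
    ss_tot = sum((v - mean_v) ** 2 for v in values)
    ss_res = sum((v - (intercept + slope * t)) ** 2 for t, v in zip(times, values))
    # R^2 is undefined for a constant series; report 0.0 in that case.
    r_squared = 1 - ss_res / ss_tot if ss_tot > 0 else 0.0
    return slope, intercept, r_squared


# Hypothetical methylation levels of one CpG at four time points.
slope, intercept, r_squared = ols_trend([0, 1, 2, 3], [0.1, 0.3, 0.5, 0.7])
```

A site with a large absolute slope and a high R² changes steadily over the time course, whereas large adjacent deltas with a small overall slope point to a transient, stage-specific change.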

(4) DMC/DMR Distribution on Gene Elements

In this phase, DMCs and DMRs are located on the genome and their distribution across distinct gene elements is examined.

  • Transposable Element (TE) Analysis: Transposable elements are sequences that can move within the genome and can significantly affect genome stability. Examining their methylation patterns offers insight into genome regulation.
  • Promoter Analysis: Methylation changes within gene promoters matter for gene expression because they can influence the binding of transcription factors and RNA polymerase.
  • Gene Body Analysis: Methylation patterns within gene bodies are also relevant to gene expression and alternative splicing, and thus to the functional output of genes.
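Assigning a DMR to a promoter or gene body comes down to comparing coordinates against a gene annotation. The sketch below assumes half-open coordinates, a promoter defined as the 2 kb upstream of the transcription start site (a common but arbitrary convention that the text does not specify), and hypothetical gene records; TE annotation would follow the same overlap logic with a repeat annotation in place of genes.

```python
def promoter_interval(gene_start, gene_end, strand, upstream=2000):
    """Half-open promoter window upstream of the TSS, taking strand into account."""
    if strand == "+":
        return max(0, gene_start - upstream), gene_start
    # On the minus strand the TSS is at the gene's end coordinate.
    return gene_end, gene_end + upstream


def annotate_dmr(dmr, genes, upstream=2000):
    """Return (gene_name, element) pairs for every element the DMR overlaps.

    dmr:   (chrom, start, end)
    genes: list of (name, chrom, start, end, strand)
    """
    chrom, start, end = dmr
    labels = []
    for name, g_chrom, g_start, g_end, strand in genes:
        if g_chrom != chrom:
            continue
        p_start, p_end = promoter_interval(g_start, g_end, strand, upstream)
        if start < p_end and p_start < end:
            labels.append((name, "promoter"))
        if start < g_end and g_start < end:
            labels.append((name, "genebody"))
    return labels
```

A DMR that straddles the TSS is reported under both elements, which keeps the annotation lossless and leaves any prioritization to downstream analysis.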

DMC/DMR analysis combines computational methods, statistical testing, and the integration of genomic annotation. Together, these reveal patterns of DNA methylation change and their potential functional consequences, offering insight into the epigenetic mechanisms that regulate gene expression in biological processes and disease.

Functional Analysis of Differentially Methylated Genes (DMGs)

Functional analysis of differentially methylated genes (DMGs) is a crucial step in understanding the biological processes and pathways influenced by DNA methylation changes. DMGs are genes that have at least one differentially methylated region (DMR) annotated to their promoter or gene body. DMRs in the promoter region have the potential to influence gene transcription, while DMRs in the gene body region often show a positive correlation with gene expression levels. Analyzing these DMGs can provide valuable insights into the regulatory functions of DNA methylation.

(1) Categorization of DMGs

  • Hyper-DMG: Genes that show increased DNA methylation levels in the DMRs compared to the control group. Increased methylation in the promoter region may lead to transcriptional inhibition.
  • Hypo-DMG: Genes that show decreased DNA methylation levels in the DMRs compared to the control group. Decreased methylation in the promoter region may be associated with increased gene expression.

(2) Promoter-DMG and Genebody-DMG

  • Promoter-DMG: Genes with differentially methylated regions in their promoter regions. These genes may experience altered transcriptional regulation due to changes in promoter methylation.
  • Genebody-DMG: Genes with differentially methylated regions in their gene body regions. Methylation changes in the gene body can impact gene expression and alternative splicing.
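The two categorizations above (direction of change and genomic element) can be combined into a simple cross-classification. The sketch assumes each annotated DMR is available as a hypothetical `(gene, element, mean_diff)` record, where `mean_diff` is the mean methylation in the test group minus that in the control group.

```python
from collections import defaultdict


def categorize_dmgs(annotated_dmrs):
    """Group genes into hyper/hypo x promoter/genebody categories.

    annotated_dmrs: iterable of (gene, element, mean_diff), with element
    being "promoter" or "genebody" and mean_diff = test - control.
    Returns {category_name: sorted list of gene names}.
    """
    categories = defaultdict(set)
    for gene, element, mean_diff in annotated_dmrs:
        if mean_diff == 0:
            continue  # no direction can be assigned
        direction = "hyper" if mean_diff > 0 else "hypo"
        categories[f"{direction}-{element}"].add(gene)
    return {name: sorted(genes) for name, genes in categories.items()}
```

A gene can legitimately appear in more than one category, for example with a hypermethylated promoter and a hypomethylated gene body, so the categories are kept as overlapping sets rather than forcing one label per gene.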

(3) Functional Enrichment Analysis

Functional enrichment analysis places DMGs in their biological context. Gene Ontology (GO) analysis identifies the biological processes, molecular functions, and cellular components enriched among DMGs, and KEGG pathway analysis identifies the biological pathways in which DMGs are significantly enriched.

Reactome pathway analysis further pinpoints the specific molecular pathways and signaling cascades in which DMGs are involved. DisGeNET and Disease Ontology analyses associate DMGs with particular diseases and disease-related terms, providing clues to the potential role of DNA methylation in disease pathogenesis.

Together, these analyses help researchers understand how DNA methylation changes relate to biological processes and formulate hypotheses about the regulatory roles of DNA methylation in physiological and pathological contexts. Integrating epigenetic and functional genomics data in this way contributes to a fuller understanding of gene regulation.

Differential Methylation Level Detection

The differential methylation detection workflow outlined in this section involves several key steps: DMR detection, DMR annotation, functional analysis of differentially methylated genes (DMGs), and visualization of DMRs. This approach allows researchers to identify regions with significant DNA methylation changes, understand their potential functional implications, and visualize the results for easier interpretation.

(1) Differential Methylation Region (DMR) Detection

DMRs are detected using the metilene software, which employs a binary segmentation algorithm combined with double statistical tests (MWU-test and 2D KS-test).

The following criteria are used to define DMRs:

  • Sequencing depth of each CpG site >= 5x.
  • Differential methylation of CpG sites >= 0.2 (indicating a substantial change in methylation levels).
  • Number of differentially methylated CpG sites in the region >= 5 (ensuring the region is supported by multiple sites).
  • Distance between adjacent differentially methylated CpG sites <= 300 bp.
  • MWU-test p-value < 0.05 (indicating statistical significance).

CpG sites that meet these criteria are used to define the differentially methylated regions.
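The criteria above can be expressed as a post-filter on candidate regions. The sketch below is not metilene's own code or interface: it assumes a hypothetical per-region input of `(position, depth, methylation_difference)` tuples plus an MWU-test p-value already computed elsewhere, and it reads the 0.2 threshold as an absolute per-site difference, which is one possible interpretation of the list.

```python
def passes_dmr_criteria(sites, mwu_p, min_depth=5, min_diff=0.2,
                        min_sites=5, max_gap=300, alpha=0.05):
    """Check one candidate region against the DMR criteria.

    sites: list of (position, depth, methylation_difference) per CpG,
           where depth is the lowest sequencing depth across samples.
    mwu_p: Mann-Whitney U test p-value for the region (precomputed).
    """
    # Keep only sufficiently covered CpGs with a substantial difference.
    qualifying = sorted(
        pos for pos, depth, diff in sites
        if depth >= min_depth and abs(diff) >= min_diff
    )
    if len(qualifying) < min_sites or mwu_p >= alpha:
        return False
    # Adjacent differentially methylated CpGs must lie close together.
    return all(b - a <= max_gap for a, b in zip(qualifying, qualifying[1:]))


# Hypothetical candidate region with five well-covered CpGs.
region = [(100, 10, 0.30), (150, 8, 0.25), (300, 12, 0.40),
          (420, 6, 0.22), (600, 9, 0.35)]
is_dmr = passes_dmr_criteria(region, mwu_p=0.01)
```

Exposing every threshold as a keyword argument makes it easy to see how sensitive the final DMR list is to each cutoff.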

(2) Differential Methylation Region (DMR) Annotation

The identified DMRs are annotated to gene body and promoter regions, respectively. This step links the DMRs to specific genes and helps clarify their potential regulatory effects on gene expression.

(3) Functional Analysis of Differentially Methylated Genes (DMGs)

GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) enrichment analyses are performed to study the functions of the DMGs. The enrichment of DMGs in GO terms and KEGG pathways is assessed using a hypergeometric distribution test. This analysis provides insights into the biological processes and pathways influenced by differential DNA methylation.
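The hypergeometric test asks how likely it is to see at least the observed number of DMGs in a term or pathway if the DMGs were drawn at random from the background. A minimal sketch for a single term follows; the parameter names are illustrative, and the choice of background gene set is an assumption that strongly affects the result.

```python
from math import comb


def hypergeom_enrichment_p(background, term_genes, dmgs, dmgs_in_term):
    """Upper-tail hypergeometric p-value, P(X >= dmgs_in_term).

    background:   number of annotated genes in the background set
    term_genes:   number of background genes annotated to the term
    dmgs:         number of DMGs drawn from the background
    dmgs_in_term: number of DMGs annotated to the term
    """
    upper = min(term_genes, dmgs)
    favourable = sum(
        comb(term_genes, i) * comb(background - term_genes, dmgs - i)
        for i in range(dmgs_in_term, upper + 1)
    )
    return favourable / comb(background, dmgs)


# Hypothetical counts: 20,000 background genes, 150 in the pathway,
# 500 DMGs, of which 12 fall in the pathway.
p_value = hypergeom_enrichment_p(20000, 150, 500, 12)
```

Because many terms are tested at once, the resulting p-values would normally be adjusted for multiple testing before any term is reported as enriched.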

(4) Visualization of Differentially Methylated Regions (DMRs)

Because the number of DMRs is typically large, the 20 DMRs with the smallest Q-values (adjusted p-values) are selected for visualization.
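Selecting DMRs by Q-value can be sketched as follows. The text does not say which adjustment produces the Q-value, so the Benjamini-Hochberg procedure used here is an assumption, as are the function names.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted p-values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    qvalues = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonic q-values.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        qvalues[i] = running_min
    return qvalues


def top_dmrs(dmrs, pvalues, n=20):
    """Return the n (dmr, q-value) pairs with the smallest q-values."""
    qvalues = benjamini_hochberg(pvalues)
    ranked = sorted(zip(dmrs, qvalues), key=lambda pair: pair[1])
    return ranked[:n]
```

The returned pairs can then be passed to whatever plotting routine is used to draw the per-region methylation profiles.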

By following this workflow, researchers can identify and characterize differentially methylated regions, gain insights into the potential functional consequences of DNA methylation changes, and effectively visualize the results to aid in data interpretation and hypothesis generation.

For Research Use Only. Not for use in diagnostic procedures.