The Ultimate Guide to eccDNA Sequencing: Definitions vs ecDNA, Cancer Workflows, and Cross‑Species Atlas Resources
Extrachromosomal circular DNA (eccDNA) has moved from a curiosity to a practical research target across oncology, developmental biology, and plant genomics. This guide brings clarity on three axes: (1) precise definitions and distinctions between eccDNA and ecDNA; (2) cancer‑focused detection and sequencing workflows spanning wet lab and bioinformatics; and (3) cross‑species atlas/database navigation. The goal is to help you assemble rigorous experiments and analyses without conflating small eccDNA with large oncogenic ecDNA.
Key takeaways
- Use "eccDNA" to describe small circular DNA fragments and "ecDNA" for large (≈0.1–10 Mb) oncogenic circles; they differ in size, gene content, and behavior.
- For eccDNA sequencing, short reads excel at breakpoint detection in enrichment workflows (e.g., Circle‑Seq), whereas long reads are essential to reconstruct large ecDNA structures.
- A robust workflow removes linear DNA (exonucleases), amplifies circles (phi29 RCA), validates junctions (PCR/Sanger), and confirms structure/location (FISH, long‑read or optical mapping).
- Recent databases (e.g., eccDNABase, scEccDNAdb) offer multi‑species entries and single‑cell contexts; leverage them to cross‑check findings and plan experiments.

eccDNA vs ecDNA: clear terminology and core differences
Confusion between eccDNA and ecDNA leads to misinterpretation. eccDNA generally denotes smaller circular DNA molecules arising from chromosomal fragments and appearing in normal and cancer tissues, while ecDNA refers to large circular amplicons (≈0.1–10 Mb) enriched in cancers, often carrying intact oncogenes and regulatory elements. Authoritative reviews outline these boundaries, including the 2024 eccDNA overview by Wang and colleagues and the 2024 Frontiers in Genetics synthesis by Zhou and co‑authors. For ecDNA prevalence and implications, Weiser and co‑authors’ 2025 Cancer Discovery guide aggregates pan‑cancer evidence.
| Feature | eccDNA | ecDNA |
|---|---|---|
| Typical size | Sub‑kb dominant; often <500–800 bp; can reach several kb in cancers | ≈0.1–10 Mb; often 1–3 Mb circles |
| Gene content | Partial fragments; rarely entire genes | Frequently carries intact oncogenes and regulatory elements |
| Amplification | Typically not clonally amplified | High‑copy amplification; rapid evolution |
| Prevalence | Present in normal and cancer tissues | Enriched across cancers (~17% pan‑cancer) |
| Roles | Potential regulatory/stress responses | Drives heterogeneity and therapy resistance |
Supporting readings with methods and definitions include Wang’s 2024 open‑access review in PMC and Zhou’s 2024 Frontiers in Genetics article on identification methods, while ecDNA’s cancer role is synthesized in Weiser’s 2025 Cancer Discovery review.
How eccDNA forms and why it matters
Mechanisms span DNA damage and repair pathways:
- Microhomology‑mediated end joining (MMEJ) and alternative end‑joining can circularize fragments at microhomology tracts.
- Non‑homologous end joining (NHEJ) and homologous recombination (HR) yield circles in double‑strand break contexts.
- Catastrophic events like chromothripsis and breakage–fusion–bridge cycles seed large amplicons that can circularize into ecDNA.
These models affect detection. For example, microhomology‑rich junctions favor split‑read discovery in short‑read datasets, while complex, repetitive ecDNA often demands long‑read resolution and graph‑based reconstruction. Recent method syntheses and historical context are summarized in 2024–2025 reviews, including the 2025 Theranostics overview.
Detection toolbox: wet‑lab steps, QC, and common pitfalls
A dependable eccDNA detection workflow removes linear DNA, increases circular DNA abundance, and validates structures before sequencing.
-
Exonuclease digestion of linear DNA
- Plasmid‑Safe DNase (PSD) selectively degrades linear DNA prior to rolling‑circle amplification (RCA). Be mindful of nicked circles that may be vulnerable to some nucleases; a 2024 overview outlines enzyme choices and caveats.
- RecBCD exonuclease V workflows use serial digestions (often up to 48 hours) with heat inactivation; qPCR of nuclear (e.g., HBB) and mitochondrial markers (e.g., MT‑CO1/ND5) can track depletion.
-
Mitochondrial DNA depletion
- If mtDNA carryover is high, strategies to linearize and digest mtDNA (endonucleases or CRISPR‑guided cuts) can reduce noise before RCA, as summarized in 2024–2025 method reviews.
-
Rolling‑circle amplification (phi29)
- Isothermal RCA using phi29 boosts circular DNA. Note the bias toward small circles; plan orthogonal validation for larger candidates.
-
Circle‑Seq enrichment and short‑read sequencing
- Typical steps include alkaline treatment and column separation, PSD digestion, phi29 RCA, library prep, and paired‑end sequencing. Junction‑centric bioinformatics (split reads and discordant pairs) follow.
-
CsCl–EtBr density gradient centrifugation
- In samples with higher circle abundance, buoyant‑density separation can further enrich circles, often paired with exonucleases.
QC and troubleshooting highlights:
- Verify linear DNA removal by qPCR before RCA to reduce artifacts.
- Validate putative circles via outward/inverse PCR and Sanger sequencing of junctions.
- If mtDNA dominates reads, add targeted depletion or adjust extraction to reduce organellar DNA.
For general nucleic acid quality practices applicable to DNA workflows, see this practical guide to extraction and QC from our knowledge base: The Most Comprehensive Guide to RNA Extraction. Its principles on yield, purity, and integrity translate well to DNA contexts.
Sequencing strategies for eccDNA sequencing: short vs long reads and when to use each
Choosing a platform depends on circle size, required structural resolution, and budget.
-
Illumina short reads (150–300 bp PE)
- Strengths: high base accuracy; sensitive detection of small eccDNA; precise breakpoint mapping via split reads and discordant pairs. Method syntheses, including the 2025 Theranostics review, discuss these advantages.
- Limitations: poor resolution for large/repetitive circles; full reconstruction of complex ecDNA is often infeasible without long‑read support.
-
PacBio HiFi long reads (10–25 kb+) and Oxford Nanopore long reads (tens to hundreds of kb; ultra‑long feasible)
- Strengths: capture long stretches across repeats, enabling reconstruction of large or complex circles and ecDNA; ONT also offers native methylation signals. Applications in liquid biopsy settings are summarized by Si and colleagues (2024).
- Limitations: higher per‑read error rates (especially ONT; mitigated with consensus/polishing), cost/input requirements, and the need for orthogonal validation.
-
Hybrid strategies
- Combine short reads for junction precision with long reads for scaffolding and full‑circle reconstruction. This is often the pragmatic path for suspected ecDNA.
Context pages that frame short‑ vs long‑read decisions in transcriptomics also provide useful analogies for DNA: platform overviews and short‑ vs long‑read primers can help structure thinking around accuracy, read length, and throughput trade‑offs: Transcriptome Sequencing and Data Analysis Service and Short‑read vs long‑read cDNA and direct RNA sequencing.
Bioinformatics pipelines: from reads to circular DNA calls
A practical toolkit has emerged across 2024–2026:
- eccDNA‑pipe integrates raw FastQ (WGS, Circle‑Seq, or Circulome‑seq) with detection modules (Circle‑Map or AmpliconArchitect), then performs annotation and visualization; see Fang and colleagues’ 2024 description in Briefings in Bioinformatics.
- Circle‑Map detects eccDNA junctions in enrichment datasets, providing coordinates and size distributions; this approach is covered in 2024 methodological reviews.
- AmpliconArchitect (AA) reconstructs focal amplicon graphs in WGS datasets; AmpliconClassifier (AC) categorizes structures (ecDNA/cyclic, BFB, HSR, complex, linear). Cross‑validation with cytogenetics and long reads is recommended; Weiser’s 2025 review summarizes AA/AC usage and performance.
- Complementary resources like CytoCellDB place calls in karyotype and cell‑line contexts to support classification and hypothesis generation.
Outputs to expect:
- Coordinates and sizes for small eccDNA;
- Structural classes and copy number for ecDNA;
- Gene annotations and visualization artefacts (breakpoint graphs, manhattan plots, 3D models);
- QC flags indicating confidence and validation status.
Databases and atlas navigation across species
Several updated resources help you explore circular DNA across tissues and organisms:
- eccDNABase (2025, Molecular Biology and Evolution) offers multi‑species entries with annotation and download options; use species filters and export BED/CSV where provided. See the 2025 database description on the MBE site.
- scEccDNAdb (2024, Oxford Academic Database) integrates single‑cell eccDNA contexts for human and mouse; consult the 2024 Database article for the portal link and query examples.
- Mouse tissue atlas datasets released in 2025 report eccDNA distributions across tissues and age groups, useful for baseline expectations.
- CytoCellDB (2024, NAR Cancer) focuses on cancer cell‑line karyotypes and supports ecDNA/HSR predictions that complement AmpliconSuite outputs.
Practical tips:
- Start with gene‑centric queries (oncogenes, stress‑response genes) and then pivot to species and tissue filters.
- Export entries and track metadata (sample type, platform, depth) to avoid over‑generalizing across heterogeneous datasets.
Cancer‑focused validation: combining orthogonal evidence
No single method is sufficient. A robust confirmation stack typically includes:
- PCR of junctions (outward/inverse) followed by Sanger sequencing to confirm circular topology and exact breakpoints.
- FISH to visualize localization and amplification patterns.
- Long‑read (ONT/PacBio) or optical genome mapping to resolve repetitive architectures and verify ecDNA models.
Pairing these methods increases confidence and improves interpretability for downstream functional studies.
Plant genomics: evidence and method adaptations
Plant systems offer clear, validated examples and unique technical considerations.
- Rice (Oryza sativa) profiling with Circle‑Seq identified tens of thousands of small eccDNAs across tissues, with environmental conditions modulating abundance; most circles fell in the 200–400 bp range, as reported in a 2024 open‑access study.
- In Amaranthus palmeri, herbicide resistance is associated with large replicons carrying EPSPS and, in some cases, GS2; rearranged circular DNA supports dual resistance, according to a 2025 Plant Cell article.
Method notes for plants:
- Adapt extraction to manage cell walls and secondary metabolites; include cleanup steps to reduce inhibitors.
- Plan organellar DNA depletion to limit plastid/mitochondrial carryover.
- Use long reads and junction PCR to validate structures; FISH on nuclei is feasible in some tissues.
Putting it all together: a practical end‑to‑end flow
Start with careful extraction and QC to ensure high molecular weight DNA (especially if long reads are planned). Remove linear DNA via exonucleases and, where needed, deplete mtDNA. Amplify circles with phi29 RCA while tracking potential size bias. Choose sequencing based on goals: Illumina short reads for small eccDNA discovery and breakpoint mapping; PacBio HiFi or Nanopore for suspected large ecDNA and complex repeats; hybrid approaches when both precision and structure are needed. Analyze enrichment data with Circle‑Map or eccDNA‑pipe; interrogate WGS for ecDNA using AmpliconArchitect + AmpliconClassifier and support classifications with cytogenetic context from CytoCellDB. Validate candidates with junction PCR/Sanger, FISH imaging, and long‑read or optical mapping. Finally, cross‑reference findings in databases such as eccDNABase and scEccDNAdb to situate your results in a broader cross‑species landscape.
Next steps and resources
Disclosure: CD Genomics is our product. When planning platform choices and study design, platform overview pages and short‑ vs long‑read primers can help structure decisions without prescribing a single approach. For readers interested in parallels across circular nucleic acids, a circRNA‑seq overview offers useful context. Internal resources:
- Platform overview: eccDNA Sequencing Service
FAQ — Common questions about eccDNA sequencing
Q1 — What is the difference between eccDNA and ecDNA?
A1 — eccDNA refers to relatively small circular DNA molecules derived from chromosomal fragments and commonly found in normal and diseased tissues; ecDNA denotes much larger circular amplicons (≈0.1–10 Mb) that frequently carry intact oncogenes and drive tumor heterogeneity and resistance, as synthesized in recent reviews such as Wang 2024 and Weiser et al. 2025.
Q2 — What are the essential wet‑lab steps for eccDNA sequencing workflows?
A2 — Core steps are: careful extraction (preserve high‑molecular‑weight DNA for long reads), selective removal of linear DNA (e.g., Plasmid‑Safe DNase or RecBCD), optional mitochondrial depletion, rolling‑circle amplification (phi29) for enriched protocols, and library preparation matched to the chosen platform. See the Detection toolbox section for checkpoints and the phi29/RCA caveats summarized in Wang 2024.
Q3 — Short reads, long reads, or hybrid — which should I pick?
A3 — Use short reads (Illumina) for sensitive detection of small eccDNA and precise junction mapping; choose long reads (PacBio HiFi or ONT) to resolve large, repetitive ecDNA architectures; combine both (hybrid) when you need junction precision plus full‑structure reconstruction (see Sequencing strategies section and platform framing in the guide).
Q4 — Which bioinformatics tools are recommended for enrichment versus WGS data?
A4 — For enrichment/Circle‑Seq data, junction‑centric tools like Circle‑Map or integrated pipelines such as eccDNA‑pipe are commonly used. For WGS ecDNA discovery and reconstruction, use AmpliconArchitect + AmpliconClassifier and validate with long‑read assemblies or optical mapping; see Fang et al. 2024 for eccDNA‑pipe and Weiser 2025 for AA/AC usage.
Q5 — How should I validate candidate circles experimentally?
A5 — A minimal validation stack: outward/inverse PCR of junctions with Sanger sequencing (topology confirmation), FISH to localize amplification in cells, and long‑read sequencing or optical mapping to confirm larger architectures. Combining methods increases confidence; see the Cancer‑focused validation section.
Q6 — Where can I find cross‑species eccDNA atlases and how do I query them?
A6 — Start with curated resources such as eccDNABase (2025) for multi‑species entries and scEccDNAdb (2024) for single‑cell contexts. Query by gene, species, or genomic coordinates; export BED/CSV where available and retain sample metadata (platform, tissue, depth) to avoid overgeneralizing across heterogeneous studies.
Q7 — What special considerations apply to plant samples?
A7 — Plants present extraction challenges (cell walls, secondary metabolites) and organellar DNA carryover. Adapt extraction and cleanup steps, include plastid/mitochondrial depletion, and validate with junction PCR plus long reads or FISH when possible; validated plant examples include rice profiling and Amaranthus resistance cases cited in the guide (Zhuang 2024; Carvalho‑Moore 2025).
Q8 — Do common enrichment methods bias size distributions of detected circles?
A8 — Yes. RCA (phi29) and Circle‑Seq protocols preferentially amplify smaller circles, which can underrepresent large ecDNA; CsCl gradients and long‑read approaches reduce that bias for larger structures. Interpret size distributions with method bias in mind (see Detection toolbox and Sequencing strategies).
Q9 — Can eccDNA be reliably detected in clinical sample types (FFPE, plasma)?
A9 — It is possible but more challenging: FFPE DNA fragmentation and low abundance in plasma reduce sensitivity and increase artifacts. Optimize extraction, include strong negative controls, consider target enrichment or deeper sequencing, and validate candidates orthogonally (PCR/FISH/long reads) before interpretation.
Q10 — How do I start a reproducible eccDNA sequencing project?
A10 — Key initial checklist: define biological question and target size range; select enrichment versus WGS; plan platform (short, long, or hybrid) and estimated depth; include linear‑DNA depletion controls and spike‑ins; predefine validation methods (PCR, FISH, long reads); and register metadata and analysis parameters for reproducibility. Use the Practical Workflow Assembly section and the internal QC resources (e.g., the RNA extraction guide) for transferable sample and QC best practices: RNA extraction and QC guide.
Reference:
- eccDNA definitions and methods (2024): Wang’s open‑access review on eccDNA
- Identification methods and terminology (2024): Frontiers in Genetics review by Zhou and colleagues
- Cancer ecDNA prevalence and implications (2025): Cancer Discovery synthesis by Weiser and co‑authors
- Method evolution and platform context (2025): Theranostics overview of eccDNA workflows
- Multi‑species database (2025): eccDNABase on Molecular Biology and Evolution
- Single‑cell database (2024): scEccDNAdb in Oxford Academic Database
- Cytogenetic context for amplicons (2024): CytoCellDB in NAR Cancer
- Rice eccDNA profiling (2024): Open‑access study reporting tissue‑wide eccDNA
- Amaranthus palmeri resistance circles (2025): Plant Cell article on large replicons