At a glance:
A de novo genome assembly can look great in a report and still disappoint in downstream work. In animal and plant projects, heterozygosity, polyploidy, and repeats create failure modes that a single headline metric won't reveal.
The practical answer upfront is simple: no single metric can judge de novo assembly quality on its own. N50, BUSCO, and QV measure different things and should not be treated as interchangeable. This article focuses on how to interpret those three metrics in animal and plant de novo sequencing projects and decide whether the assembly is genuinely useful rather than only impressive on paper.
All discussion here is for research use only (RUO) evaluation of de novo genome assembly deliverables.
One-number judgments are attractive because they make outsourcing decisions feel crisp: compare a value to a target and approve. The risk is that "assembly quality" is not one dimension. It's a bundle of properties that matter differently depending on your organism and your downstream objective.
A practical way to frame assembly metrics that matter in de novo sequencing (and other de novo genome assembly metrics) is to separate three questions:
N50, BUSCO, and QV each primarily map to one of these. None of them, alone, certifies structural correctness, correct haplotype handling, or repeat representation.
Jauhal & Newcomb show that high BUSCO can occur even when N50 is low, and they urge reporting additional assessment metrics beyond N50 (Molecular Ecology Resources, 2021).
N50 is a contiguity statistic. It summarizes how your assembled bases are distributed across contigs or scaffolds. Put plainly: is half of the assembly carried by sequences at least this long?
In other words, N50 helps quantify continuity, but it is not a standalone genome assembly QC verdict.
BUSCO (Benchmarking Universal Single-Copy Orthologs) is widely used in genome assembly evaluation because it moves the conversation from "how long are the contigs?" to "is conserved gene space present and intact?"
BUSCO searches for conserved orthologs expected in a selected lineage dataset and reports them as:
Used carefully, this is a fast readout of gene-space completeness and fragmentation, which often predicts annotation pain.
Primary references include the original BUSCO paper (Simao et al., Bioinformatics, 2015) and practical guidance on interpretation and run modes (Manni et al., Current Protocols, 2021).
BUSCO is not a universal assembly quality score:
Rhie et al. also highlight this limitation: BUSCO examines conserved single-copy genes and does not evaluate the most difficult-to-assemble regions, and it can be inaccurate when true copy number or sequence variants were not considered when building the BUSCO set (Rhie et al., Genome Biology, 2020).
QV (quality value) for an assembly is a Phred-scaled estimate of the consensus base error rate. Conceptually, it answers: how often are the bases wrong, on average?
Because QV is log-scaled, each +10 is roughly a 10× reduction in error rate.
In modern assembly quality assessment, QV is often estimated without a reference using k-mers. Merqury is a widely cited example: it compares k-mers in a de novo assembly to those found in unassembled high-accuracy reads to estimate base-level accuracy and completeness (Rhie et al., Genome Biology, 2020).
Rhie et al. are explicit that Merqury does not directly validate structural accuracy, and some misassemblies (such as inversions) could go unnoticed (Rhie et al., Genome Biology, 2020).
This is the core logic: N50, BUSCO, and QV are most useful when treated as complementary, not competing.
When you see N50 BUSCO QV in a deliverables table, read them as three different questions. The point isn't BUSCO vs N50; it's whether each metric supports your downstream goal, and whether the three agree.
N50, BUSCO, and QV describe different aspects of assembly quality and should be interpreted together.
A balanced profile is not a universal threshold. It's a combination that is coherent:
1) High N50 + high BUSCO + low QV
2) High N50 + low/fragmented BUSCO
3) High BUSCO + low N50
4) High QV + high duplicated BUSCO (animal/plant caution)
A useful next step is to align this metric profile with the expected project scope and deliverables. For animal and plant de novo projects, that framing is often clearer when you start from the intended downstream use and required outputs, then back into the acceptance criteria. For reference, see Animal/Plant Whole Genome De Novo Sequencing.
Metrics only matter insofar as they predict whether your next step will work.
If the goal is a reference that supports gene prediction and functional interpretation, then:
This is where animal and plant genome assembly quality is easiest to judge: annotation outputs expose base errors and fragmentation quickly.
If the goal is structural variation, haplotype-aware biology, or cross-line comparisons:
If the downstream plan includes pan-genome comparisons or haplotype-resolved biology, evaluate whether the assembly representation and validation evidence actually support those aims, not just whether the headline metrics are high.
[Human-added insight: how project goals change which metric combinations matter most.]
These are the misinterpretations that most often turn a "good-looking" report into a downstream problem.
High N50 can reflect true continuity or incorrect joins. If scaffolding validation is weak, N50 can rise while structural correctness falls.
The breakdown matters. Fragmented and duplicated categories often contain the decision-relevant signal, especially in plants where duplication biology and assembly artifacts can look similar.
QV is base accuracy, not a guarantee of correct structure, correct haplotypes, or correct repeat representation.
An assembly can be "high metric" and still be unusable if deliverables do not match the goal.
[Human-added troubleshooting note: the report sections clients most often overlook.]
If you need platform-specific context on how analysis choices affect QC outputs, see PacBio Sequencing Data Analysis and Oxford Nanopore Sequencing Data Analysis.
Use this as the final acceptance filter: does the assembly look ready for your downstream work?
A practical checklist helps researchers evaluate whether a de novo assembly is ready for downstream use.
RUO-safe CTA: Define acceptance criteria early and review N50, BUSCO, and QV together.
If you only have five minutes, ask for the BUSCO category breakdown, the QV estimation method, and the validation evidence behind scaffolding.
No. N50 measures contiguity only. In animal and plant assemblies, high N50 can coexist with missing gene space, haplotype artifacts, or misjoins. Use N50 with BUSCO (gene-space completeness) and QV (consensus accuracy) to judge whether the assembly is likely to be useful for the downstream research you plan.
BUSCO tests whether conserved genes expected in your lineage dataset are present and intact, and reports single-copy, duplicated, fragmented, and missing categories. That makes it a biological complement to N50. N50 can be high even when gene space is incomplete; BUSCO can reveal that early.
Not necessarily. High QV indicates strong consensus base accuracy, which helps annotation and reduces false frameshifts. But QV does not certify structural correctness, haplotype correctness, or repeat representation. A contig can be base-accurate and still be misjoined or biologically misleading at long range.
They cover different failure modes: contiguity (N50), conserved gene-space completeness (BUSCO), and base-level confidence (QV). Reading them together helps you spot contradictions, like long contigs with missing genes, gene-complete assemblies with too many consensus errors, or polished consensus without evidence of structural correctness.
Yes, if the weakness does not block your downstream goal. Gene discovery may tolerate lower contiguity if BUSCO and QV support reliable gene models. SV/haplotype work may require validated structure even when BUSCO is high. Decide based on your intended analyses, not generic thresholds.
Goals determine which failure modes are unacceptable. Annotation-focused projects are sensitive to BUSCO fragmentation/missingness and QV. Haplotype-resolved or SV-heavy projects are sensitive to structural correctness and haplotype handling. Pan-genome work adds a need for consistent QC and deliverables across samples.
Review the evidence behind the numbers: BUSCO lineage dataset and breakdown, how QV was estimated (and with what reads), and what structural validation was performed. Also confirm deliverables match your plan (assembly representation, annotation outputs, QC artifacts, reproducibility notes).
No. This article is for research use only (RUO) evaluation of animal and plant de novo genome assemblies. It does not make clinical or diagnostic claims and is not intended for patient care.
Dr. Yang H.
Senior Scientist at CD Genomics
LinkedIn: Dr. Yang H. on LinkedIn
For research purposes only, not intended for personal diagnosis, clinical testing, or health assessment