TL;DR – Quick Platform Picker for Genomic Selection
This guide offers a practical genotyping technology comparison and explains how to choose between LC-WGS, WGS, GBS, and SNP arrays – including LC-WGS vs GBS and LC-WGS vs SNP arrays – for real breeding and genomic selection projects.
Figure 1. Overview of how WGS, LC-WGS, GBS, and SNP arrays support genomic selection and genetic gain.
Choosing the right genotyping platform is one of the most impactful decisions in a modern breeding program. The platform you pick determines how many candidates you can genotype, how dense your markers are, and how reliable your genomic predictions will be.
Genomic selection is a breeding approach that uses genome-wide markers and statistical models to predict the performance of candidates and select parents earlier and more efficiently. Marker density, error rate, imputation quality, and cost per sample all shape prediction accuracy and, ultimately, genetic gain per year and per dollar.
If marker density is too low, you may miss key QTLs and underestimate relationships among lines. If per-sample cost is too high, you may be forced to reduce population size and lose selection intensity. The optimal genotyping strategy balances marker quality with population size for each phase of your breeding pipeline.
Figure 2. Conceptual workflow of genomic selection: a training population with both phenotypes and genome-wide markers is used to train prediction models, which then estimate genomic breeding values for new candidates based only on their genotypes. (Budhlakoti M. et al. (2022) Frontiers in Genetics)
In most plant and animal programs, genomic selection links three components:
Once the model is trained, you can apply a cost-effective genotyping platform—often LC-WGS, GBS, or SNP arrays—to thousands of candidates, compute genomic estimated breeding values (GEBVs), and select parents without waiting for full multi-year field data.
There is no single "best" genotyping platform for genomic selection. The right choice depends on marker density requirements, population size, reference genome quality, trait architecture, and budget. In practice, many successful programs combine WGS for discovery and reference panels with LC-WGS, GBS, or SNP arrays for routine genotyping.
This section explains what each platform does in simple terms so that PIs, breeding project leads, and technical decision-makers can compare them clearly.
LC-WGS is whole-genome sequencing at low depth (for example 0.5–4×) combined with genotype imputation to obtain dense SNP genotypes across the genome.
Instead of reading every base many times, LC-WGS "skims" the genome. The raw data are too thin to call genotypes at all sites confidently, so imputation methods fill in the gaps using a reference panel—often constructed from WGS of founders and key lines. When the reference panel matches the breeding germplasm, imputed LC-WGS can provide millions of SNPs per sample at moderate cost.
Typical use cases for an LC-WGS sequencing service for genomic selection include:
WGS is deep sequencing of the entire genome to capture most genetic variants present in an individual.
Because it aims to detect nearly all SNPs and structural variants, WGS has the highest information content and usually the highest cost per sample among the four platforms. In breeding and genomic selection, WGS is typically applied to a smaller set of lines rather than all candidates.
Common applications for a whole-genome resequencing service:
In short, WGS lays the foundation that makes LC-WGS and GBS more powerful.
GBS is a reduced-representation sequencing approach that uses restriction enzymes and barcoded adapters to sample SNPs across the genome at low cost.
Figure 3. Example of a classic genotyping-by-sequencing (GBS) library design, showing how restriction enzymes, barcoded adapters, and PCR primers work together to generate multiplexed, reduced-representation libraries for SNP discovery and genotyping. (Elshire R.J. et al. (2011) PLOS ONE)
Instead of sequencing entire genomes, GBS digests DNA, selects fragments around enzyme cut sites, and sequences those fragments for many samples in parallel. The result is a sparse but genome-wide marker set, often with substantial missing data that can be partially imputed.
Typical uses for a GBS genotyping service for plant breeding populations include:
SNP arrays are fixed microarrays that genotype a predefined set of SNPs at known genomic positions.
A chosen array design (for example, 40K, 100K, or 600K SNPs) is applied consistently across many samples. Each sample is hybridized to the array, and the platform reports genotypes at those specific markers. Arrays are known for:
SNP arrays are widely used for:
Many programs access this technology through an SNP array genotyping service for routine GS and QC or a crop-specific SNP panel service.
This section compares LC-WGS, WGS, GBS, and SNP arrays by cost per sample, marker density, and data quality in the context of breeding and genomic selection.
In practice, there is always a compromise between how many markers you want and how many samples you can afford. WGS provides the most information but limits sample size. GBS and arrays are cheaper per sample but provide fewer markers or more missing data. LC-WGS aims for a middle ground: dense markers at moderate cost, powered by good imputation.
The table below summarizes typical characteristics. Exact numbers vary by crop, genome size, and provider, but the relative positions are consistent with published benchmarking and field experience.
| Platform | Typical Marker Density | Relative Per-Sample Cost | Coverage / Missing Data | Imputation Dependence | Reference Genome Requirement | Best-Fit Use Cases |
|---|---|---|---|---|---|---|
| Whole-Genome Resequencing | Millions of SNPs, genome-wide | Highest | High depth, low missingness | Optional (for phasing, rare) | Strong reference strongly preferred | Discovery, reference panels, fine mapping |
| LC-WGS | Millions after imputation | Medium | Low depth, high raw missingness | High (central to workflow) | Good reference + reference panel | Large GS cohorts, multi-trait GS, GWAS + GS |
| GBS | Tens–hundreds of thousands | Low | Variable depth, many missing calls | Moderate to high (optional) | Helpful but not mandatory | Early diversity, low-budget GS, GWAS in new species |
| SNP array | Tens–hundreds of thousands | Low to medium | High call rate, low missingness | Low to moderate (dense arrays) | Reference useful for annotation | Routine GS, QC, long-running multi-environment trials |
LC-WGS and GBS rely more heavily on imputation than WGS or arrays. Imputation performs best when:
Practical suggestions from real projects:
This section explains how to choose between LC-WGS and GBS for large plant breeding populations and genomic selection pipelines.
LC-WGS is usually preferable when:
In these situations, LC-WGS often provides a better balance of marker density, cost, and flexibility than GBS.
GBS remains a strong option in several scenarios:
Here, a GBS genotyping service for plant breeding provides an accessible entry point into genomic selection and diversity analysis.
Teams that have run both technologies often report that:
The key is to match platform complexity to your current genomic infrastructure and bioinformatics capacity.
Recommended Services for This Step:
Learn More:
This section shows when it makes sense to move from SNP arrays to LC-WGS in genomic selection and breeding programs.
SNP arrays remain a very competitive choice when:
In these contexts, an SNP array genotyping service for routine GS and QC often gives the lowest risk and easiest implementation.
LC-WGS brings additional capabilities that arrays cannot easily match:
For programs that already run arrays successfully, LC-WGS sequencing service for genomic selection can be introduced first in experimental cohorts or specific trait projects to quantify the added value.
A gradual transition reduces technical and operational risk:
Working with a provider that offers WGS, LC-WGS, GBS, and SNP array services in a coordinated way makes this transition much smoother.
Recommended Services for This Step:
Learn More:
This section shows how to match each genotyping platform to concrete breeding scenarios instead of treating them as isolated technologies.
In the discovery and pre-breeding phase, the goal is to understand diversity, build solid genomic resources, and capture useful haplotypes:
Investments at this stage make all later LC-WGS, GBS, and SNP array work more informative.
For routine genomic selection, the question is mainly how to maximize genetic gain per unit cost and time:
A good plant breeding genotyping strategy treats platform choice, model design, and field trial structure as an integrated system rather than separate decisions.
Requirements shift slightly for GWAS and diversity analysis:
Pairing the right platform with a GWAS and genomic analysis service ensures that marker density, population design, and replication are adequate for your trait architecture.
You can reuse this simple checklist whenever you start a new project:
This process keeps technology choices aligned with breeding goals rather than vendor catalogs.
This section describes what a typical end-to-end genotyping project looks like when you move from concept to execution.
A typical project with CD Genomics as an integrated sequencing and bioinformatics provider includes:
Throughout the process, unexpected issues—such as sample mix-ups or unusual relatedness patterns—can be detected and discussed before they impact field seasons.
Figure 4. CD Genomics as an integrated provider of WGS, LC-WGS, GBS, and SNP array services for breeding and genomic selection projects.
Some recurring best practices:
These practical steps often protect more value than a small difference in cost per sample.
If you are still weighing LC-WGS vs GBS vs SNP arrays for your next season, a project-specific review is usually the fastest way to a clear answer.
By sharing your crop, target traits, population size, existing data (arrays, GBS, WGS), and budget range with CD Genomics, you can receive:
This turns technology confusion into a concrete, actionable genotyping plan.
Key Takeaways: LC-WGS, WGS, GBS, and SNP Arrays
Q1. Do I need a reference genome to use LC-WGS or GBS for genomic selection?
A high-quality reference genome is strongly recommended for LC-WGS because imputation accuracy depends on how well reference haplotypes represent your breeding germplasm. GBS can be used without a reference, but having one improves alignment, variant calling, and GWAS interpretation.
Q2. Is LC-WGS accurate enough for genomic selection compared with SNP arrays?
When designed properly—with adequate coverage, a representative reference panel, and a well-tuned imputation pipeline—LC-WGS can provide genomic prediction accuracy comparable to or higher than dense SNP arrays in many crops. Accuracy depends on the combination of marker density, training population size, and trait architecture rather than the platform alone.
Q3. Which is cheaper for large breeding populations, LC-WGS or GBS?
On a per-sample basis, GBS is often slightly cheaper than LC-WGS, especially at high multiplex levels. However, LC-WGS can deliver denser markers and may reduce other costs (such as repeated panel redesign) over time. The most economical option depends on your required marker density, available reference resources, and the number of lines genotyped each season.
Q4. How many SNP markers do I need for genomic selection in crops?
There is no universal threshold. Many crop programs obtain useful prediction accuracy with tens of thousands of well-distributed markers, while some complex traits and diverse germplasm panels benefit from hundreds of thousands of SNPs. Instead of chasing an absolute number, aim for sufficient coverage of the genome, a well-designed training population, and consistent genotyping quality.
Q5. Can I mix platforms across cycles or panels in the same breeding program?
Yes. Many programs successfully combine WGS for reference panels, LC-WGS or GBS for routine cohorts, and SNP arrays for specific QC or legacy datasets. The important step is to harmonize data across platforms—usually by imputing onto a common marker set—so that genomic prediction models see consistent genotypes over time.
References
Send a MessageFor any general inquiries, please fill out the form below.
CD Genomics is propelling the future of agriculture by employing cutting-edge sequencing and genotyping technologies to predict and enhance multiple complex polygenic traits within breeding populations.