banner
Genome Sequencing of Cotton: Technologies, Applications, and Future Directions

Genome Sequencing of Cotton: Technologies, Applications, and Future Directions

Inquiry

Cotton stands as a cornerstone of the global fibre crop industry, serving as an indispensable raw material for the textile sector. Its economic significance is undeniable, generating substantial revenue for numerous countries and underpinning a vast textile value chain that spans cultivation, processing, and distribution. This has far-reaching implications for global economic development.

The advent of genome sequencing technology has opened new frontiers in understanding cotton's genetic makeup. By sequencing the cotton genome, we can unveil its complete sequence information, elucidating gene structures, functions, and their interactions. This not only deepens our knowledge of fundamental biological processes such as cotton growth, development, and fibre formation but also provides precise targets and scientific foundations for genetic improvement and variety breeding. The potential to drive sustainable development in the cotton industry is immense.

This article delves into the methodologies of cotton genome sequencing, tracing the achievements of early sequencing efforts and the breakthroughs spurred by technological progress. From initial attempts to modern, high-throughput sequencing techniques, we explore how advancements have accelerated our understanding of cotton genetics.

Methods for Cotton Genome Sequencing

As research into the cotton genome advances, accurate and efficient sequencing methods stand as the cornerstone for unraveling its genetic mysteries. From the initial attempts by early researchers to the current integration of multiple cutting-edge technologies, each breakthrough has paved the way for a more comprehensive and precise understanding of the cotton genome.

Early Sequencing Endeavors

In the journey of cotton genome sequencing, early efforts laid the groundwork for subsequent research. A significant milestone was reached in 2012 with the successful sequencing of the first diploid cotton species, Gossypium raimondii. This achievement marked the dawn of a new era in cotton genome research, providing us with an initial glimpse into the basic structure of the cotton genome. The sequencing of G. raimondii served as a crucial reference sequence, facilitating further investigations into other cotton species.

The genome of the diploid cotton variety Raymond's Cotton (Wang et al., 2012) Genome of diploid cotton Raymond's cotton (Wang et al., 2012)

Meanwhile, significant breakthroughs have also been achieved in the assembly of polyploid cotton genomes, with upland cotton (Gossypium hirsutum) taking center stage. In 2024, Huang utilized a combination of sequencing technologies, including PacBio HiFi, Oxford Nanopore ultra-long read sequencing, and Hi-C, to complete the telomere-to-telomere (T2T) genome assembly of "Zhongmian 113," a leading variety of upland cotton. This assembly covers 26 centromeres, 52 telomeres, and 55 rDNA clusters, offering a high-quality reference for studying the structure and function of the cotton genome.

The research shed light on the evolutionary mechanism of cotton centromeres. Unlike many other species, cotton centromeres lack typical satellite repeats. Instead, they have evolved through the invasion of long terminal repeat (LTR) retrotransposons. This unique centromere structure provides fresh insights into the diversity and evolution of plant centromeres.

Moreover, the study uncovered a regulatory module linked to the Mutator transposon (MuTC01). This module interacts with miR2947 to produce small interfering RNAs (siRNAs), which in turn regulate embryonic development. This discovery highlights the crucial role of transposons in plant gene regulation, offering new clues to understanding the molecular mechanisms underlying cotton embryo development.

This research not only provides a high-quality assembly of the cotton genome but also unravels the evolutionary mechanisms of centromeres and regulatory pathways for embryonic development. These findings lay a solid theoretical foundation for genetic improvement and fundamental research in cotton.

Polyploid cotton genome assembly (Huang et al., 202) Assembly of polyploid cotton genome (Huang et al., 2024)

Gossypium hirsutum, the most widely cultivated cotton species globally, has opened up unprecedented opportunities for genetic improvement and variety breeding with the successful assembly of its genome. By deciphering the upland cotton genome, we gain profound insights into the genetic underpinnings of its superior fiber quality, high yield, and other desirable traits. This knowledge empowers us to undertake targeted genetic modifications aimed at enhancing these traits.

Technological Advancements

The rapid progression of science and technology has placed next-generation sequencing (NGS) technologies and bioinformatics tools at the forefront of cotton genome sequencing. NGS, with its high throughput, cost-effectiveness, and efficiency, has significantly accelerated the process of cotton genome sequencing. It generates vast amounts of sequencing data in a short time, providing a rich source of information for a comprehensive analysis of the cotton genome.

However, the cotton genome presents unique challenges due to its high complexity and specificity. To overcome these hurdles, researchers have continuously explored and innovated, developing a suite of advanced bioinformatics tools and algorithms. These tools are adept at identifying and distinguishing repetitive sequences, enabling precise assembly and analysis of polyploid genomes. For instance, combining long-read sequencing technologies with short-read sequencing strategies allows for a more accurate resolution of complex genomic structures.

Additionally, comparative genomics approaches leverage genomic information from other species to aid in the assembly and annotation of the cotton genome, significantly enhancing its accuracy and completeness. For example, combining long-read and short-read sequencing technologies enables more accurate resolution of complex genomic structures. Additionally, comparative genomics approaches leverage genomic information from other species to assist in the assembly and annotation of the cotton genome, thereby enhancing the accuracy and completeness of the genome assembly.

Cotton gene sequencing technology Cotton gene sequencing technology

Applications of Cotton Genome Sequencing

As cotton genome sequencing progresses, the vast amount of genomic data obtained presents unprecedented opportunities for the cotton industry. This data is a treasure trove, containing the genetic codes behind cotton's growth, development, stress resistance, and other key traits. Transforming this valuable genomic information into practical productivity to drive the improvement of cotton varieties and the upgrading of the industry has become a crucial focus in current cotton genome research.

Enhancing Fiber Quality

Cotton fiber quality is a pivotal factor influencing its economic value, and genome sequencing offers robust technical support for fiber quality enhancement. Through genome sequencing, researchers can identify genes responsible for important quality traits such as fiber length, strength, and fineness. The discovery of these genes allows us to delve into the regulatory mechanisms of fiber development at the molecular level, providing clear targets for genetic improvement of fiber quality.

Pei and colleagues utilized Hi-C sequencing and ChIP-Seq technologies to investigate the dynamic changes in chromatin conformation and gene expression regulation during different developmental stages (0, 5, 10, and 20 days) of cotton fibers. Their findings revealed that chromatin states transition from active (A compartment) to inactive (B compartment) during fiber development, correlating with the silencing of developmental inhibitor genes. The two subgenomes (At and Dt) exhibited coordinated behavior in chromatin state transitions and gene expression regulation, particularly during the later stages of fiber development. Dynamic chromatin loops play a crucial role in rewiring gene regulatory networks, with notable differences observed between the two subgenomes, suggesting a bias in homologous gene expression. This study unveiled the spatiotemporal asymmetry of the three-dimensional genome structure during cotton fiber development, offering new insights and a theoretical foundation for plant cell differentiation and fiber quality improvement.

Three-dimensional genome structure during cotton fibre development (Pei et al., 2022). Three-dimensional genome structure during cotton fibre development (Pei et al., 2022)

Li and colleagues conducted PacBio HiFi and Oxford Nanopore sequencing on 216 diploid cotton (Gossypium arboreum) and 3,606 tetraploid cotton (Gossypium hirsutum) samples. Through meticulous assembly, they reconstructed 15 Asian cotton and 35 upland cotton genomes, forming a pangenome. Their research unveiled that during the domestication and selection processes, cotton genomes lost substantial sequences and genes, predominantly in highly divergent regions (HYD). Notably, these regions contained a significantly higher proportion of LTR retrotransposons compared to collinear regions (SYN). By integrating fiber transcriptome data, the study shed light on conserved regulatory patterns in both diploid and tetraploid cottons, as well as the impact of genomic structural variations on gene expression regulation.

Moreover, the team identified genetic loci associated with fiber quality, such as presence-absence variations (PAVs) and expression quantitative trait loci (eQTLs). These loci demonstrated significant phenotypic correlations with key quality traits, including fiber length, strength, and elongation. The findings offer novel strategies for enhancing cotton fiber quality.

Cotton fibre quality improvement applications (Li et al., 2021). Cotton Fibre Quality Improvement Applications (Li et al., 2021)

Disease and Stress Resistance

During its growth, cotton faces threats from various pathogens, and genome sequencing offers a crucial avenue for discovering genes that combat pathogens like Verticillium wilt. Through in-depth studies of the cotton genome, researchers can identify genes associated with disease resistance and elucidate their mechanisms, thereby enhancing cotton's ability to fend off pathogens. Additionally, amidst climate change, cotton also contends with environmental stresses such as drought and salinity. Genome-assisted breeding techniques provide effective means to bolster cotton's drought and salinity tolerance. By screening genes linked to stress resistance and introducing them into cotton varieties, or by regulating cotton's own stress-resistant genes, breeders can develop cotton varieties adapted to harsh environments.

Yang and colleagues employed Oxford Nanopore long-read sequencing and Hi-C technology to assemble high-quality genomes of two closely related cotton species, Gossypium thurberi and Gossypium davidsonii, shedding light on cotton's stress resistance mechanisms. Despite the relative conservation of gene content in the D genome, the study revealed significant diversification in gene order, structure, gene family expansion, three-dimensional chromatin architecture, and stress-related traits. Hi-C heatmaps accurately localized centromeres, which were found to evolve at a significantly faster rate than chromosome arms. Furthermore, the research uncovered genetic networks associated with salt stress tolerance and Verticillium wilt resistance. For instance, overexpressing the WRKY33 gene from Gossypium thurberi in upland cotton significantly enhanced its resistance to Verticillium wilt. Three-dimensional genome analysis demonstrated that diverse interactions between proximal and distal regulatory regions shape the expression regulation of defense-related genes. These findings offer new insights into leveraging wild cotton resources for crop improvement, aiding in the enhancement of cotton's stress resistance.

Cotton genetic modification for disease and stress resistance (Yang et al., 2021). Cotton Genetic Modification for Disease and Stress Resistance (Yang et al., 2021)

Cotton Genome Sequencing Breeding: Dual Benefits for Economy and Environment

The advancements in breeding driven by genetic information from cotton genome sequencing have profoundly influenced both the economic and environmental aspects of the cotton industry. Economically, superior cotton varieties cultivated through genetic information breeding offer higher yields and superior quality, boosting the income of cotton farmers. These high-quality varieties also command greater market competitiveness, leading to increased profits for enterprises and fostering sustainable growth within the cotton sector.

From an environmental perspective, genetic information breeding plays a pivotal role in reducing pesticide usage. Traditionally, cotton farming has relied heavily on pesticides to combat pests and diseases, which not only escalates production costs but also causes severe environmental pollution. However, disease-resistant and pest-resistant cotton varieties developed through genome sequencing and genetic improvement diminish reliance on pesticides, thereby mitigating soil, water, and air contamination, and safeguarding the ecological environment. Furthermore, amidst climate change, breeding cotton varieties adapted to adverse conditions such as drought and salinity enhances their resilience, mitigating yield losses due to natural disasters and ensuring stable cotton production. This provides robust support for achieving sustainable cotton production.

Future Directions for Cotton Genome Sequencing

Building on the remarkable achievements in cotton genome sequencing, we stand at a new starting point, gazing towards a broader horizon of future development. While current research findings have laid a solid foundation for progress in the cotton industry, the rapid advancement of science and technology, coupled with new challenges facing the cotton sector, necessitate continuous exploration and innovation. In the future, cotton genome sequencing will evolve towards greater precision, efficiency, and comprehensiveness to better meet the sustainable development needs of the cotton industry.

CRISPR Editing for Trait Improvement

With the ongoing development of gene editing technologies, CRISPR-based genome editing has demonstrated immense potential in cotton genetic improvement. CRISPR technology offers advantages such as simplicity of operation, high efficiency, and strong precision, enabling precise editing and modification of the cotton genome. By leveraging CRISPR, we can target specific traits for improvement, performing operations like knocking out, inserting, or replacing key genes controlling fiber development. This allows us to regulate fiber quality traits such as length, strength, and fineness. Simultaneously, editing genes associated with disease and stress resistance can endow cotton with stronger resilience, enabling it to better adapt to diverse environmental conditions. This precise trait improvement method will significantly accelerate the cotton variety breeding process, enhance breeding efficiency, and bring about new breakthroughs for the cotton industry.

Global Seq Collaboration for Wild Cotton Biodiversity

Wild cotton relatives serve as a vital reservoir of cotton genetic resources, carrying a wealth of genetic variations that may harbor genes of significant value for cotton improvement. However, many wild cotton relatives face extinction risks due to ecological degradation and human activities. Through global collaboration, researchers from various countries can share resources and technologies to conduct comprehensive genome sequencing and analysis of wild cotton relatives. This not only aids in deepening our understanding of cotton's evolutionary history and genetic diversity but also enables the discovery of more valuable genetic resources. Moreover, conserving the biodiversity of wild cotton relatives contributes to maintaining ecological balance and provides a robust foundation for the sustainable development of cotton.

Conclusion

Cotton genome sequencing, a landmark technology, has revolutionized the cotton industry from "field to fabric." Over the years, sequencing methodologies have evolved from arduous early explorations to today's technological advancements, enabling us to unravel the mysteries of the cotton genome more comprehensively and deeply. In terms of applications, genome sequencing has yielded remarkable results in improving fiber quality, enhancing disease and stress resistance, and has brought substantial economic and environmental benefits to the cotton industry.

Looking ahead, directions such as CRISPR-based genome editing technologies and global collaboration in sequencing wild cotton relatives promise to inject new vitality into the further development of the cotton industry. We have every reason to believe that with the continuous advancement and refinement of cotton genome sequencing technologies, the cotton industry will embrace an even brighter future, making greater contributions to the development of human society.

References

  1. Wang K, Wang Z, Li F, Ye W, Wang J, Song G, Yue Z, Cong L, Shang H, Zhu S, Zou C, Li Q, Yuan Y, Lu C, Wei H, Gou C, Zheng Z, Yin Y, Zhang X, Liu K, Wang B, Song C, Shi N, Kohel RJ, Percy RG, Yu JZ, Zhu YX, Wang J, Yu S. "The draft genome of a diploid cotton Gossypium raimondii." Nat Genet. 2012; 44(10):1098 - 103. https://doi.org/10.1038/ng.2371
  2. Huang G, Bao Z, Feng L, Zhai J, Wendel JF, Cao X, Zhu Y. "A telomere-to-telomere cotton genome assembly reveals centromere evolution and a Mutator transposon-linked module regulating embryo development." Nat Genet. 2024; 56(9):1953 - 1963. https://doi.org/10.1038/s41588-024-01877-6
  3. Pei L, Huang X, Liu Z, Tian X, You J, Li J, Fang DD, Lindsey K, Zhu L, Zhang X, Wang M. "Dynamic 3D genome architecture of cotton fiber reveals subgenome-coordinated chromatin topology for 4-staged single-cell differentiation." Genome Biol. 2022; 23(1):45. https://doi.org/10.1186/s13059-022-02616-y
  4. Li J, Yuan D, Wang P, Wang Q, Sun M, Liu Z, Si H, Xu Z, Ma Y, Zhang B, Pei L, Tu L, Zhu L, Chen LL, Lindsey K, Zhang X, Jin S, Wang M. "Cotton pan-genome retrieves the lost sequences and genes during domestication and selection." Genome Biol. 2021; 22(1):119. https://doi.org/10.1186/s13059-021-02351-w
  5. Yang Z, Ge X, Li W, Jin Y, Liu L, Hu W, Liu F, Chen Y, Peng S, Li F. "Cotton D genome assemblies built with long-read data unveil mechanisms of centromere evolution and stress tolerance divergence." BMC Biol. 2021; 19(1):115. https://doi.org/10.1186/s12915-021-01041-0
For research purposes only, not intended for clinical diagnosis, treatment, or individual health assessments.
Send a MessageSend a Message

For any general inquiries, please fill out the form below.

For research purposes only, not intended for clinical diagnosis, treatment, or individual health assessments.
We provide the best service according to your needs Contact Us
OUR MISSION

CD Genomics is propelling the future of agriculture by employing cutting-edge sequencing and genotyping technologies to predict and enhance multiple complex polygenic traits within breeding populations.

Contact Us
Copyright © CD Genomics. All Rights Reserved.
Top