When Is Micro-C XL a Better Fit Than Hi-C for Fungal Genomes?
Summary
Micro-C XL vs Hi-C fungal genome study design turns on a single biological constraint: fungal genomes are compact and gene-dense, with chromosomally interacting domains (CIDs) that span 0.5–10 kb and typically contain between zero and eight genes — a scale at which restriction enzyme-based Hi-C resolution is often insufficient to resolve individual domain boundaries and short-range regulatory contacts. This guide provides a decision framework for microbiology and fungal genomics teams choosing between Hi-C and Micro-C XL, covering genome-size and resolution considerations, the "better fit when / not ideal when" decision framework, protocol adaptation for fungal cell walls, and QC benchmarks specific to compact genomes.
Key Takeaways
- Micro-C XL was originally developed and validated in Saccharomyces cerevisiae and Schizosaccharomyces pombe (Hsieh et al., Nature Methods, 2016), making fungi the primary reference organisms for this method — not an adaptation case
- CIDs in budding yeast span 0.5–10 kb, or 4–50 nucleosomes, and typically contain 0–8 genes; their boundaries are located at promoters of highly transcribed genes where RSC chromatin remodeling complex and cohesin loading factor bind (Hsieh et al., Molecular Cell, 2015)
- Micro-C XL improves upon original Micro-C through dual crosslinking (formaldehyde + long-range crosslinker DSG) and isolation of insoluble chromatin, substantially increasing signal-to-noise ratio and enabling recovery of both CIDs and higher-order features such as centromere clustering
- Restriction enzyme-based Hi-C produces fragment sizes of approximately 1–5 kb in yeast (depending on cutter frequency), which blurs CID structure at the 0.5–10 kb scale — the critical resolution window for fungal gene-regulatory interpretation
- For fungal projects where CID mapping is not the primary goal — A/B compartment analysis, large-scale domain comparison across species — Hi-C remains appropriate and cost-effective
- Cell wall digestion to generate spheroplasts is a required pre-processing step for all fungal chromatin conformation methods; incomplete digestion is the primary failure mode specific to fungal applications
- For non-model fungal pathogens (Candida albicans, Aspergillus fumigatus, Neurospora crassa), reference genome quality and annotation completeness should be assessed before committing to CID boundary calling
The "Better Fit When / Not Ideal When" Decision Framework
Before evaluating protocol details, the most important question is whether nucleosome-level resolution is required by the biological question. Micro-C XL and Hi-C are not interchangeable upgrades — they answer different questions, and the choice between them for a fungal project should be made at the study design stage, not after library preparation.
Figure 1. The research question — CID resolution vs compartment architecture — is the primary decision point between Micro-C XL and Hi-C in fungal genome projects.
Choose Micro-C XL when your research question requires any of the following:
- Mapping CIDs at gene-scale resolution: CIDs in S. cerevisiae and S. pombe span 0.5–10 kb, a range that is structurally invisible to restriction enzyme-based Hi-C at its typical fragment resolution in compact genomes
- Resolving NFR-defined domain boundaries: CID boundaries in yeast are specifically located at nucleosome-depleted regions (NDRs) at promoters of highly transcribed genes — a boundary definition that requires sub-nucleosome positional accuracy to map reliably
- Detecting short-range regulatory contacts: promoter–promoter interactions and gene-loop structures in the 200 bp–4 kb range are only accessible at MNase-level digestion resolution
- Characterizing chromatin fiber folding: tri- and tetra-nucleosome motifs contributing to chromatin fiber compaction are detectable only at nucleosome resolution
Hi-C is sufficient when:
- The primary goal is A/B compartment identification: compartment analysis does not require sub-kilobase resolution; Hi-C at standard depth provides accurate compartment calls in fungal genomes
- The project compares chromatin organization across multiple fungal species: maintaining method consistency across species is more important than maximizing resolution in any one; Hi-C's lower cost enables larger species panels
- Budget constraints limit sequencing depth: Micro-C XL requires sufficient depth to cover the genome at nucleosome resolution; Hi-C achieves equivalent compartment-level information at lower cost
- The project is a pilot feasibility study: Hi-C is an appropriate first-pass method before committing to full nucleosome-resolution profiling
Understanding Hi-C principles before choosing between methods is important context. For a technical overview of Hi-C technology principles including crosslinking, restriction enzyme digestion, and ligation logic, see our introduction resource. If your project falls into the Hi-C category, our Hi-C sequencing service supports a range of fungal and other compact genome projects.
Why Fungal Genome Biology Favors Nucleosome-Level Resolution
The argument for Micro-C XL in fungal research is not a matter of preference or technical elegance — it follows directly from the physical dimensions of the structures being studied. To understand why, it helps to compare the scale of the regulatory architecture in fungi with the resolution ceiling imposed by restriction enzyme-based Hi-C.
CIDs in Yeast: Size, Boundaries, and Why They're Invisible to Standard Hi-C
The seminal Micro-C study (Hsieh et al., Molecular Cell, 2015) identified abundant self-associating chromatin domains in S. cerevisiae — structures that had not been observed in any prior Hi-C or 3C analysis of yeast. These CIDs span 0.5–10 kb (equivalent to 4–50 nucleosomes) and typically contain between zero and eight genes. They were consistently detected across 21 independent biological replicates, across three different wild-type strain backgrounds (S288C, W303, and a plasmid-encoded histone strain), and in a distantly related hemiascomycete, Kluyveromyces lactis — evidence for their biological reality and conservation rather than technical artifact.
The reason these domains had been invisible to Hi-C is straightforward arithmetic. A 6-base cutter restriction enzyme (HindIII, EcoRI) cuts the S. cerevisiae genome approximately every 4,000 bp, generating ~3,000 fragments across the 12 Mb genome. A 4-base cutter (MboI, DpnII) cuts more frequently, but still generates fragments averaging 256 bp only in ideal sequence contexts — in practice, fragment size distribution in yeast Hi-C data shows a peak around 500–1,000 bp. This fragment resolution means that two loci separated by less than ~1 kb cannot be distinguished in a Hi-C contact map; interactions within a single CID spanning one or two genes simply cannot be resolved.
MNase digestion in Micro-C XL produces fragments at the nucleosome unit — approximately 147 bp wrapped DNA plus linker DNA, targeting a product of ~200 bp. This provides a 5–10× improvement in resolution over 4-base cutter Hi-C, and an approximately 20× improvement over 6-base cutter Hi-C, bringing the resolution directly into the CID size range of 0.5–10 kb.
CID boundaries are not arbitrary: they are specifically associated with NDRs at promoters of highly transcribed genes, regions of rapid histone turnover, and binding sites for the RSC ATP-dependent chromatin remodeling complex and the cohesin loading complex (Scc2-Scc4). This boundary definition is functionally informative — CID borders mark transition points between distinct transcriptional and chromatin states, and mapping them accurately is the prerequisite for understanding how gene-scale chromatin organization contributes to transcriptional regulation in fungi.
Figure 2. MNase-based digestion in Micro-C XL provides uniform ~200 bp fragment resolution, enabling CID boundary detection at NFR positions that Hi-C cannot resolve in compact fungal genomes.
The Dual Crosslinking Advantage: Why Micro-C XL Outperforms Single-Crosslinker Micro-C in Fungi
The "XL" in Micro-C XL refers to a specific protocol modification — the addition of a long-range protein-protein crosslinker (EGS or DSG) prior to the standard formaldehyde crosslinking step. Formaldehyde is a "zero-length" crosslinker that primarily captures protein-DNA contacts and very short-range protein-protein interactions; it does not efficiently crosslink proteins separated by more than a few angstroms. As a result, original Micro-C maps captured short-range CID structure well but poorly recovered higher-order features — centromere clustering, telomere associations — that require crosslinking of proteins in indirect spatial proximity.
The addition of DSG or EGS before formaldehyde extends the effective crosslinking radius, capturing protein-protein contacts at distances up to approximately 11–16 Å. Combined with isolation of the insoluble chromatin fraction (pellet chromatin rather than soluble chromatin after cell lysis), Micro-C XL substantially improves signal-to-noise ratio by physically enriching for chromatin-associated ligation products and depleting free DNA background. In S. pombe, Micro-C XL maps show both CIDs and centromere clustering — the full range of chromosome folding features from nucleosome to genome scale — in a single experiment, a capability that neither Hi-C nor original Micro-C could achieve in yeast.
This technical improvement matters practically for project design: for fungal researchers requiring both gene-scale (CID) and chromosome-scale (centromere clustering, telomere associations) contact information, Micro-C XL eliminates the need for two separate experiments with different crosslinking conditions.
When Hi-C Is Still the Right Choice for Fungal Projects
Not every fungal chromatin question requires nucleosome-level resolution, and not every research group has the sample quantity or sequencing budget to justify Micro-C XL. Understanding Hi-C's genuine strengths in the fungal context — rather than treating it as a lower-quality default — is important for selecting the right method.
For a direct technical comparison of Hi-C and Micro-C across key parameters, see our Hi-C vs Micro-C method comparison resource. Our Hi-C service supports fungal genome projects requiring compartment-level or large-scale domain analysis.
Compartment Analysis and Large-Scale Domain Comparisons
A/B compartment identification — the segregation of transcriptionally active (A) and inactive (B) chromatin into distinct nuclear territories — does not require sub-kilobase resolution. The mathematical basis for compartment calling (eigenvector decomposition of the contact matrix) operates at a scale of 25–100 kb bins in mammalian genomes and at correspondingly scaled bins in compact fungal genomes. Hi-C provides sufficient data for this analysis at lower cost than Micro-C XL, because compartment calls depend on global contact patterns rather than local, high-resolution boundary precision.
For fungal projects focused on questions such as: "do active genes cluster in a defined nuclear compartment in Aspergillus under stress conditions?" or "does chromatin compartmentalization change during morphological switching in Candida?" — Hi-C at moderate depth provides interpretable, cost-effective data.
Multi-Species Fungal Genomics: When Consistency Matters More Than Resolution
Cross-species chromatin architecture comparisons are a growing area in fungal genomics, driven by interest in how genome organization diverges between pathogenic and non-pathogenic yeasts, or between sexual and asexual fungal species. For these comparative studies, method consistency across species is a higher priority than maximizing resolution in any one organism. Running Hi-C under identical conditions in six fungal species produces a directly comparable dataset; running Micro-C XL in some and Hi-C in others introduces a resolution confound that complicates cross-species interpretation.
Hi-C also requires less cell input than Micro-C XL for multi-species panels where some organisms grow poorly or provide limited biomass. For studies prioritizing breadth of species coverage over depth of resolution, Hi-C is the pragmatic choice.
Protocol Considerations for Fungal Cell Wall Digestion and Sample Preparation
Every chromatin conformation capture method in fungi — Hi-C, Micro-C, Micro-C XL, and related approaches — requires a step absent from mammalian cell protocols: removal of the fungal cell wall before chromatin can be accessed for crosslinking or digestion. This spheroplast generation step is the most fungal-specific and most variable element of the entire workflow, and it is the primary source of sample failure in fungal chromatin studies.
Figure 3. Spheroplast preparation is the fungal-specific rate-limiting step; incomplete cell wall digestion is the most common failure mode in fungal Micro-C XL projects.
Spheroplast Preparation: The Rate-Limiting Step for Fungal Chromatin Capture
The fungal cell wall is a complex, multi-layered structure of glucans, chitin, and mannoproteins whose composition varies substantially across species. Saccharomyces cerevisiae and Schizosaccharomyces pombe cell walls are well characterized, and zymolyase (a β-1,3-glucanase preparation from Arthrobacter luteus) or lyticase (a similar enzyme preparation) digests them efficiently under standard conditions. For Candida albicans, cell wall composition changes depending on growth phase and morphological state — yeast-form and hyphal cells have different cell wall architectures, and zymolyase efficiency differs between them. For filamentous fungi such as Aspergillus fumigatus or Neurospora crassa, cell wall composition includes higher chitin content, and chitinase supplementation or mechanical disruption may be required.
Spheroplast formation efficiency should be verified microscopically before crosslinking proceeds. The standard check is osmotic lysis: spheroplasts placed in hypotonic solution lyse immediately (visible loss of cell refractility), while intact-walled cells do not. A common error is assuming that spheroplast formation is complete based on enzyme concentration and incubation time alone, without visual verification — this error propagates silently into the downstream chromatin capture step and results in low-complexity libraries with poor ligation efficiency.
Osmotic stability of spheroplasts during the digestion step is maintained by including a compatible osmolyte (typically 1.2 M sorbitol for S. cerevisiae) in all buffers from zymolyase addition through crosslinking. Departures from osmotic balance — including inadvertent dilution during buffer changes — cause spheroplast lysis and release of genomic DNA into the soluble fraction, contaminating the chromatin preparation.
MNase Titration in Compact Fungal Genomes
Once spheroplasts are crosslinked, MNase digestion produces the mononucleosomal fragments that define Micro-C XL's resolution advantage. In S. cerevisiae, the nucleosome repeat length (NRL) is approximately 165 bp — shorter than in mammals (~200 bp) — meaning that the MNase titration optimized for mammalian or Drosophila Micro-C may over-digest fungal chromatin. Over-digestion produces sub-mononucleosomal fragments that reduce unique ligation products and shift the contact distance distribution toward artifactually short-range contacts.
MNase titration should be performed empirically for each fungal species (and for different growth conditions or morphological states within the same species) by testing 2–3 enzyme concentrations and assessing the resulting fragment size distribution on a Bioanalyzer or TapeStation. A successful titration shows a clear mononucleosomal peak at approximately 150–165 bp (for S. cerevisiae) or the species-appropriate NRL, with minimal sub-mononucleosomal degradation products below 100 bp and minimal di-nucleosomal contamination above 300 bp.
For Candida albicans, where nucleosome positioning data from ATAC-seq studies (Price et al., mBio, 2019) demonstrates active chromatin profiling is feasible in this pathogen, the nucleosome organization is broadly similar to S. cerevisiae, and MNase conditions validated in budding yeast provide a reasonable starting point. However, the heterozygous diploid genome of C. albicans SC5314 (approximately 14 Mb haploid equivalent, with complex homolog polymorphism) introduces additional mapping challenges not present in haploid laboratory yeast strains. Our Micro-C XL service includes species-specific protocol optimization for non-standard fungal organisms.
Reference Genome Requirements and Analysis Considerations for Non-Model Fungi
The interpretability of Micro-C XL data from fungal pathogens is ultimately bounded by the quality of the available reference genome and its annotation. The biological signal may be present in the sequencing data, but if the reference genome is fragmented at the contig level or lacks comprehensive TSS annotation, CID boundary calling will produce positional estimates that cannot be meaningfully interpreted in terms of gene regulation.
What Reference Genome Quality Is Required for CID Boundary Calling
CID boundary calling is performed by identifying positions in the genome that show strong contact insulation — loci where interactions crossing that position are significantly depleted relative to local contact density. This calculation requires: a chromosome-level or near-chromosome-level assembly (contig N50 of at least several hundred kb), accurate gene model annotation to assign boundaries to regulatory elements (NDRs, promoters), and reliable repeat annotation to distinguish boundary calls at genuine regulatory elements from artifactual insulation at repetitive regions.
For S. cerevisiae (SGD R64-3-1) and S. pombe (PomBase ASM294v2), these requirements are fully met — both genomes have chromosome-level assemblies, comprehensive gene models, and well-characterized repeat landscapes. For Candida albicans SC5314 (Assembly 22, CGD), the reference genome meets the assembly quality standard, but the heterozygous diploid structure introduces phasing complexity — contacts mapped to the A allele and B allele of homologous chromosomes are not separable without haplotype-resolved sequencing data, and standard Micro-C XL analysis treats the diploid genome as a haploid equivalent. This simplification is acceptable for most CID boundary calling applications but should be noted when interpreting allele-specific chromatin organization. For Aspergillus fumigatus Af293 and Neurospora crassa OR74A, chromosome-level assemblies are available and adequate for CID analysis.
For less-established fungal species or specific clinical isolates without chromosome-level assemblies, Micro-C XL data can still be generated and analyzed at the contact matrix level, but CID boundary assignment should be reported with appropriate uncertainty, and interpretation should be restricted to regions with high assembly confidence.
Analysis Tools for Compact Genomes: Bin Size, Normalization, and Boundary Calling
Standard Hi-C and Micro-C analysis pipelines (HiCExplorer, cooltools, Juicer) are applicable to fungal genomes, but default parameters require adjustment for compact genome size. The most important parameter is bin size: tools designed for mammalian genomes default to 25–100 kb bins, which at this size would compress the entire S. cerevisiae genome into fewer than 500 bins — too few to resolve CID structure meaningfully.
For S. cerevisiae (~12 Mb) and similarly sized fungal genomes, bin sizes of 200–500 bp are appropriate for CID-level analysis. For compartment-level analysis, 1–5 kb bins provide sufficient resolution while maintaining statistical robustness. These bin sizes require proportionally higher sequencing depth per genome than mammalian Hi-C analyses — the absolute number of cis contacts per bin must be sufficient for normalization and insulation scoring, regardless of absolute genome size.
Normalization by iterative correction (ICE) or Knight-Ruiz balancing should be applied before boundary calling; raw contact matrices from compact fungal genomes show strong distance-decay biases that, if uncorrected, produce artifactual boundary calls at regions of high fragment density. For downstream interpretation of insulation scores and boundary strength, the Hi-C data analysis resource provides detailed guidance on contact matrix normalization, compartment calling, and boundary detection workflows applicable to compact genomes. For Hi-C heatmap interpretation including reading interaction matrices and identifying domain structures visually, see our heatmap guide. Our epigenomic data analysis service supports custom Micro-C XL analysis pipelines for non-standard genomes and compact genome configurations.
Planning a Micro-C XL or Hi-C project for a fungal or other compact genome? Our team can advise on reference genome suitability, bin size parameters, and CID boundary calling strategy before you commit samples. Contact us to discuss your project →
Frequently Asked Questions
1) Was Micro-C XL originally developed for fungal genomes?
Yes — fungi are the primary validation organisms for Micro-C XL, not an adaptation case. The original Micro-C paper (Hsieh et al., Molecular Cell, 2015) established the method in S. cerevisiae, identifying CIDs spanning one to five genes across 21 biological replicates. Micro-C XL (Hsieh et al., Nature Methods, 2016) extended the approach with dual crosslinking and insoluble chromatin isolation, validated simultaneously in S. cerevisiae and S. pombe. The method was specifically designed to address the resolution limitations of Hi-C in the compact yeast genome — making fungi the organism class for which Micro-C XL is most directly validated.
2) What resolution does Hi-C achieve in yeast or fungal genomes?
Resolution in Hi-C depends on restriction enzyme choice and sequencing depth. In S. cerevisiae, HindIII (6-cutter) generates approximately 3,000 fragments genome-wide, with average fragment size of approximately 4,000 bp — too coarse to resolve CIDs that span 0.5–10 kb. MboI or DpnII (4-base cutters recognizing GATC) cut more frequently, but typical fragment sizes still average 500–1,000 bp in yeast, placing the practical resolution ceiling at approximately 1–2 kb. CIDs in the 0.5–3 kb range — spanning one to three genes — are invisible at this resolution. Micro-C XL, with MNase-generated ~200 bp fragments, resolves contacts in the 200 bp–4 kb range, directly matching the CID size spectrum in yeast.
3) What are chromosomally interacting domains (CIDs) in fungi and why do they matter?
CIDs are self-associating chromatin domains first identified in S. cerevisiae by Micro-C analysis (Hsieh et al., 2015). They span 0.5–10 kb (4–50 nucleosomes) and contain between zero and eight genes. Their boundaries are specifically located at NDRs coinciding with promoters of highly transcribed genes, bound by the RSC chromatin remodeling complex and the cohesin loading factor Scc2–Scc4. CIDs define the basic unit of transcriptional compartmentalization in yeast — genes within the same CID tend to share regulatory context, while genes in adjacent CIDs are relatively insulated from each other's regulatory environment. Understanding CID organization is therefore directly relevant to questions about gene co-regulation, chromatin remodeling, and transcription factor access in fungi.
4) How does cell wall digestion affect Micro-C XL quality in fungal samples?
Cell wall digestion to generate spheroplasts is the fungal-specific rate-limiting step. Incomplete digestion — where a fraction of cells retain intact walls — produces mixed chromatin preparations in which some cells are not effectively crosslinked or digested by MNase, reducing library complexity and reproducibility. The failure is silent: libraries generated from incompletely spheroplasted cells pass fragment size QC (MNase products look normal) but show low unique ligation rates and poor contact distance distributions. Verification of spheroplast formation by osmotic lysis assay before crosslinking is essential. For Candida albicans hyphal forms and Aspergillus species, additional cell wall digestion optimization beyond standard zymolyase conditions is typically required.
5) Can Micro-C XL be used for fungal pathogens like Candida albicans or Aspergillus fumigatus?
Yes, with species-specific protocol adaptation. Candida albicans has been studied by ATAC-seq and chromatin profiling approaches (Price et al., mBio, 2019), confirming that nucleosome organization in this pathogen is accessible to chromatin capture methods. The key adaptations for C. albicans Micro-C XL are: optimization of zymolyase concentration and incubation conditions for the specific growth phase and morphological form; awareness of the heterozygous diploid genome structure for alignment and boundary calling; and selection of the high-quality SC5314 A22 reference genome assembly. For Aspergillus fumigatus, the higher chitin content of the cell wall typically requires supplemented digestion conditions. A chromosome-level reference genome (Af293) is available and suitable for CID analysis.
6) When is Hi-C a better choice than Micro-C XL for fungal chromatin studies?
Hi-C is the better choice when: (1) the primary question is A/B compartment identification or large-scale domain architecture, which does not require nucleosome-level resolution; (2) the project involves multiple fungal species and cross-species consistency is more important than per-species resolution; (3) budget or sample constraints limit sequencing depth; or (4) the project is a pilot study to establish technical feasibility before investing in full Micro-C XL profiling. Hi-C is not "worse" than Micro-C XL — it is optimized for different length scales of chromosome organization. The decision should be driven by the biological question, not by the assumption that higher resolution is always better.
7) What bin size and analysis tools are recommended for Micro-C XL data from compact fungal genomes?
For CID-level analysis in S. cerevisiae (~12 Mb) and similarly compact fungal genomes, bin sizes of 200–500 bp are appropriate — much smaller than the default parameters in tools designed for mammalian genomes. HiCExplorer and cooltools support user-defined bin sizes and are both suitable for compact genome analysis. For compartment-level analysis, 1–5 kb bins provide sufficient resolution. Iterative correction (ICE) or Knight-Ruiz normalization should be applied before boundary calling or insulation scoring. Sequencing depth requirements scale with resolution: at 200 bp bins in a 12 Mb genome, achieving 5–10× average coverage per bin requires approximately 50–100 million unique cis read pairs, comparable to or slightly below mammalian Hi-C depth requirements at 5 kb bins.
References
- Hsieh, T.H. et al. Micro-C XL: assaying chromosome conformation from the nucleosome to the entire genome. Nature Methods, 13, 1009–1011 (2016). https://doi.org/10.1038/nmeth.4025
- Hsieh, T.H. et al. Mapping nucleosome-resolution chromosome folding in yeast by Micro-C. Molecular Cell, 62, 90–104 (2015). https://pubmed.ncbi.nlm.nih.gov/26119342/
- Price, R.J. et al. Chromatin profiling of the repetitive and nonrepetitive genomes of the human fungal pathogen Candida albicans. mBio, 10, e01376-19 (2019). https://doi.org/10.1128/mBio.01376-19
- Hoencamp, C. & Rowland, B.D. A user's guide to Hi-C analysis. Briefings in Functional Genomics, 22, 1–12 (2023). https://doi.org/10.1093/bfgp/elac019
- Ramírez, F. et al. High-resolution TADs reveal DNA sequences underlying genome organization in flies. Nature Communications, 9, 189 (2018). https://doi.org/10.1038/s41467-017-02525-w
- Abdennur, N. & Mirny, L.A. Cooler: scalable storage for Hi-C data and other genomically labeled arrays. Bioinformatics, 36, 311–316 (2020). https://doi.org/10.1093/bioinformatics/btz540

