Driven by technological advances, the quest for precision medicine has ushered the ‘omics’ era into clinical practice.1,2 Increasingly, comprehensive sequencing approaches are being deployed to support diagnoses and/or for the detection of actionable findings.3,4 In most clinical diagnostic laboratories, an intriguing dichotomy has emerged in which innovative genomic studies have largely centered on detecting/excluding single nucleotide variants (SNVs) while evaluation of structural variants (SVs) remains most often performed by traditional cytogenetic approaches. Despite a rich history, cytogenetic studies – including chromosome banding analysis (CBA), Fluorescence In Situ Hybridization (FISH) and, more recently, chromosomal microarrays (CMA) – have well-described, method-specific diagnostic ‘blind spots’ which remain largely unaddressed (Table 1).5 There is thus a need to improve our capacity to robustly and reliably detect clinically relevant structural variants which may not be captured by more conventional methodologies.
Table 1. Comparison of Methodologies for Structural Variant Detection
| Chromosome Banding Analysis (CBA) | Fluorescent In Situ Hybridization (FISH) | Chromosomal Microarray Analysis (CMA) | Optical Genome Mapping (OGM) | |
| Coverage: | Genome-Wide | Targeted | Genome-Wide | Genome-Wide |
| Specimen-Type | Viable Cells | Viable or Fixed Cells | Viable or Fixed Cells | Viable Cells(*) |
| Analysis Type: | Single Cell | Single Cell | Bulk Specimen | Bulk Specimen |
| Resolution: | > 5 ~ 10 Mb | ~70 kb – 1 Mb | > 5 kb ~ 200 kb | > ~500 bp ~ 5 kb |
| Sensitivity: | ~10%(**) | ~2-5% | ~10-15% | 5%-10% |
| SVs types (***): | CNV + SV | CNV + SV | CNV +/- AOH(****) | CNV + SV + AOH |
(*) Viable cells can be frozen. Specifically designed DNA extraction protocol facilitates the isolation of ultra-high molecular weight (UHMW) DNA, 750ng of which is used each assay.
(**) While 20 metaphases are routinely reported for somatic-based testing, many more are often screened and thus the sensitivity may be lower than indicated
(***) For the purpose of this table, SV is delineated into copy number variants (CNVs) versus structural variations (SV) in which there is a change in the position of the genome.
(****) Only those chromosomal microarrays which incorporate SNPs permit the detection of region of absence or copy neutral loss of heterozygosity (AOH/CN-LOH),
Optical Genome Mapping (OGM) is an imaging technology which evaluates the fluorescent labeling pattern of individual DNA molecules to perform an unbiased assessment of genome-wide structural variants down to 500 base pairs (bp) in size,6 a resolution that far exceeds conventional cytogenetic approaches. OGM relies on a specifically designed extraction protocol facilitating the isolation of ultra-high molecular weight (UHMW) DNA. This protocol utilizes a paramagnetic disk purposed with trapping DNA for wash steps thereby reducing sheering forces present in standard column-based extraction methods. The result is DNA fragments of ~150 kilobases (kbp) to megabases (Mbp) in size, ~5-10x longer than the average fragment size from conventional DNA isolations techniques. DNA is fluorescently labeled via covalent modification at CTTAAG hexamer motifs, generating genome-wide density of approximately 14-17 labels per 100kb in sequence specific patterns.7 Labeled DNA is loaded on silicon chips composed of hundreds of thousands of parallel nanochannels where individual DNA molecules are linearized, imaged, and digitized. The specific labeling profile of individual DNA molecules – including spacing and pattern of hexamers labels – are subsequently grouped based on similarity, producing ~500 kb to megabase-sized consensus maps which can be compared in silico to the expected labeling pattern of a reference genome (Figure 1A). In essence, this imaging technology converts DNA into a “barcode” whose labeling profile and characteristics can sensitively and specifically resolve copy number and structural variation without the need for sequence level data (Figure 1A). The quality of the DNA – including both size and labeling characteristics – as well as the number of images captured influences genome-wide coverage. Presently, each flow cell, which can accommodate a single specimen – can generate up to 5000 Gigabase pairs (Gbp) of raw data, nearly a four-fold improvement from prior generations of this technology, achieving a maximum theoretical genome-wide coverage of ~1250x. Commercially developed informatic software supports several analyses including: de novo structural variant analysis for typical germline assessments (>~80x-coverage; requiring > ~400Gbp data collection) or ‘Rare Variant Analysis (RVP)’ for somatic assessment down to a ~5% variant allele fraction. (>~340x coverage; requiring >~1500 Gbp data). Both algorithms facilitate the detection of a wide array of structural variants; from copy number gains/losses to balanced/unbalanced translocations and insertions to inversions (Figure 1B). Notably, this assay can be completed in approximately ~4-days, the majority of which involves imaging and data analysis and thus requires limited hands-on technologist time (4~5 hours per run of 8~15 specimens).

The end-to-end solution provided by OGM (from DNA isolation to data analysis) has facilitated a flurry of studies confirming the clinical diagnostic potential of this approach. While necessarily skewed towards hematologic malignancies, at the time of this publication over 1000 cases have been profiled by OGM, encompassing both myeloid and lymphoid neoplasms.8-13 These studies have collectively identified several recurrent themes. (1) OGM can reliably, and reproducibly detect all forms of clinically significant SVs that have been reported by standard of care testing Specifically, OGM has achieved >95% concordance with SVs detected by CBA, with discordances driven primarily by low-level whole chromosome aneuploidies (present in <5% of nuclei), which may be beyond the reliable detection threshold of this assay.14,15 (2) Present diagnostic standards for SV detection in hematologic malignancies fail to recognize clinically relevant aberrations at an alarming rate. In their study of MDS, Yang et al., noted that OGM identified clinically relevant SVs in 34% (34/100) of patients which were cytogenetically cryptic and therefore undetectable by CBA. Notably, in 17% (17/100) of their cohort, these findings would have changed the risk assessment, 15 a finding which was recapitulated in studies of AML in which OGM detected an additional 12-23% of SVs which would have altered clinical management or rendered patients eligible for clinical trials.14,16 While some of the inherent vulnerabilities of CBA could be supplemented by FISH-based studies (such as the detection of KMT2A and NUP98 rearrangements), OGM provides what both FISH and karyotype collectively can perform within a single assay.7 Similarly, in lymphoid cancers, OGM was able to assign major cytogenetic risk groups in 78% (32/41) of T and B-lineage precursor lymphoblastic leukemias, 20% greater than what can be accomplished by combined use of CBA, FISH and multiplex-ligation dependent probe amplification (MLPA).17 (3) Gene-level resolution of OGM improves understanding of cancer genomes and identifies clinically meaningful molecular subgroups. In 67% of myeloid malignancies studied, OGM analysis improved the resolution of breakpoints or clarified cytogenetic aberrations reported in by CBA. The enhanced resolution has led to expanding the capabilities of cytogenetic approaches to intragenic alterations, such as Partial Tandem Duplications (PTD) of KMT2A15, or PAX5.18
The promise of OGM for improved SV detection is not restricted to cancer-based testing. In germline-settings, where copy number variants (CNVs) detection is primarily performed by CMA, recent studies have shown that OGM has the capacity to detect all clinically relevant variants observed by standard of care studies.19 Moreover, OGM has been shown to effectively resolve complex chromosomal rearrangement (CCR) cases in the prenatal setting, assisting in characterizing the underlying genetic etiology of recurrent miscarriages.20 Moreover, germline studies have highlighted the potential for OGM to explore the ‘dark genome’ – repetitive regions of the genome which have been challenging to effectively characterize using Next Generation Sequencing (NGS) approaches. Repeat expansions – small (3-6 bp) nucleotide repeats which can be found tens to thousands of times in tandem – are associated with the onset of Fragile X syndrome, or myotonic dystrophy, among other conditions and can be successfully identified by OGM analyses if within the detection limits (i.e. > 500bp) of OGM analysis.21-23 Moreover, this approach led to the detection of novel mechanisms of pathogenesis, including the identification of a ~2.8 kb of a insertion within the intron 2 of SMARCB1 which was missed by whole genome sequencing due to the repetitive nature of the sequence coupled with the short-read nature of traditional NGS technologies, and provided a novel mechanism of inactivation leading to cancer predisposition.24
Despite these successes, it is important to note the inherent limitations of this technology which begins with the need for ultra-high weight molecular DNA. This precludes the capacity to evaluate specimens which have undergone fixation or to profile DNA that was isolated using conventional extractions. Moreover, not all specimens may yield effective isolation, which may be influenced by pre-analytical variables (specimen quality) or related to the technical performance of the isolation. OGM is also not presently a high-throughput technology. Each chip is composed of three flow-cells. At the time of this publication, current OGM workflows using a single instrument have capacity to analyze ~18-30 genomes per week. OGM also does not provide sequence level data and thus may require orthogonal, sequenced-based approaches to confirm certain classes of structural variants (i.e., small insertional events). Finally, with its increased detection of cryptic SVs, OGM may detect increased genomic variation of unknown significance and challenges current interpretative capabilities.
Taken together, while OGM shows the exquisite potential to significantly ameliorate the detection of clinically relevant structural variants above and beyond current cytogenetic approaches, its specific deployment as a possible clinical tool is best evaluated in the context of the totality of molecular and cytogenetic assays available at each institute. With the constant evolution in both understanding and technological capabilities, clinical diagnostic laboratories are presently faced with constraints regarding how best to prioritize the development of new assays. While SV-detection is beginning to undergo a radical transformation – from the resolution of gross chromosomal changes to nucleotide level solutions – the analytics and interpretation databases required to support nucleotide-level resolution is currently challenging for the typical clinical laboratory. As such, OGM represents a critical steppingstone to more finely resolved detection of structural lesions with relatively uniform and unbiased coverage of the genome.


