Best Practices for Clinical Whole Genome Sequencing (PDF)

Document Details

Uploaded by Deleted User

Christina A. Austin-Tse, Vaidehi Jobanputra, Denise L. Perry, David Bick, Ryan J. Taft, Eric Venner, Richard A. Gibbs, Ted Young, Sarah Barnett, John W. Belmont, Nicole Boczek, Shimul Chowdhury, Katar

Tags

whole genome sequencing genetic disorders clinical diagnostics medical genomics

Summary

This document presents best practice recommendations for the interpretation and reporting of clinical diagnostic whole genome sequencing. It discusses challenges and emerging approaches critical to maximize the potential of this comprehensive test, including variant interpretation and reporting as well as the use of clinical genomics in diagnostic settings.

Full Transcript

www.nature.com/npjgenmed REVIEW ARTICLE OPEN Best practices for the interpretation and reporting of clinical whole genome sequencing Christina A. Austin-Tse 1,2,3,16 ✉, Vaidehi Jobanputra 4,5,16, Deni...

www.nature.com/npjgenmed REVIEW ARTICLE OPEN Best practices for the interpretation and reporting of clinical whole genome sequencing Christina A. Austin-Tse 1,2,3,16 ✉, Vaidehi Jobanputra 4,5,16, Denise L. Perry6, David Bick 7, Ryan J. Taft6, Eric Venner8, Richard A. Gibbs8, Ted Young 9, Sarah Barnett10, John W. Belmont 6, Nicole Boczek10,11, Shimul Chowdhury12, Katarzyna A. Ellsworth12, Saurav Guha 4, Shashikant Kulkarni13,14, Cherisse Marcou10, Linyan Meng13,14, David R. Murdock8,14, Atteeq U. Rehman4, Elizabeth Spiteri15, Amanda Thomas-Wilson4, Hutton M. Kearney 10,16, Heidi L. Rehm1,3,16 and Medical Genome Initiative* Whole genome sequencing (WGS) shows promise as a first-tier diagnostic test for patients with rare genetic disorders. However, standards addressing the definition and deployment practice of a best-in-class test are lacking. To address these gaps, the Medical Genome Initiative, a consortium of leading health care and research organizations in the US and Canada, was formed to expand access to high quality clinical WGS by convening experts and publishing best practices. Here, we present best practice recommendations for the interpretation and reporting of clinical diagnostic WGS, including discussion of challenges and emerging approaches that will be critical to harness the full potential of this comprehensive test. npj Genomic Medicine (2022)7:27 ; https://doi.org/10.1038/s41525-022-00295-z 1234567890():,; INTRODUCTION are appropriately and accurately interpreted, validated, and Whole genome sequencing (WGS) is emerging as a first-tier reported, laboratories must carefully consider additional steps in diagnostic test for rare genetic diseases1,2. Compared to whole the testing process, including test ordering and orthogonal exome sequencing (WES) and other molecular diagnostic tests confirmation. (e.g. sequencing panels, microarrays), WGS is more comprehensive To facilitate more widespread adoption of whole genome for two reasons: (i) it allows detection of a broad range of variant sequencing, the Medical Genome Initiative16 (MGI) formed a types in a single assay, including single nucleotide variants (SNV), working group to establish best practice recommendations for the small insertions and deletions, mitochondrial variants (MT), repeat interpretation and reporting of clinical diagnostic WGS as a expansions (RE), copy number variants (CNV) and other structural comprehensive test. Teleconference meetings were held over a variants (SV); and (ii) it is untargeted, resulting in more uniform 12-month period. Informal polling (Supplementary Note 1) was coverage of exonic regions3–5 and added coverage of intronic, used to gain insight into the current practices of each member intergenic and regulatory regions. institution related to a multitude of topics including requisition/ Multiple publications have demonstrated the diagnostic super- consent, data annotation, analysis, triage and variant curation, iority of WGS as compared to chromosomal microarray (CMA), reporting, and reanalysis. Information obtained was used to guide karyotyping, or other targeted sequencing assays2,6–10. While a the discussion and development of recommendations based on recent meta-analysis11 found no significant difference in yields consensus among the participating laboratories. The discussions between WES and WGS, comparisons across cohorts, such as this also allowed identification of areas lacking consensus and key one, have limited utility given the variability introduced by unmet needs that, if addressed, would enable increased adoption differing patient age groups, clinical indications, family structures, of WGS in routine practice. and variant types analyzed12. In contrast, studies comparing yields within the same cohort support diagnostic or analytical superiority of WGS2,6,13–15. As a result, WGS has the potential to replace most OVERVIEW other forms of DNA-based testing. Clinical diagnostic genomic sequencing tests can be separated Genome test interpretation and reporting represents a chal- into three phases of analysis: primary, secondary, and tertiary (Fig. lenge to laboratories seeking to implement, or maximize the 1). Primary analysis encompasses the technical components of the diagnostic potential of, clinical WGS. For instance, laboratories assay, including DNA extraction, library preparation, sequence must design analytical strategies capable of efficiently prioritizing generation, and preliminary data quality control (QC). Secondary clinically relevant variation across all variant types captured by analysis involves bioinformatic processes such as alignment of the WGS (Table 1). Furthermore, to ensure that the prioritized variants raw sequence data to a genome reference, variant calling, and 1 Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA. 2Laboratory for Molecular Medicine, Mass General Brigham Personalized Medicine, Cambridge, MA, USA. 3Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA. 4Molecular Diagnostics Laboratory, New York Genome Center, New York, NY, USA. 5Department of Pathology and Cell Biology, Columbia University Irving Medical Center, New York, NY, USA. 6Illumina Inc., San Diego, CA, USA. 7 HudsonAlpha Institute for Biotechnology, Huntsville, AL, USA. 8Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, USA. 9Genome Diagnostics, Department of Paediatric Laboratory Medicine, The Hospital for Sick Children, Toronto, ON, Canada. 10Division of Laboratory Genetics and Genomics, Department of Laboratory Medicine and Pathology, Mayo Clinic, Rochester, MN, USA. 11Center for Individualized Medicine, College of Medicine, Mayo Clinic, Rochester, MN, USA. 12Rady Children’s Institute for Genomic Medicine, San Diego, CA, USA. 13Baylor Genetics and Baylor College of Medicine, Houston, TX, USA. 14Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA. 15Department of Pathology, Stanford Medicine, Stanford University, Stanford, CA, USA. 16These authors contributed equally: Christina A. Austin-Tse, Vaidehi Jobanputra, Hutton M. Kearney, Heidi L. Rehm. *A list of members and their affiliations appears in the Supplementary Information. ✉email: [email protected] Published in partnership with CEGMR, King Abdulaziz University 1234567890():,; 2 Table 1. Interpretation and reporting considerations for clinically relevant variant types detectable by clinical WGS. Variant type or category Key interpretation and reporting considerations Diagnostic potentiala Exome platform Genome platform maturityb maturityb Single nucleotide variants (SNV) Represents the largest number of variants for review, requiring phenotype-driven and High High High genotype-driven filtering strategies (Fig. 2). All possible consequences should be considered (e.g. review of splicing annotations, transcript-specific impacts, MNVs). Some positions may have more than one alternate allele (multi-allelic variant) Small (1%) to ensure known pathogenic high of an unbiased analysis approach. This strategy facilitates the frequency variants will be reviewed. Resources for identifying detection of: (1) unanticipated genetic diagnoses that may explain these types of variants are being assembled through ClinGen48 all or a portion of the patient phenotype; (2) patients with unusual and the Genetic Testing Reference Materials Coordination presentations of established disorders (phenotypic expansion); (3) Program (GeT-RM, https://www.cdc.gov/labquality/get-rm/index. multiple genetic diagnoses in a single individual; (4) variants html). To supplement these resources, we have also provided relevant to a secondary phenotype or family history of disease; (5) examples of low penetrance, risk, and other high frequency variants in novel disease genes; and (6) clinically significant variants of interest in Supplementary Data 4. incidental or secondary findings. Beyond the identification of variants in well-established disease Genotype-driven analyses should aim to capture all variants genes that match the patient phenotype, genotype-driven that might have sufficient evidence to be classified as pathogenic analysis can also be tailored to the discovery of novel disease or likely pathogenic, including: previously reported disease- genes. Given the rapid pace at which new genotype-phenotype causing variants, predicted loss of function (pLOF) variants (e.g. correlations are discovered, reporting these findings may aid in nonsense, frameshift, and essential splice sites) and, when building evidence for disease causality in a time period relevant to multiple family members are sequenced, variants that are the patient’s care. As a result, clinical laboratories are encouraged suspicious based on their inheritance pattern (e.g. de novo to include analysis of genes not yet linked to disease with variants or biallelic rare variants in a gene associated with a judicious reporting. Gene discovery analyses may prioritize de recessive disorder). Automated prioritization of previously novo and/or pLOF variants in highly constrained genes based on reported pathogenic variants can be challenging given the gnomAD constraint scores as well as biallelic pLOF variants in unstructured nature of the scientific literature. To aid analysts in genes that are devoid of homozygous LOF variants in gnomAD. identifying variants with known or suspected disease association, software tools may identify variants previously reported in Phenotype-driven analysis association with any phenotype based on incorporated literature In most cases, the genotype-driven analysis should be supple- searches or database entries. ClinVar is a critical database for this mented with additional “phenotype-driven” analyses, particularly purpose44. While variant classifications in ClinVar should not be if there are specific genes that are highly relevant to the patient’s assumed to be correct, they represent a useful tool to efficiently phenotype. Phenotype-driven analyses allow for the comprehen- identify candidate variants and publications that warrant further review. In addition to freely accessible and downloadable sive review of potentially relevant variants that may not meet the databases like ClinVar, laboratories may choose to supplement criteria defined in the genotype-driven analysis approach their annotations with gene or variant-disease relationships from described above (e.g. novel missense variants in dominant genes). additional sources including subscription-based databases (see Variants identified exclusively by phenotype-driven analyses are Supplementary Data 3 for databases in use at participating more likely to be classified as benign or variants of uncertain institutions). significance given that they have no prior reports of pathogeni- In order to reduce the number of variants requiring expert city, no de novo occurrence, and no predicted LOF impact, all of review and to target variants most likely to cause genetic disease, which would surface through genotype-driven analyses. Never- additional filters are typically applied to genotype-driven analyses. theless, they may still meet criteria for reporting if located in a For example, expert review may be focused on variants in or near gene strongly associated with the patient’s phenotype (see genes that have a reported link to human disease, such as those reporting section below). Some nonspecific or highly genetically curated by OMIM and other gene-level resources, most of which heterogeneous phenotypes (e.g. developmental delay and autism) have been aggregated by the Gene Curation Coalition (thegencc. are less likely to benefit from phenotype-driven analysis strategies. org). To further reduce the interpretive burden, laboratories also Therefore, the decision regarding the appropriateness of this employ allele frequency cutoffs, which make use of population approach is at the discretion of the analysis team. frequency data from reference databases such as the Genome When performing phenotype-driven analyses, laboratories must Aggregation Database (gnomAD; https://gnomad.broadinstitute. have a mechanism for defining the genes of interest for a given org/) to exclude variants that are too common to cause rare phenotype. Automated phenotype-driven analyses, such as those genetic disease. Caution must be exercised as some cohorts in integrated into commercially available genomic analysis plat- gnomAD do not represent the general population and were not forms, typically depend on structured patient phenotype data (e.g. screened to exclude all individuals with a genetic disease. HPO terms) to prioritize variants found in genes relevant to the Additionally, variants that arise from clonal hematopoiesis of patient’s phenotype. While automated methods have clear indeterminate potential (CHIP)45,46 may falsely elevate population benefits in terms of efficiency, their performance varies depending allele frequencies in several genes associated with germline on the algorithms and gene-phenotype association sources used. genetic syndromes (e.g. DNMT3A, ASXL1, and TP53)47. Alternatively, laboratories may manually curate a list of relevant While reducing the number of variants requiring review is genes that can be used to prioritize variants for downstream critical to the efficiency of WGS analysis, filtering criteria must analysis. There are multiple available sources of gene-disease ensure true pathogenic variation is not missed. For example, association information (see Supplementary Data 5). Of note, there pathogenic founder variants, variants with reduced penetrance, or is no single source from which all relevant genes can reliably be variants in genes associated with a disease of varying clinical mined. As a result, the curation of gene-disease associations from severity may be more common in the population than the applied multiple databases is likely to produce a more comprehensive list. frequency cutoff, yet still be clinically relevant to the patient. Furthermore, caution should be exercised as idiosyncrasies in Laboratories must design filtering approaches to account for this search functionality and gene-disease annotations can lead to circumstance. For example, laboratories could review all variants missing critical genes (e.g. if a database associates the GJB2 gene classified as likely pathogenic/pathogenic in ClinVar and the exclusively with “hearing loss”, a query using the term “deafness” laboratory’s internal knowledge base with no additional filtering may not return the gene). Given multiple potential sources of error criteria, or implement more permissive criteria (e.g. 500 kb for deletions and WGS reporting policies should maximize the test’s diagnostic >1 Mb for duplications) or gene content threshold (e.g. more than potential while minimizing the number of variants that may cause 25 deleted genes), should be curated and classified per published unnecessary clinician follow-up or patient stress or anxiety. guidelines77, and reported according to laboratory policy. Policies should be available to patients and ordering providers to ensure the patient is appropriately consented prior to the In-depth variant assessment initiation of testing. Finally, laboratories are strongly encouraged When variants of interest are identified during triage, in-depth to engage ordering providers in reporting decisions when there is gene and variant analysis should be performed prior to making uncertainty as to whether a variant aligns with the patient’s reporting decisions. Such analyses should follow existing guide- phenotype or the family’s preferences. This communication and lines on the interpretation of sequence variants78,79, copy number the factors considered in the decision to return a result should be variants77, low penetrance/risk alleles (https://www. documented in the laboratory’s record. clinicalgenome.org/working-groups/low-penetrance-risk-allele- working-group/), mitochondrial variants58,80,81, and evaluating Result summary gene-disease relationships82. While it is common practice for targeted panel results to be classified as “Positive”, “Negative”, or “Inconclusive” based on the Secondary review variants identified, the definition of these categories becomes Given the large amount of information processed by analysts more complex in genomic testing, which may identify variants during triage steps, it is recommended that all triage decisions be relevant to the primary indication for testing, variants that explain agreed upon by at least two trained staff members. This QC step non-primary phenotypes, secondary findings, or other variant increases the accuracy of genome interpretations and aims to types. We support current ACMG recommendations49, which state identify potentially missed candidate variants. Dual review that “Primary findings in a diagnostic test should appear as a increases the time and cost of the test, but the laboratory may succinct interpretive result at the beginning of the report efficiently design the review step in consideration of the type of indicating the presence or absence of variants consistent with personnel completing each step and the depth of review required the phenotype.” Laboratories may find terms such as “Positive” or to achieve high quality genome interpretation. “Negative” useful for straightforward WGS results, but a descrip- tive statement defining those terms (e.g. “Positive: Findings Orthogonal confirmation or characterization of reportable explain indication for testing”) should also be considered. When variants the interpretive result is more complex, descriptive statements that speak to the relevance of the result to the patient phenotype The decision to pursue orthogonal testing depends on the extent are essential. The integration of WGS results into the medical of the laboratory’s validation for a particular variant type, quality record should be considered when drafting interpretive result metrics for a specific variant and requirements from regulatory summaries, since the summary may influence whether or not a bodies83–85. For example, a laboratory may conduct orthogonal provider chooses to review the report in detail. It is also possible testing on the same specimen to confirm a variant with low that future clinical informatics standards (e.g. HL7) may dictate quality metrics, confirm the result with a new specimen, clarify the that each indication for testing has a separate and distinct result. exact repeat size for a locus with an expanded STR call using a Therefore, considering report structure in light of these evolving targeted assay, deeply sequence (via a separate NGS test) an SNV standards may be prudent. to investigate potential of mosaicism, or determine levels of heteroplasmy in additional tissues from the affected individual. Technical limitations Reporting practices for technical limitations should be consistent REPORTING with other NGS-based testing standards49. In addition to listing Report contents should conform to existing guidelines for key components of the bioinformatics pipeline, the description of reporting from the ACMG49. Additional considerations particularly the analysis strategy should define all filtering and/or prioritization relevant to WGS reporting are presented below, including variants approaches used. The known limitations of the testing methodol- and genes of uncertain significance, STRs, and variants unrelated ogy as well as any variant types not interrogated should be to the primary indication for testing. described. Any known technically challenging regions or coverage Variants identified in established disease genes (as defined by issues in genes that are likely to be highly relevant to the patient current standards82) that are relevant to the primary indication for (e.g. limited sensitivity for F8 inversions in a patient with Published in partnership with CEGMR, King Abdulaziz University npj Genomic Medicine (2022) 27 C.A. Austin-Tse et al. 10 Table 2. Suggested steps of reanalysis based on events that have occurred since initial analysis. Tertiary analysis Change since initial analysis Primary analysis (sample/ Secondary analysis (mapping, Annotation Variant Variant and Reporting library prep and sequencing) alignment, variant calling, QC) stratification gene assessment Significant improvements in library ✔ ✔ ✔ ✔ ✔ ✔ prep/sequencing technology Bioinformatics improvements ✔ ✔ ✔ ✔ ✔ >1 year lapsed since initial analysis ✔ ✔ ✔ ✔ Additional patient phenotypes or ✔ ✔ ✔ family history Improved understanding of the ✔ ✔ ✔ genetic etiology of patient condition New methodology or resource for ✔ ✔ variant assessment hemophilia86) should be specifically called out as a potential Furthermore, given the differing approaches, algorithms, and source of reduced sensitivity. Several tools for calculating region- professional opinion inherent to WGS analysis, laboratories should specific coverage are freely available87–89. support the sharing of raw sequencing data and other file types to enable analysis by other laboratories or research programs if requested by the patient or ordering provider. MGI laboratories REANALYSIS are currently providing this data through encrypted hard drives or Periodic case reanalysis has been demonstrated to improve access through a portal. diagnostic yield90–93. It is therefore recommended that labora- tories provide an option for reanalysis of finalized WGS cases. Ideally, reanalysis policies should be developed in advance of test KEY RECOMMENDATIONS launch and communicated to providers at the time the test is Key recommendations are summarized below. Given our goal to ordered. The ACMG has recently produced two “points to provide a complete reference for WGS, this list includes consider” documents relevant to reanalysis policies and proce- recommendations that are also relevant to WES. dures94,95. Of note, procedures for variant-level reevaluation are not substantially different from other genetic testing methods and Requisition/Consent have been addressed elsewhere78,94. While the use of structured phenotype ontologies is important Robust reanalysis may necessitate re-running multiple steps of for the automation and scalability of WGS, laboratories are the test (Table 2). Given the significant role that the patient cautioned against requiring ordering providers to submit phenotype plays in the analysis process, laboratories should information in the primary structured form given the potential request updates to the patient’s medical and family history prior for loss of detailed and nuanced phenotypic information to initiating reanalysis. Phenotype- and genotype-driven analyses The test ordering process should enable physicians to specify should be reviewed and adjusted in light of the updated patient the primary clinical question of interest information. Test requisition forms and any laboratory provided consent Even in the absence of new phenotype information, newly forms should clearly indicate that genetic variants relevant to published variant or gene-level evidence may allow a previously any phenotype provided to the laboratory may be returned unreviewed variant to meet criteria for expert review or a unless specific reporting instructions are provided by the previously reviewed variant to meet criteria for reporting. ordering clinician Laboratories may consider suggesting a minimum time period For trio or other multiple-family member sequencing (e.g. one year) to have elapsed since the initial analysis to conduct approaches, requisition and/or consent forms should clarify this type of case review. Alternatively, reanalysis may be initiated how the data from auxiliary family members will be used and by new publications, updated transcripts and new gene models, reported or changes to the WGS test definition that may impact a set Phenotypic data submitted with the test order should of cases. undergo review by laboratory staff prior to the initiation of The initiation of reanalysis may be either reactive (i.e., when testing and laboratories should seek clarification of unclear or requested by the ordering provider/patient) or proactive (inde- conflicting information pendently triggered by the lab). As of the time of this publication, Policies for secondary findings analysis and secondary and the majority of laboratories currently offering WGS analysis are incidental findings reporting should be developed in advance performing reactive reanalysis. However, proactive reanalysis, of launching a WGS test and clearly defined on the including the acquisition of updated clinical information, is laboratory’s requisition and any provided consent form recognized as an important step towards maximizing the clinical Annotation utility of the WGS test. Several key issues stand in the way of this ideal, including the personnel effort required to conduct analysis, In the absence of infrastructure that can support continuous lack of reimbursement, limited tools to enable tracking of cases in updates, we recommend that data sources used in annotation need of follow-up, and limited tools to automatically identify new pipelines be updated at least quarterly. scientific literature relevant to variants and genes reviewed in past Until noncoding regions of the genome can be system- cases. Improved systems to address these issues will be critical to atically interrogated and interpreted, laboratories should the future implementation of proactive reanalysis. ensure that their pipelines are able to capture all known Given the substantial effort required for case reanalysis, pathogenic variants in ClinVar, including those that fall laboratories may need to charge a fee for this service. outside of coding sequence and flanking intronic regions. npj Genomic Medicine (2022) 27 Published in partnership with CEGMR, King Abdulaziz University C.A. Austin-Tse et al. 11 Analysis variants that affect exonic splice enhancers, and databases cataloguing structural variation within large populations). The overall analysis approach should incorporate both genotype-driven and phenotype-driven strategies. Analysis The complete set of variants meeting the genotype- and Routine deposition of variant data in structured format into phenotype-centric filtering criteria should be reviewed for centralized databases (e.g. ClinVar) and scientific literature to every case, even when a likely explanatory variant is identified improve identification of previously reported variants. early in the triage process. Improved tools for calling, filtering, and interpretation of SVs If the interpretive tool in use does not readily identify and STRs. compound heterozygous calls across variant classes (e.g. Maturation and validation of AI/machine learning tools will be SNV and CNV), re-examination of calls across all relevant needed to scale analysis. variant classes should be performed when a compelling monoallelic variant is found in an autosomal recessive gene Reporting associated with the patient’s phenotype. Structured integration of WGS results into the medical record Given the large amount of information processed by analysts or connected platforms. during triage steps, it is recommended that all variants identified by genotype- and phenotype-centric filtering Reanalysis criteria be reviewed by at least two trained staff members. Tools to support systematic proactive reanalysis (i.e., tools to The sensitivity and specificity of automated analysis tools automatically identify new scientific literature relevant to remain insufficient to replace the role of a human analyst. variants and genes reviewed in past cases). Results from automated analyses should be carefully assessed Reimbursement for reanalysis. by a human analyst prior to consideration for reporting. Reporting Reporting summary The laboratory should establish policies defining the types of Further information on research design is available in the Nature findings that are considered for return, and these policies Research Reporting Summary linked to this article. should be available to patients and providers. Variant-level evidence, gene-level evidence, and the DATA AVAILABILITY correlation between the patient’s phenotype and the All structured data generated or analyzed during this study are included in this gene should be addressed. published article (and its supplementary information files). The goal of reporting policies should be to maximize the test’s diagnostic potential while minimizing the number of Received: 4 August 2021; Accepted: 17 February 2022; VUS reported. We strongly encourage open communication between order- ing providers and the laboratory regarding reporting deci- sions, particularly for more challenging cases (e.g. if there is REFERENCES uncertainty as to whether a finding aligns with patient 1. Scocchia, A. et al. Clinical whole genome sequencing as a first-tier test at a phenotype). resource-limited dysmorphology clinic in Mexico. NPJ Genom. Med. 4, 5 (2019). When summarizing WGS findings on reports, laboratories may 2. Lionel, A. C. et al. Improved diagnostic yield compared with targeted gene find terms such as “Positive” or “Negative” useful for sequencing panels suggests a role for whole-genome sequencing as a first-tier straightforward results, but a descriptive statement defining genetic test. Genet. Med. 20, 435–443 (2018). 3. Meienberg, J., Bruggmann, R., Oexle, K. & Matyas, G. Clinical sequencing: is WGS those terms (i.e. “Positive: Findings explain indication for the better WES? Hum. Genet. 135, 359–362 (2016). testing”) should also be considered. When the interpretive 4. Belkadi, A. et al. Whole-genome sequencing is more powerful than whole-exome result is more complex, high-level descriptive statements that sequencing for detecting exome variants. Proc. Natl Acad. Sci. USA 112, speak to the relevance of the result to the patient phenotype 5473–5478 (2015). are essential. 5. Lelieveld, S. H., Spielmann, M., Mundlos, S., Veltman, J. A. & Gilissen, C. Com- parison of exome and genome sequencing technologies for the complete cap- Reanalysis ture of protein‐coding regions. Hum. Mutat. 36, 815–822 (2015). It is highly recommended that laboratories provide an option 6. Bertoli-Avella, A. M. et al. Successful application of genome sequencing in a for reanalysis of WGS cases. A charge for this service is diagnostic setting: 1007 index cases from a clinically heterogeneous cohort. Eur. J. Hum. Genet. https://doi.org/10.1038/s41431-020-00713-9 (2020). acceptable. 7. Stavropoulos, D. J. et al. Whole genome sequencing expands diagnostic utility and improves clinical management in pediatric medicine. NPJ Genom Med 1, (2016). 8. Willig, L. K. et al. Whole-genome sequencing for identification of Mendelian KEY UNMET NEEDS disorders in critically ill infants: a retrospective analysis of diagnostic and clinical Requisition/Consent findings. Lancet Respir. Med 3, 377–387 (2015). 9. Ostrander, B. E. P. et al. Whole-genome analysis for effective clinical diagnosis and Phenotype capture methods maximize the amount of patient gene discovery in early infantile epileptic encephalopathy. NPJ Genom. Med 3, 22 and family history information available to laboratories in a (2018). structured, machine-readable format without placing unne- 10. Rajagopalan, R. et al. Genome sequencing increases diagnostic yield in clinically diagnosed Alagille syndrome patients with previously negative test results. Genet. cessary burden on clinicians. Med. https://doi.org/10.1038/s41436-020-00989-8 (2020). Annotation 11. Clark, M. M. et al. Meta-analysis of the diagnostic and clinical utility of genome and exome sequencing and chromosomal microarray in children with suspected Standardization of NGS data annotations. genetic diseases. NPJ Genom. Med. 3, 16 (2018). Annotations to support analysis of a broader range of 12. Bick, D., Jones, M., Taylor, S. L., Taft, R. J. & Belmont, J. Case for genome molecular pathogenic mechanisms (e.g. in silico predictors sequencing in infants and children with rare, undiagnosed or genetic diseases. J. for noncoding variants that affect promoters or coding Med. Genet. 56, 783–791 (2019). Published in partnership with CEGMR, King Abdulaziz University npj Genomic Medicine (2022) 27 C.A. Austin-Tse et al. 12 13. Gilissen, C. et al. Genome sequencing identifies major causes of severe intellec- 43. Vaché, C. et al. Usher syndrome type 2 caused by activation of an USH2A tual disability. Nature 511, 344–347 (2014). pseudoexon: implications for diagnosis and therapy. Hum. Mutat. 33, 104–108 14. Splinter, K. et al. Effect of genetic diagnosis on patients with previously (2012). undiagnosed disease. N. Engl. J. Med. 379, 2131–2139 (2018). 44. Landrum, M. J. et al. ClinVar: improving access to variant interpretations and 15. Kingsmore, S. F. et al. A randomized, controlled trial of the analytic and diagnostic supporting evidence. Nucleic Acids Res. 46, D1062–D1067 (2018). performance of singleton and trio, rapid genome and exome sequencing in Ill 45. Bick, A. G. et al. Inherited causes of clonal haematopoiesis in 97,691 whole infants. Am. J. Hum. Genet. 105, 719–733 (2019). genomes. Nature 586, 763–768 (2020). 16. Marshall, C. R. et al. The Medical Genome Initiative: moving whole-genome 46. Steensma, D. P. Clinical consequences of clonal hematopoiesis of indeterminate sequencing for rare disease diagnosis to the clinic. Genome Med. 12, 48 (2020). potential. Blood Adv. 2, 3404–3410 (2018). 17. Marshall, C. R. et al. Best practices for the analytical validation of clinical whole- 47. Carlston, C. M. et al. Pathogenic ASXL1 somatic variants in reference databases genome sequencing intended for the diagnosis of germline disease. NPJ Genom. complicate germline variant interpretation for Bohring-Opitz Syndrome. https:// Med. 5, 47 (2020). doi.org/10.1101/090720. 18. Tanudisastro, H. A. et al. Australia and New Zealand renal gene panel testing in 48. Ghosh, R. et al. Updated recommendation for the benign stand-alone ACMG/ routine clinical practice of 542 families. NPJ Genom. Med. 6, 20 (2021). AMP criterion. Hum. Mutat. 39, 1525–1530 (2018). 19. Ashford, M. Stanford launches clinical whole-genome sequencing for inherited 49. Rehder, C. et al. Next-generation sequencing for constitutional variants in the cardiovascular testing. https://www.genomeweb.com/sequencing/stanford- clinical laboratory, 2021 revision: a technical standard of the American College of launches-clinical-whole-genome-sequencing-inherited-cardiovascular-testing Medical Genetics and Genomics (ACMG). Genet. Med. 1–17 (2021). (2021). 50. Martin, A. R. et al. PanelApp crowdsources expert knowledge to establish con- 20. Today, C. In next-gen sequencing, panel versus exome. https://www. sensus diagnostic gene panels. Nat. Genet. 51, 1560–1565 (2019). captodayonline.com/next-gen-sequencing-panel-versus-exome/ (2016). 51. Werling, D. M. et al. An analytical framework for whole-genome sequence 21. Dias, R. & Torkamani, A. Artificial intelligence in clinical and genomic diagnostics. association studies and its implications for autism spectrum disorder. Nat. Genet. Genome Med. 11, 70 (2019). 50, 727–736 (2018). 22. Clark, M. M. et al. Diagnosis of genetic diseases in seriously ill children by rapid 52. Gross, A. M. et al. Copy-number variants in clinical genome sequencing: whole-genome sequencing and automated phenotyping and interpretation. Sci. deployment and interpretation for rare and undiagnosed disease. Genet. Med. 21, Transl. Med. 11, (2019). 1121–1130 (2019). 23. Son, J. H. et al. Deep phenotyping on electronic health records facilitates genetic 53. Whitford, W., Lehnert, K., Snell, R. G. & Jacobsen, J. C. Evaluation of the perfor- diagnosis by clinical exomes. Am. J. Hum. Genet. 103, 58–73 (2018). mance of copy number variant prediction tools for the detection of deletions 24. Girdea, M. et al. PhenoTips: patient phenotyping software for clinical and from whole genome sequencing data. J. Biomed. Inform. 94, 103174 (2019). research use. Hum. Mutat. 34, 1057–1065 (2013). 54. Collins, R. L. et al. A structural variation reference for medical and population 25. Hammond, P. The use of 3D face shape modelling in dysmorphology. Arch. Dis. genetics. Nature 581, 444–451 (2020). Child. 92, 1120–1126 (2007). 55. Zhao, X. et al. Expectations and blind spots for structural variation detection from 26. Latorre-Pellicer, A. et al. Evaluating Face2Gene as a Tool to Identify Cornelia de long-read assemblies and short-read genome sequencing technologies. Am. J. Lange Syndrome by Facial Phenotypes. Int. J. Mol. Sci. 21, (2020). Hum. Genet. 108, 919–928 (2021). 27. Mishima, H. et al. Evaluation of Face2Gene using facial images of patients with 56. McKinlay Gardner, R. J., Gardner, R. J. M. & Amor, D. J. Gardner and Sutherland’s congenital dysmorphic syndromes recruited in Japan. J. Hum. Genet. 64, 789–794 Chromosome Abnormalities and Genetic Counseling. (Oxford University Press, (2019). 2018). 28. Schriml, L. M. et al. Human Disease Ontology 2018 update: classification, content 57. Laricchia, K. M., Lake, N. J., Watts, N. A., Shand, M. & Haessly, A. Mitochondrial DNA and workflow expansion. Nucleic Acids Res. 47, D955–D962 (2019). variation across 56,434 individuals in gnomAD. bioRxiv (2021). 29. Presidential Commission for the Study of Bioethical Issues. Anticipate and Com- 58. McCormick, E. M. et al. Specifications of the ACMG/AMP standards and guide- municate: Ethical Management of Incidental and Secondary Findings in the Clinical, lines for mitochondrial DNA variant interpretation. Hum. Mutat. 41, 2028–2057 Research, and Direct-to-consumer Contexts. (Createspace Independent Pub, 2015). (2020). 30. Kalia, S. S. et al. Recommendations for reporting of secondary findings in clinical 59. Kogelnik, A. M., Lott, M. T., Brown, M. D., Navathe, S. B. & Wallace, D. C. MITOMAP: exome and genome sequencing, 2016 update (ACMG SF v2.0): a policy statement a human mitochondrial genome database. Nucleic Acids Res. 24, 177–179 (1996). of the American College of Medical Genetics and Genomics. Genet. Med. 19, 60. Falk, M. J. et al. Mitochondrial Disease Sequence Data Resource (MSeqDR): a 249–255 (2017). global grass-roots consortium to facilitate deposition, curation, annotation, and 31. Webber, E. M. et al. Evidence-based assessments of clinical actionability in the integrated analysis of genomic data for the mitochondrial disease clinical and context of secondary findings: Updates from ClinGen’s Actionability Working research communities. Mol. Genet. Metab. 114, 388–396 (2015). Group. Hum. Mutat. 39, 1677–1685 (2018). 61. Preste, R., Vitale, O., Clima, R., Gasparre, G. & Attimonelli, M. HmtVar: a new 32. Bick, D. et al. Successful application of whole genome sequencing in a medical resource for human mitochondrial variations and pathogenicity data. Nucleic genetics clinic. J. Pediatr. Genet. 6, 61–76 (2017). Acids Res. 47, D1202–D1210 (2019). 33. de Wert, G. et al. Opportunistic genomic screening. Recommendations of the 62. Maude, H. et al. NUMT confounding biases mitochondrial heteroplasmy calls in European Society of Human Genetics. Eur. J. Hum. Genet. https://doi.org/10.1038/ favor of the reference allele. Front Cell Dev. Biol. 7, 201 (2019). s41431-020-00758-w (2020). 63. Dolzhenko, E. et al. ExpansionHunter: a sequence-graph-based tool to analyze 34. Schwartz, M. L. B. et al. A model for genome-first care: returning secondary variation in short tandem repeat regions. Bioinformatics 35, 4754–4756 (2019). genomic findings to participants and their healthcare providers in a large 64. Paulson, H. Repeat expansion diseases. Handb. Clin. Neurol. 147, 105–123 (2018). research cohort. Am. J. Hum. Genet. 103, 328–337 (2018). 65. Wallace, S. E. & Bean, L. J. H. Resources for genetics professionals — genetic dis- 35. O’Daniel, J. M. et al. A survey of current practices for genomic sequencing test orders caused by nucleotide repeat expansions and contractions. (University of interpretation and reporting processes in US laboratories. Genet. Med. 19, Washington, Seattle, 2019). 575–582 (2017). 66. Dolzhenko, E. et al. Detection of long repeat expansions from PCR-free whole- 36. Ackerman, S. L. & Koenig, B. A. Understanding variations in secondary findings genome sequence data. Genome Res. 27, 1895–1903 (2017). reporting practices across U.S. genome sequencing laboratories. AJOB Empir. 67. Ibanez, K. et al. Whole genome sequencing for the diagnosis of neurological Bioeth. 9, 48–57 (2018). repeat expansion disorders in the UK: a retrospective diagnostic accuracy and 37. GA4GH variation representation specification—GA4GH variation representation prospective clinical validation study. Lancet Neurol. 21, 234–245 (2022). specification 1.1.2 documentation. https://vrs.ga4gh.org/en/stable/. 68. Ji, J. et al. A semiautomated whole-exome sequencing workflow leads to 38. den Dunnen, J. T. et al. HGVS recommendations for the description of sequence increased diagnostic yield and identification of novel candidate variants. Cold variants: 2016 update. Hum. Mutat. 37, 564–569 (2016). Spring Harb. Mol. Case Stud. 5, (2019). 39. Vulliamy, T., Marrone, A., Dokal, I. & Mason, P. J. Association between aplastic 69. Thuriot, F. et al. Clinical validity of phenotype-driven analysis software PhenoVar anaemia and mutations in telomerase RNA. Lancet 359, 2168–2170 (2002). as a diagnostic aid for clinical geneticists in the interpretation of whole-exome 40. Bertini, V. et al. Blepharophimosis, ptosis, epicanthus inversus syndrome: new sequencing data. Genet. Med. 20, 942–949 (2018). report with a 197-kb deletion upstream of FOXL2 and review of the literature. 70. Stark, Z. et al. A clinically driven variant prioritization framework outperforms Mol. Syndromol. 10, 147–153 (2019). purely computational approaches for the diagnostic analysis of singleton WES 41. Chatterjee, S. & Ahituv, N. Gene Regulatory Elements, Major Drivers of Human data. Eur. J. Hum. Genet. 25, 1268–1272 (2017). Disease. Annu. Rev. Genomics Hum. Genet. 18, 45–63 (2017). 71. Cipriani, V. et al. An improved phenotype-driven tool for rare mendelian variant 42. Whiffin, N. et al. Characterising the loss-of-function impact of 5’ untranslated prioritization: benchmarking exomiser on real patient whole-exome data. Genes region variants in 15,708 individuals. Nat. Commun. 11, 2523 (2020). 11, (2020). npj Genomic Medicine (2022) 27 Published in partnership with CEGMR, King Abdulaziz University C.A. Austin-Tse et al. 13 72. Lincoln, S. E. et al. One in seven pathogenic variants can be challenging to detect 94. Deignan, J. L. et al. Points to consider in the reevaluation and reanalysis of by NGS: an analysis of 450,000 patients with implications for clinical sensitivity genomic test results: a statement of the American College of Medical Genetics and genetic test implementation. Genet. Med. (2021) https://doi.org/10.1038/ and Genomics (ACMG). Genet. Med. 21, 1267–1270 (2019). s41436-021-01187-w. 95. David, K. L. et al. Patient re-contact after revision of genomic test results: points to 73. Wilcox, E. et al. Creation of an expert curated variant list for clinical genomic test consider—a statement of the American College of Medical Genetics and Geno- development and validation: A ClinGen and GeT-RM collaborative project. bioRxiv mics (ACMG). Genet. Med. 21, 769–771 (2019). (2021) https://doi.org/10.1101/2021.06.09.21258594. 74. Posey, J. E. et al. Resolution of disease phenotypes resulting from multilocus genomic variation. N. Engl. J. Med. 376, 21–31 (2017). ACKNOWLEDGEMENTS 75. Philippakis, A. A. et al. The Matchmaker Exchange: a platform for rare disease We thank the following individuals for their critical review of the manuscript: Steven gene discovery. Hum. Mutat. 36, 915–921 (2015). Harrison, Michael Zody, Erin Thorpe, Julie Taylor, Aditi Chawla, R. Tanner Hagelstrom, 76. Azzariti, D. R. & Hamosh, A. Genomic data sharing for novel Mendelian disease Michael Stromberg, David Dimmock, and Christian Marshall. gene discovery: the matchmaker exchange. Annu. Rev. Genomics Hum. Genet. 21, 305–326 (2020). 77. Riggs, E. R. et al. Technical standards for the interpretation and reporting of AUTHOR CONTRIBUTIONS constitutional copy-number variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics (ACMG) and the Clinical C.A.T., V.J., D.L.P., H.M.K., and H.L.R. conceived the idea. C.A.T. and V.J. contributed Genome Resource (ClinGen). Genet. Med. 22, 245–257 (2020). equally to writing the manuscript, with significant contributions from D.L.P., H.M.K., 78. Richards, S. et al. Standards and guidelines for the interpretation of sequence and H.L.R. C.A.T., V.J., and D.L.P. collected and analyzed polling responses. D.L.P., D.B., variants: a joint consensus recommendation of the American College of Medical E.V., R.G., T.Y., S.B., N.B., K.A.E., S.G., C.M., L.M., D.R.M., A.R., E.S., and A.T.W. contributed Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. original data used to drive consensus. D.B., R.J.T., R.G., T.Y., E.V., J.W.B., S.C., S.K., D.R.M., 17, 405–424 (2015). E.S., H.M.K., and H.L.R. provided design advice and critical review of the manuscript. 79. Clinical Genome Resource. Sequence Variant Interpretation. https:// clinicalgenome.org/working-groups/sequence-variant-interpretation/. 80. Wong, L.-J. C. et al. Clinical and laboratory interpretation of mitochondrial mRNA COMPETING INTERESTS variants. Hum. Mutat. https://doi.org/10.1002/humu.24082 (2020). D.L.P. and R.J.T. are current employees and shareholders of Illumina Inc. The 81. Wong, L.-J. C. et al. Interpretation of mitochondrial tRNA variants. Genet. Med. 22, remaining authors declare no competing interests. 917–926 (2020). 82. Strande, N. T. et al. Evaluating the clinical validity of gene-disease associations: an evidence-based framework developed by the clinical genome resource. Am. J. ADDITIONAL INFORMATION Hum. Genet. 100, 895–906 (2017). Supplementary information The online version contains supplementary material 83. Holt, J. M. et al. Reducing Sanger confirmation testing through false positive available at https://doi.org/10.1038/s41525-022-00295-z. prediction algorithms. Genet. Med. https://doi.org/10.1038/s41436-021-01148-3 (2021). Correspondence and requests for materials should be addressed to Christina A. 84. Lincoln, S. E. et al. A rigorous interlaboratory examination of the need to confirm Austin-Tse. next-generation sequencing-detected variants with an orthogonal method in clinical genetic testing. J. Mol. Diagn. 21, 318–329 (2019). Reprints and permission information is available at http://www.nature.com/ 85. Baudhuin, L. M. et al. Confirming variants in next-generation sequencing panel reprints testing by sanger sequencing. J. Mol. Diagn. 17, 456–461 (2015). 86. Konkle, B. A., Huston, H. & Nakaya Fletcher, S. Hemophilia A. in GeneReviews (eds. Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims Adam, M. P. et al.) (University of Washington, Seattle, 2000). in published maps and institutional affiliations. 87. Pedersen, B. S. & Quinlan, A. R. Mosdepth: quick coverage calculation for gen- omes and exomes. Bioinformatics 34, 867–868 (2018). 88. Quinlan, A. R. BEDTools: The Swiss-army tool for genome feature analysis. Curr. Protoc. Bioinforma. 47, 11.12.1–34 (2014). Open Access This article is licensed under a Creative Commons 89. Tarasov, A., Vilella, A. J., Cuppen, E., Nijman, I. J. & Prins, P. Sambamba: fast Attribution 4.0 International License, which permits use, sharing, processing of NGS alignment formats. Bioinformatics 31, 2032–2034 (2015). adaptation, distribution and reproduction in any medium or format, as long as you give 90. Costain, G. et al. Periodic reanalysis of whole-genome sequencing data enhances appropriate credit to the original author(s) and the source, provide a link to the Creative the diagnostic advantage over standard clinical genetic testing. Eur. J. Hum. Commons license, and indicate if changes were made. The images or other third party Genet. 26, 740–744 (2018). material in this article are included in the article’s Creative Commons license, unless 91. Ewans, L. J. et al. Whole-exome sequencing reanalysis at 12 months boosts indicated otherwise in a credit line to the material. If material is not included in the diagnosis and is cost-effective when applied early in Mendelian disorders. Genet. article’s Creative Commons license and your intended use is not permitted by statutory Med. 20, 1564–1574 (2018). regulation or exceeds the permitted use, you will need to obtain permission directly 92. Wright, C. F. et al. Making new genetic diagnoses with old data: iterative rea- from the copyright holder. To view a copy of this license, visit http://creativecommons. nalysis and reporting from genome-wide data in 1,133 families with develop- org/licenses/by/4.0/. mental disorders. Genet. Med. 20, 1216–1223 (2018). 93. Machini, K. et al. Analyzing and reanalyzing the genome: findings from the MedSeq project. Am. J. Hum. Genet. 105, 177–188 (2019). © The Author(s) 2022 Published in partnership with CEGMR, King Abdulaziz University npj Genomic Medicine (2022) 27

Use Quizgecko on...
Browser
Browser