Analyses of non-coding somatic drivers in 2,658 cancer whole genomes

PCAWG Consortium, PCAWG Drivers and Functional Interpretation Working Group, PCAWG Structural Variation Working Group

Research output: Contribution to journalArticlepeer-review

386 Scopus citations

Abstract

The discovery of drivers of cancer has traditionally focused on protein-coding genes1–4. Here we present analyses of driver point mutations and structural variants in non-coding regions across 2,658 genomes from the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium5 of the International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA). For point mutations, we developed a statistically rigorous strategy for combining significance levels from multiple methods of driver discovery that overcomes the limitations of individual methods. For structural variants, we present two methods of driver discovery, and identify regions that are significantly affected by recurrent breakpoints and recurrent somatic juxtapositions. Our analyses confirm previously reported drivers6,7, raise doubts about others and identify novel candidates, including point mutations in the 5′ region of TP53, in the 3′ untranslated regions of NFKBIZ and TOB1, focal deletions in BRD4 and rearrangements in the loci of AKR1C genes. We show that although point mutations and structural variants that drive cancer are less frequent in non-coding genes and regulatory sequences than in protein-coding genes, additional examples of these drivers will be found as more cancer genomes become available.

Original languageEnglish
Pages (from-to)102-111
Number of pages10
JournalNature
Volume578
Issue number7793
DOIs
StatePublished - 6 Feb 2020

Bibliographical note

Publisher Copyright:
© 2020, The Author(s).

Funding

Competing interests The following authors declare that they have competing interests. P.B. receives grant funding from Novartis from an unrelated project; R.B. owns equity in Ampressa Therapeutics and receives grant funding from Novartis; G.G. receives research funds from IBM and Pharmacyclics and is an inventor on patent applications related to MuTect, ABSOLUTE, MutSig, MSMuTect, MSMutSig and POLYSOLVER; B.J.R. is a consultant at and has ownership interest (including stock, patents and so on) in Medley Genomics; O.S. is currently an employee of Cedilla Therapeutics); and Y.L. is currently an employee of Seven Bridges Genomics. Acknowledgements We thank the ICGC and TCGA PCAWG Network and the PCAWG steering committee for enabling this work, and for guidance throughout the study. We thank K. Kübler for assistance with meta-cohort generation and R. Heller for discussion on FDR. We are grateful to the PCAWG steering committee, M. Meyerson and E. S. Lander for helpful feedback, and M. Miller for editing this manuscript. Work in the Getz laboratory was partially funded by the GDAC grants (NIH U24CA143845 and NIH U24CA210999), G.G.’s funds at the Broad Institute and MGH. G.G. is also partially supported by the Paul C. Zamecnik Chair in Oncology in MGH. J.S.P. was partially funded by Independent Research Fund Denmark (12-126439 and 7016-00379) and The Danish Cancer Society (R124-A7869). R.B. received funds from the National Institutes of Health (U54CA143798, R01CA188228, R35GM127029, and R01CA215489), the DFCI-Novartis Drug Discovery Program, the Pediatric Low Grade Astrocytoma Foundation, the Cure Starts Now Foundation and The Fund for Innovation in Cancer Informatics. J.W. was partly funded by Independent Research Fund Denmark (4183-00233B and 8020-00282B) and Danish Cancer Society (R147-Rp12977). N.L.-B. acknowledges funding from the European Research Council consolidator grant 682398) and Spanish Ministry of Economy and Competitiveness (SAF2015-66084-R, MINECO/FEDER, UE). We acknowledge the contributions of the many clinical networks across ICGC and TCGA who provided samples and data to the PCAWG Consortium, and the contributions of the Technical Working Group and the Germline Working Group of the PCAWG Consortium for collation, realignment and harmonized variant calling of the cancer genomes used in this study. We thank the patients and their families for their participation in the individual ICGC and TCGA projects.

FundersFunder number
DFCI-Novartis Drug Discovery Program
GDAC
MINECO/FEDER
MSMutSig
Spanish Ministry of Economy and CompetitivenessSAF2015-66084-R
National Institutes of HealthU24CA210999, R01CA215489, R35GM127029, R01CA188228, U54CA143798, U24CA143845
National Human Genome Research InstituteR01HG007069
International Business Machines Corporation
Massachusetts General Hospital
Pediatric Low Grade Astrocytoma Foundation
Cure Starts Now Foundation4183-00233B, 8020-00282B, R147-Rp12977
Kræftens BekæmpelseR124-A7869
Broad Institute
European Commission682398
Danmarks Frie Forskningsfond12-126439, 7016-00379

    Fingerprint

    Dive into the research topics of 'Analyses of non-coding somatic drivers in 2,658 cancer whole genomes'. Together they form a unique fingerprint.

    Cite this