Quantification of read species behavior within whole genome sequencing of cancer genomes for the stratification and visualization of genomic variation

Dror Hibsh, Kenneth H. Buetow, Gur Yaari, Sol Efroni

Research output: Contribution to journal › Article › peer-review

Abstract

The cancer genome is an abnormal genome, and the ability to monitor its sequence has undergone a technological revolution. Yet prognosis and diagnosis remain expert-based decisions, with only limited abilities to provide machine-based decisions. We introduce a heterogeneity-based method for stratifying and visualizing whole-genome sequencing (WGS) reads. This method uses the heterogeneity within WGS reads to markedly reduce the dimensionality of next-generation sequencing data; it is available through the tool HiBS (Heterogeneity-Based Subclassification), which allows cancer sample classification. We validated HiBS using >200 WGS samples from nine different cancer types from The Cancer Genome Atlas (TCGA). With HiBS, we show progress on two WGS-related issues: (i) differentiation between normal (NB) and tumor (TP) samples based solely on the information structure of their WGS data, and (ii) identification of specific regions of chromosomal amplification/deletion and their association with tumor stage. By comparing results to those obtained through available WGS analysis tools, we demonstrate some of the novelties obtained by the approach implemented in HiBS and also show nearly perfect normal/tumor classification, used to identify known and unknown chromosomal aberrations. Finally, the HiBS index has been associated with breast cancer tumor stage.

Original language: English
Article number: e81
Journal: Nucleic Acids Research
Volume: 44
Issue number: 9
State: Published - 19 May 2016

Bibliographical note

Publisher Copyright:
© 2016 The Author(s).

Funding

Funder: Seventh Framework Programme
Funder number: 285875
