Inferring expressed genes by whole-genome sequencing of plasma DNA

Peter Ulz, Gerhard G Thallinger, Martina Auer, Ricarda Graf, Karl Kashofer, Stephan W Jahn, Luca Abete, Gunda Pristauz, Edgar Petru, Jochen B Geigl, Ellen Heitzer, Michael R Speicher

Research output: Contribution to journalArticleResearchpeer-review

Abstract

The analysis of cell-free DNA (cfDNA) in plasma represents a rapidly advancing field in medicine. cfDNA consists predominantly of nucleosome-protected DNA shed into the bloodstream by cells undergoing apoptosis. We performed whole-genome sequencing of plasma DNA and identified two discrete regions at transcription start sites (TSSs) where nucleosome occupancy results in different read depth coverage patterns for expressed and silent genes. By employing machine learning for gene classification, we found that the plasma DNA read depth patterns from healthy donors reflected the expression signature of hematopoietic cells. In patients with cancer having metastatic disease, we were able to classify expressed cancer driver genes in regions with somatic copy number gains with high accuracy. We were able to determine the expressed isoform of genes with several TSSs, as confirmed by RNA-seq analysis of the matching primary tumor. Our analyses provide functional information about cells releasing their DNA into the circulation.

Original languageEnglish
Pages (from-to)1273-8
Number of pages6
JournalNature genetics
Volume48
Issue number10
DOIs
Publication statusPublished - Oct 2016

Fingerprint

DNA Sequence Analysis
Genome
DNA
Nucleosomes
Genes
Transcription Initiation Site
Neoplasm Genes
Neoplasms
Protein Isoforms
Medicine
Tissue Donors
RNA
Apoptosis

Keywords

  • Journal Article

Cite this

Ulz, P., Thallinger, G. G., Auer, M., Graf, R., Kashofer, K., Jahn, S. W., ... Speicher, M. R. (2016). Inferring expressed genes by whole-genome sequencing of plasma DNA. Nature genetics, 48(10), 1273-8. https://doi.org/10.1038/ng.3648

Inferring expressed genes by whole-genome sequencing of plasma DNA. / Ulz, Peter; Thallinger, Gerhard G; Auer, Martina; Graf, Ricarda; Kashofer, Karl; Jahn, Stephan W; Abete, Luca; Pristauz, Gunda; Petru, Edgar; Geigl, Jochen B; Heitzer, Ellen; Speicher, Michael R.

In: Nature genetics, Vol. 48, No. 10, 10.2016, p. 1273-8.

Research output: Contribution to journalArticleResearchpeer-review

Ulz, P, Thallinger, GG, Auer, M, Graf, R, Kashofer, K, Jahn, SW, Abete, L, Pristauz, G, Petru, E, Geigl, JB, Heitzer, E & Speicher, MR 2016, 'Inferring expressed genes by whole-genome sequencing of plasma DNA' Nature genetics, vol. 48, no. 10, pp. 1273-8. https://doi.org/10.1038/ng.3648
Ulz, Peter ; Thallinger, Gerhard G ; Auer, Martina ; Graf, Ricarda ; Kashofer, Karl ; Jahn, Stephan W ; Abete, Luca ; Pristauz, Gunda ; Petru, Edgar ; Geigl, Jochen B ; Heitzer, Ellen ; Speicher, Michael R. / Inferring expressed genes by whole-genome sequencing of plasma DNA. In: Nature genetics. 2016 ; Vol. 48, No. 10. pp. 1273-8.
@article{870825fd1f5e4ea1b6f6855d5bb5b4f5,
title = "Inferring expressed genes by whole-genome sequencing of plasma DNA",
abstract = "The analysis of cell-free DNA (cfDNA) in plasma represents a rapidly advancing field in medicine. cfDNA consists predominantly of nucleosome-protected DNA shed into the bloodstream by cells undergoing apoptosis. We performed whole-genome sequencing of plasma DNA and identified two discrete regions at transcription start sites (TSSs) where nucleosome occupancy results in different read depth coverage patterns for expressed and silent genes. By employing machine learning for gene classification, we found that the plasma DNA read depth patterns from healthy donors reflected the expression signature of hematopoietic cells. In patients with cancer having metastatic disease, we were able to classify expressed cancer driver genes in regions with somatic copy number gains with high accuracy. We were able to determine the expressed isoform of genes with several TSSs, as confirmed by RNA-seq analysis of the matching primary tumor. Our analyses provide functional information about cells releasing their DNA into the circulation.",
keywords = "Journal Article",
author = "Peter Ulz and Thallinger, {Gerhard G} and Martina Auer and Ricarda Graf and Karl Kashofer and Jahn, {Stephan W} and Luca Abete and Gunda Pristauz and Edgar Petru and Geigl, {Jochen B} and Ellen Heitzer and Speicher, {Michael R}",
year = "2016",
month = "10",
doi = "10.1038/ng.3648",
language = "English",
volume = "48",
pages = "1273--8",
journal = "Nature genetics",
issn = "1061-4036",
publisher = "Nature Publishing Group",
number = "10",

}

TY - JOUR

T1 - Inferring expressed genes by whole-genome sequencing of plasma DNA

AU - Ulz, Peter

AU - Thallinger, Gerhard G

AU - Auer, Martina

AU - Graf, Ricarda

AU - Kashofer, Karl

AU - Jahn, Stephan W

AU - Abete, Luca

AU - Pristauz, Gunda

AU - Petru, Edgar

AU - Geigl, Jochen B

AU - Heitzer, Ellen

AU - Speicher, Michael R

PY - 2016/10

Y1 - 2016/10

N2 - The analysis of cell-free DNA (cfDNA) in plasma represents a rapidly advancing field in medicine. cfDNA consists predominantly of nucleosome-protected DNA shed into the bloodstream by cells undergoing apoptosis. We performed whole-genome sequencing of plasma DNA and identified two discrete regions at transcription start sites (TSSs) where nucleosome occupancy results in different read depth coverage patterns for expressed and silent genes. By employing machine learning for gene classification, we found that the plasma DNA read depth patterns from healthy donors reflected the expression signature of hematopoietic cells. In patients with cancer having metastatic disease, we were able to classify expressed cancer driver genes in regions with somatic copy number gains with high accuracy. We were able to determine the expressed isoform of genes with several TSSs, as confirmed by RNA-seq analysis of the matching primary tumor. Our analyses provide functional information about cells releasing their DNA into the circulation.

AB - The analysis of cell-free DNA (cfDNA) in plasma represents a rapidly advancing field in medicine. cfDNA consists predominantly of nucleosome-protected DNA shed into the bloodstream by cells undergoing apoptosis. We performed whole-genome sequencing of plasma DNA and identified two discrete regions at transcription start sites (TSSs) where nucleosome occupancy results in different read depth coverage patterns for expressed and silent genes. By employing machine learning for gene classification, we found that the plasma DNA read depth patterns from healthy donors reflected the expression signature of hematopoietic cells. In patients with cancer having metastatic disease, we were able to classify expressed cancer driver genes in regions with somatic copy number gains with high accuracy. We were able to determine the expressed isoform of genes with several TSSs, as confirmed by RNA-seq analysis of the matching primary tumor. Our analyses provide functional information about cells releasing their DNA into the circulation.

KW - Journal Article

U2 - 10.1038/ng.3648

DO - 10.1038/ng.3648

M3 - Article

VL - 48

SP - 1273

EP - 1278

JO - Nature genetics

JF - Nature genetics

SN - 1061-4036

IS - 10

ER -