Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding

Daniel D. Le, Tyler C. Shimko, Arjun K. Aditham, Allison M. Keys, Scott A. Longwell, Yaron Orenstein, Polly M. Fordyce

Research output: Contribution to journalArticlepeer-review

49 Scopus citations

Abstract

Transcription factors (TFs) are primary regulators of gene expression in cells, where they bind specific genomic target sites to control transcription. Quantitative measurements of TF-DNA binding energies can improve the accuracy of predictions of TF occupancy and downstream gene expression in vivo and shed light on how transcriptional networks are rewired throughout evolution. Here, we present a sequencing-based TF binding assay and analysis pipeline (BET-seq, for Binding Energy Topography by sequencing) capable of providing quantitative estimates of binding energies for more than one million DNA sequences in parallel at high energetic resolution. Using this platform, we measured the binding energies associated with all possible combinations of 10 nucleotides flanking the known consensus DNA target interacting with two model yeast TFs, Pho4 and Cbf1. A large fraction of these flanking mutations change overall binding energies by an amount equal to or greater than consensus site mutations, suggesting that current definitions of TF binding sites may be too restrictive. By systematically comparing estimates of binding energies output by deep neural networks (NNs) and biophysical models trained on these data, we establish that dinucleotide (DN) specificities are sufficient to explain essentially all variance in observed binding behavior, with Cbf1 binding exhibiting significantly more nonadditivity than Pho4. NN-derived binding energies agree with orthogonal biochemical measurements and reveal that dynamically occupied sites in vivo are both energetically and mutationally distant from the highest affinity sites.

Original languageEnglish
Pages (from-to)E3702-E3711
JournalProceedings of the National Academy of Sciences of the United States of America
Volume115
Issue number16
DOIs
StatePublished - 17 Apr 2018
Externally publishedYes

Bibliographical note

Publisher Copyright:
© 2018 National Academy of Sciences. All rights reserved.

Funding

We thank Justin Kinney, Hua Tang, Anshul Kundaje, Daniel Herschlag, and Rhiju Das for helpful discussions. T.C.S. and A.K.A. acknowledge NSF Graduate Research Fellowships; A.K.A. acknowledges the Stanford ChEM-H (Chemistry, Engineering, and Medicine for Human Health) Predoctoral Training Program. This work was supported by NIH/National Institute of General Medical Sciences Grant R00GM09984804. P.M.F. is a Chan Zuckerberg Biohub Investigator and acknowledges Sloan Research Foundation and McCormick and Gabilan faculty fellowships. ACKNOWLEDGMENTS. We thank Justin Kinney, Hua Tang, Anshul Kundaje, Daniel Herschlag, and Rhiju Das for helpful discussions. T.C.S. and A.K.A. acknowledge NSF Graduate Research Fellowships; A.K.A. acknowledges the Stanford ChEM-H (Chemistry, Engineering, and Medicine for Human Health) Predoctoral Training Program. This work was supported by NIH/National Institute of General Medical Sciences Grant R00GM09984804. P.M.F. is a Chan Zuckerberg Biohub Investigator and acknowledges Sloan Research Foundation and McCormick and Gabilan faculty fellowships.

FundersFunder number
McCormick and Gabilan faculty fellowships
Medicine for Human Health
NIH/National Institute of General Medical Sciences
Sloan Research Foundation
National Science Foundation
National Eye Institute
National Institute of General Medical SciencesR00GM099848
McCormick Foundation
National Academies of Sciences, Engineering, and Medicine

    Keywords

    • Microfluidics
    • Protein-DNA binding
    • Transcription factor binding
    • Transcription factor pecificity
    • Transcriptional regulation

    Fingerprint

    Dive into the research topics of 'Comprehensive, high-resolution binding energy landscapes reveal context dependencies of transcription factor binding'. Together they form a unique fingerprint.

    Cite this