RAP: Accurate and fast motif finding based on protein-binding microarray data

Yaron Orenstein, Eran Mick, Ron Shamir

Research output: Contribution to journalArticlepeer-review

9 Scopus citations


The novel high-throughput technology of protein-binding microarrays (PBMs) measures binding intensity of a transcription factor to thousands of DNA probe sequences. Several algorithms have been developed to extract binding-site motifs from these data. Such motifs are commonly represented by positional weight matrices. Previous studies have shown that the motifs produced by these algorithms are either accurate in predicting in vitro binding or similar to previously published motifs, but not both. In this work, we present a new simple algorithm to infer binding-site motifs from PBM data. It outperforms prior art both in predicting in vitro binding and in producing motifs similar to literature motifs. Our results challenge previous claims that motifs with lower information content are better models for transcription-factor binding specificity. Moreover, we tested the effect of motif length and side positions flanking the "core" motif in the binding site. We show that side positions have a significant effect and should not be removed, as commonly done. A large drop in the results quality of all methods is observed between in vitro and in vivo binding prediction. The software is available on acgt.cs.tau.ac.il/rap.

Original languageEnglish
Pages (from-to)375-382
Number of pages8
JournalJournal of Computational Biology
Issue number5
StatePublished - 1 May 2013
Externally publishedYes

Bibliographical note

Funding Information:
We thank R. Stone for providing the CCD I-band images from the Lick 1 m telescope, and B. Schaeffer and W. Harrison for assistance with the observing at Keck. This work was partially supported by NSF award 9018527.


  • Motif finding
  • Protein-binding microarray
  • Protein-binding site


Dive into the research topics of 'RAP: Accurate and fast motif finding based on protein-binding microarray data'. Together they form a unique fingerprint.

Cite this