TY - JOUR
T1 - Evaluation of novel computational methods to identify RNA-binding protein footprints from structural data
AU - Mizrahi, Orel
AU - Corley, Meredith
AU - Feldman, Ori
AU - Fröhlking, Thorben
AU - Sun, Lei
AU - Ziesel, Alison
AU - Antczak, Maciej
AU - Bernetti, Mattia
AU - Elhajjajy, Shaimae I.
AU - Huang, Wenze
AU - Nguyen, Grady G.
AU - Park, Samuel S.
AU - Perez Martell, Raul I.
AU - Trinity, Luke
AU - Xu, Kui
AU - Zok, Tomasz
AU - Bussi, Giovanni
AU - Jabbari, Hosna
AU - Orenstein, Yaron
AU - Aviran, Sharon
AU - Meyer, Michelle M.
AU - Yeo, Gene W.
N1 - Publisher Copyright:
© 2025 Mizrahi et al.
PY - 2025/7/16
Y1 - 2025/7/16
N2 - RNA-binding proteins (RBP) play diverse roles in mRNA processing and function. However, from thousands of RBPs encoded in the human genome, a detailed molecular understanding of their interactions with RNA is available only for a small fraction. In most cases, our knowledge of the combination of RNA sequence and structure required for specific RBP binding is insufficient for accurately predicting binding sites transcriptome-wide. In this context, the rapidly expanding collection of transcriptomic data sets that map distinct, yet intertwined posttranscriptional marks, such as RNA structure and RBP binding, presents an opportunity for integrative analysis to better characterize RBP binding. A grand challenge faced by our community is that relatively little information on the secondary structure context within and near RBP-binding sites has been gleaned from integrating such data sets, partially due to lack of suitable computational methods. To engage scientists from diverse backgrounds in addressing this gap, the RNA Society organized the RBP Footprint Grand Challenge in 2021, an international community effort to develop new methods or leverage existing ones for predicting RBP-binding sites through analysis of a growing volume of sequence, structure, and binding data and to experimentally validate select predictions. Here, we report the initiative, analyses, and methods developed by the participants, validation results, and five new in vivo binding data sets generated for validation. We hope our work will inspire additional innovation in computational methods, further utilization of available data resources, and future endeavors to engage the community in collaborating toward closing other critical data-analysis gaps.
AB - RNA-binding proteins (RBP) play diverse roles in mRNA processing and function. However, from thousands of RBPs encoded in the human genome, a detailed molecular understanding of their interactions with RNA is available only for a small fraction. In most cases, our knowledge of the combination of RNA sequence and structure required for specific RBP binding is insufficient for accurately predicting binding sites transcriptome-wide. In this context, the rapidly expanding collection of transcriptomic data sets that map distinct, yet intertwined posttranscriptional marks, such as RNA structure and RBP binding, presents an opportunity for integrative analysis to better characterize RBP binding. A grand challenge faced by our community is that relatively little information on the secondary structure context within and near RBP-binding sites has been gleaned from integrating such data sets, partially due to lack of suitable computational methods. To engage scientists from diverse backgrounds in addressing this gap, the RNA Society organized the RBP Footprint Grand Challenge in 2021, an international community effort to develop new methods or leverage existing ones for predicting RBP-binding sites through analysis of a growing volume of sequence, structure, and binding data and to experimentally validate select predictions. Here, we report the initiative, analyses, and methods developed by the participants, validation results, and five new in vivo binding data sets generated for validation. We hope our work will inspire additional innovation in computational methods, further utilization of available data resources, and future endeavors to engage the community in collaborating toward closing other critical data-analysis gaps.
KW - RNA structure probing
KW - RNA-binding protein
KW - SHAPE
KW - bioinformatics
KW - community initiative
KW - integrative transcriptome analysis
UR - https://www.scopus.com/pages/publications/105011849220
U2 - 10.1261/rna.080215.124
DO - 10.1261/rna.080215.124
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 40399037
AN - SCOPUS:105011849220
SN - 1355-8382
VL - 31
SP - 1103
EP - 1124
JO - RNA
JF - RNA
IS - 8
ER -