TY - JOUR
T1 - De-novo protein function prediction using DNA binding and RNA binding proteins as a test case
AU - Peled, Sapir
AU - Leiderman, Olga
AU - Charar, Rotem
AU - Efroni, Gilat
AU - Shav-Tal, Yaron
AU - Ofran, Yanay
N1 - Publisher Copyright:
© The Author(s) 2016.
PY - 2016/11/21
Y1 - 2016/11/21
N2 - Of the currently identified protein sequences, 99.6% have never been observed in the laboratory as proteins and their molecular function has not been established experimentally. Predicting the function of such proteins relies mostly on annotated homologs. However, this has resulted in some erroneous annotations, and many proteins have no annotated homologs. Here we propose a de-novo function prediction approach based on identifying biophysical features that underlie function. Using our approach, we discover DNA and RNA binding proteins that cannot be identified based on homology and validate these predictions experimentally. For example, FGF14, which belongs to a family of secreted growth factors was predicted to bind DNA. We verify this experimentally and also show that FGF14 is localized to the nucleus. Mutating the predicted binding site on FGF14 abrogated DNA binding. These results demonstrate the feasibility of automated de-novo function prediction based on identifying function-related biophysical features.
AB - Of the currently identified protein sequences, 99.6% have never been observed in the laboratory as proteins and their molecular function has not been established experimentally. Predicting the function of such proteins relies mostly on annotated homologs. However, this has resulted in some erroneous annotations, and many proteins have no annotated homologs. Here we propose a de-novo function prediction approach based on identifying biophysical features that underlie function. Using our approach, we discover DNA and RNA binding proteins that cannot be identified based on homology and validate these predictions experimentally. For example, FGF14, which belongs to a family of secreted growth factors was predicted to bind DNA. We verify this experimentally and also show that FGF14 is localized to the nucleus. Mutating the predicted binding site on FGF14 abrogated DNA binding. These results demonstrate the feasibility of automated de-novo function prediction based on identifying function-related biophysical features.
UR - http://www.scopus.com/inward/record.url?scp=84995923266&partnerID=8YFLogxK
U2 - 10.1038/ncomms13424
DO - 10.1038/ncomms13424
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 27869118
SN - 2041-1723
VL - 7
JO - Nature Communications
JF - Nature Communications
M1 - 13424
ER -