Abstract
Protein-protein interactions are facilitated by a myriad of residue-residue contacts on the interacting proteins. Identifying the site of interaction in the protein is a key for deciphering its functional mechanisms, and is crucial for drug development. Many studies indicate that the compositions of contacting residues are unique. Here, we describe a neural network that identifies protein-protein interfaces from sequence. For the most strongly predicted sites (in 34 of 333 proteins), 94% of the predictions were confirmed experimentally. When 70% of our predictions were right, we correctly predicted at least one interaction site in 20% of the complexes (66/333). These results indicate that the prediction of some interaction sites from sequence alone is possible. Incorporating evolutionary and predicted structural information may improve our method. However, even at this early stage, our tool might already assist wet-lab biology.
Original language | English |
---|---|
Pages (from-to) | 236-239 |
Number of pages | 4 |
Journal | FEBS Letters |
Volume | 544 |
Issue number | 1-3 |
DOIs | |
State | Published - 5 Jun 2003 |
Externally published | Yes |
Bibliographical note
Funding Information:Thanks to Rana Samuels (Columbia) for her invaluable comments on the manuscript. Thanks to Jinfeng Liu (Columbia) for computer assistance and to Kaz Wrzeszczynski (Columbia) for crucial insights. Thanks also to the anonymous referee and to Rob Russell (EMBL) for helpful comments. The work was supported by Grants 1-P50-GM62413-01 and RO1-GM63029-01 from the National Institute of Health. Last, but not least, thanks to all those who deposit their experimental data in public databases, and to those who maintain these databases.
Funding
Thanks to Rana Samuels (Columbia) for her invaluable comments on the manuscript. Thanks to Jinfeng Liu (Columbia) for computer assistance and to Kaz Wrzeszczynski (Columbia) for crucial insights. Thanks also to the anonymous referee and to Rob Russell (EMBL) for helpful comments. The work was supported by Grants 1-P50-GM62413-01 and RO1-GM63029-01 from the National Institute of Health. Last, but not least, thanks to all those who deposit their experimental data in public databases, and to those who maintain these databases.
Funders | Funder number |
---|---|
National institute of Health | |
National Institute of General Medical Sciences | R01GM063029 |
Keywords
- Bioinformatics
- Data mining
- Neural network
- Protein function
- Protein structure
- Protein-protein interaction
- Sequence analysis