hrvatski jezikClear Cookie - decide language by browser settings

Prediction of Protein-Protein Interaction Sites in Sequences and 3D Structures by Random Forests

Šikić, Mile; Tomić, Sanja; Vlahoviček, Kristian (2009) Prediction of Protein-Protein Interaction Sites in Sequences and 3D Structures by Random Forests. Plos computational biology, 5 (1). e1000278-1. ISSN 1553-734X

[img]
Preview
PDF - Published Version - article
Available under License Creative Commons Attribution.

Download (288kB) | Preview

Abstract

Identifying interaction sites in proteins provides important clues to the function of a protein, and is becoming increasingly relevant in topics like systems biology and drug discovery. Here, a sliding window approach is combined with the Random Forests method to predict protein interaction sites using i) a combination of sequence and structure-derived parameters ; and ii) sequence information alone. For sequence-based prediction we achieved a precision of 84% with a 26% recall and the F-measure of 40%. When combined with structural information, the prediction performance increases to a precision of 76% and a recall of 38% with the F-measure of 51%. We also present an attempt to rationalize the sliding window size and demonstrate that a nine-residue window is the most suitable for predictor construction. Finally, we demonstrate the applicability of our prediction methods by modelling the Ras-Raf complex using predicted interaction sites as target binding interfaces. Our results suggest that it is possible to predict protein interaction sites with quite a high accuracy using only sequence information.

Item Type: Article
Uncontrolled Keywords: protein – protein interactions ; sequence ; residues ; 3D protein structure ; Radnom Forest ; Classification ; protein docking
Subjects: NATURAL SCIENCES > Physics
NATURAL SCIENCES > Biology
TECHNICAL SCIENCES > Computing
Divisions: Division of Organic Chemistry and Biochemistry
Projects:
Project titleProject leaderProject codeProject type
Proučavanje biomakromolekula računalnim metodama i razvoj novih algoritama-Sanja Tomić098-1191344-2860MZOS
Modeliranje kompleksnih sustava-Branko Jeren036-0362214-1987MZOS
Računalna genomika mikrobnih okoliša i bioinformatika ekstremofila-Kristian Vlahoviček119-0982913-1211MZOS
Depositing User: Sanja Tomić
Date Deposited: 02 Aug 2017 14:23
Last Modified: 02 Aug 2017 14:25
URI: http://fulir.irb.hr/id/eprint/3516
DOI: 10.1371/journal.pcbi.1000278

Actions (login required)

View Item View Item

Downloads

Downloads per month over past year