Searching for protein signatures using a multilevel alphabet

Ronit Hod, Refael Kohen, Yael Mandel-Gutfreund

Research output: Contribution to journalArticlepeer-review

Abstract

Short motifs are known to play diverse roles in proteins, such as in mediating the interactions with other molecules, binding to membranes, or conducting a specific biological function. Standard approaches currently employed to detect short motifs in proteins search for enrichment of amino acid motifs considering mostly the sequence information. Here, we presented a new approach to search for common motifs (protein signatures) which share both physicochemical and structural properties, looking simultaneously at different features. Our method takes as an input an amino acid sequence and translates it to a new alphabet that reflects its intrinsic structural and chemical properties. Using the MEME search algorithm, we identified the proteins signatures within subsets of protein which encompass common sequence and structural information. We demonstrated that we can detect enriched structural motifs, such as the amphipathic helix, from large datasets of linear sequences, as well as predicting common structural properties (such as disorder, surface accessibility, or secondary structures) of known functional-motifs. Finally, we applied the method to the yeast protein interactome and identified novel putative interacting motifs. We propose that our approach can be applied for de novo protein function prediction given either sequence or structural information.

Original languageEnglish
Pages (from-to)1058-1068
Number of pages11
JournalProteins: Structure, Function and Bioinformatics
Volume81
Issue number6
DOIs
StatePublished - Jun 2013

Keywords

  • Motif search
  • Multilevel alphabet
  • Protein disorder
  • Secondary structure
  • Surface accessibility

All Science Journal Classification (ASJC) codes

  • Structural Biology
  • Biochemistry
  • Molecular Biology

Fingerprint

Dive into the research topics of 'Searching for protein signatures using a multilevel alphabet'. Together they form a unique fingerprint.

Cite this