Machine learning approaches demonstrate that protein structures carry information about their genetic coding

Linor Ackerman-Schraier, Aviv A. Rosenberg, Ailie Marx, Alex M. Bronstein

Research output: Contribution to journal › Article › peer-review

Abstract

Synonymous codons translate into the same amino acid. Although the identity of synonymous codons is often considered inconsequential to the final protein structure, there is mounting evidence for an association between the two. Our study examined this association using regression and classification models. We found that codon sequences predict protein backbone dihedral angles with lower error than amino acid sequences do, and that models trained with true dihedral angles classify synonymous codons from structural information better than models trained with random dihedral angles. Using this classification approach, we investigated local codon–codon dependencies, testing whether synonymous codon identity can be predicted more accurately from codon context than from amino acid context alone and, more specifically, which codon context position carries the most predictive power.
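To make the abstract's starting point concrete, the following minimal sketch shows that a codon sequence carries strictly more information than the amino acid sequence it encodes: two different DNA sequences can translate to the same peptide. It is an illustration only and not the authors' code. It assumes the standard genetic code (NCBI translation table 1), and the two example sequences and the helper name `translate` are hypothetical.

```python
from collections import defaultdict
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order at each of the three positions. '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Group codons by the amino acid they encode: each group is a synonymous set.
SYNONYMOUS = defaultdict(list)
for codon, aa in CODON_TABLE.items():
    SYNONYMOUS[aa].append(codon)


def translate(dna: str) -> str:
    """Translate a DNA coding sequence codon by codon (hypothetical helper)."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Two made-up sequences that differ at every codon but encode the same peptide.
seq_a = "CTGGCTAAA"
seq_b = "TTAGCCAAG"

print(translate(seq_a), translate(seq_b))
print("Leucine codons:", SYNONYMOUS["L"])
```

A model that sees only the amino acid sequence cannot distinguish `seq_a` from `seq_b`, whereas a model that sees codons can. That extra information is what the comparison between codon context and amino acid context in the abstract depends on.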

Original language: English
Article number: 21968
Journal: Scientific Reports
Volume: 12
Issue number: 1
State: Published - Dec 2022

All Science Journal Classification (ASJC) codes

  • General
