Skip to main navigation Skip to search Skip to main content

Machine learning approaches demonstrate that protein structures carry information about their genetic coding

Research output: Contribution to journalArticlepeer-review

Abstract

Synonymous codons translate into the same amino acid. Although the identity of synonymous codons is often considered inconsequential to the final protein structure, there is mounting evidence for an association between the two. Our study examined this association using regression and classification models, finding that codon sequences predict protein backbone dihedral angles with a lower error than amino acid sequences, and that models trained with true dihedral angles have better classification of synonymous codons given structural information than models trained with random dihedral angles. Using this classification approach, we investigated local codon–codon dependencies and tested whether synonymous codon identity can be predicted more accurately from codon context than amino acid context alone, and most specifically which codon context position carries the most predictive power.

Original languageEnglish GB
Article number21968
JournalScientific Reports
Volume12
Issue number1
DOIs
StatePublished - Dec 2022

ASJC Scopus subject areas

  • General

Fingerprint

Dive into the research topics of 'Machine learning approaches demonstrate that protein structures carry information about their genetic coding'. Together they form a unique fingerprint.

Cite this