An Adversarial Scheme for Integrating Multi-modal Data on Protein Function

Rami Nasser, Leah V. Schaffer, Trey Ideker, Roded Sharan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In order to begin to decipher the structure of the cell, we need to integrate multiple types of data of different scales on subcellular organization. Such integration requires dealing with multiple data modalities and with missing data. To this end, we developed MIRAGE, a multi-modal generative model for integrating protein sequence, protein-protein interaction, and protein localization data. Our approach successfully learns a joint embedding space that captures the complex relationships between these diverse modalities. We evaluate our model’s performance against existing methods, obtaining superior performance in several key tasks, including protein function prediction and module detection. MIRAGE source code is available at https://github.com/raminass/MIRAGE. A full manuscript describing MIRAGE and its applications is available at https://www.biorxiv.org/content/10.1101/2025.01.16.633332.

Original languageEnglish
Title of host publicationResearch in Computational Molecular Biology - 29th International Conference, RECOMB 2025, Proceedings
EditorsSriram Sankararaman
PublisherSpringer Science and Business Media Deutschland GmbH
Pages264-267
Number of pages4
ISBN (Print)9783031902512
DOIs
StatePublished - 2025
Event29th International Conference on Research in Computational Molecular Biology, RECOMB 2025 - Seoul, Korea, Republic of
Duration: 26 Apr 202529 Apr 2025

Publication series

NameLecture Notes in Computer Science
Volume15647 LNBI

Conference

Conference29th International Conference on Research in Computational Molecular Biology, RECOMB 2025
Country/TerritoryKorea, Republic of
CitySeoul
Period26/04/2529/04/25

All Science Journal Classification (ASJC) codes

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'An Adversarial Scheme for Integrating Multi-modal Data on Protein Function'. Together they form a unique fingerprint.

Cite this