Using deep neural networks to disentangle visual and semantic information in human perception and memory

Adva Shoham, Idan Daniel Grosbard, Or Patashnik, Daniel Cohen-Or, Galit Yovel

Research output: Contribution to journalArticlepeer-review

Abstract

Mental representations of familiar categories are composed of visual and semantic information. Disentangling the contributions of visual and semantic information in humans is challenging because they are intermixed in mental representations. Deep neural networks that are trained either on images or on text or by pairing images and text enable us now to disentangle human mental representations into their visual, visual–semantic and semantic components. Here we used these deep neural networks to uncover the content of human mental representations of familiar faces and objects when they are viewed or recalled from memory. The results show a larger visual than semantic contribution when images are viewed and a reversed pattern when they are recalled. We further reveal a previously unknown unique contribution of an integrated visual–semantic representation in both perception and memory. We propose a new framework in which visual and semantic information contribute independently and interactively to mental representations in perception and memory.

Original languageEnglish
Pages (from-to)702-717
Number of pages16
JournalNature Human Behaviour
Volume8
Issue number4
DOIs
StatePublished - Apr 2024

All Science Journal Classification (ASJC) codes

  • Social Psychology
  • Experimental and Cognitive Psychology
  • Behavioral Neuroscience

Fingerprint

Dive into the research topics of 'Using deep neural networks to disentangle visual and semantic information in human perception and memory'. Together they form a unique fingerprint.

Cite this