Skip to main navigation Skip to search Skip to main content

Cognition-Supervised Saliency Detection: Contrasting EEG Signals and Visual Stimuli

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

4 Citations (Scopus)
16 Downloads (Pure)

Abstract

Understanding human assessment of semantically salient parts of multimedia content is crucial for developing human-centric applications, such as annotation tools, search and recommender systems, and systems able to generate new media matching human interests. However, the challenge of acquiring suitable supervision signals to detect semantic saliency without extensive manual annotation remains significant. Here, we explore a novel method that utilizes signals measured directly from human cognition via electroencephalogram (EEG) in response to natural visual perception. These signals are used for supervising representation learning to capture semantic saliency. Through a contrastive learning framework, our method aligns EEG data with visual stimuli, capturing human cognitive responses without the need for any manual annotation. Our approach demonstrates that the learned representations closely align with human-centric notions of visual saliency and achieve competitive performance in several downstream tasks. We also introduce an open EEG/image dataset to facilitate research in utilizing cognitive signals for multimodal data analysis and developing models for cross-modal representation learning.

Original languageEnglish
Title of host publicationMM 2024 - Proceedings of the 32nd ACM International Conference on Multimedia
Number of pages10
PublisherAssociation for Computing Machinery, Inc.
Publication date2024
Pages7744-7753
ISBN (Electronic)9798400706868
DOIs
Publication statusPublished - 2024
Event32nd ACM International Conference on Multimedia, MM 2024 - Melbourne, Australia
Duration: 28 Oct 20241 Nov 2024

Conference

Conference32nd ACM International Conference on Multimedia, MM 2024
Country/TerritoryAustralia
CityMelbourne
Period28/10/202401/11/2024
SeriesMM 2024 - Proceedings of the 32nd ACM International Conference on Multimedia

Bibliographical note

Publisher Copyright:
© 2024 Owner/Author.

Keywords

  • contrastive learning
  • eeg
  • generative modeling
  • neuroimaging

Cite this