Hyperbolic Contrastive Learning for Document Representations - A Multi-View Approach with Paragraph-Level Similarities

Jaeeun Nam, Ilias Chalkidis, Mina Rezaei*

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

5 Downloads (Pure)

Abstract

Self-supervised learning (SSL) has gained prominence due to the increasing availability of unlabeled data and advances in computational efficiency, leading to revolutionized natural language processing with pre-trained language models like BERT and GPT.Representation learning, a core concept in SSL, aims to reduce data dimensionality while preserving meaningful aspects.Conventional SSL methods typically embed data in Euclidean space.However, recent research has revealed that alternative geometries can hold even richer representations, unlocking more meaningful insights from the data.Motivated by this, we propose two novel methods for integrating Hilbert geometry into self-supervised learning for efficient document embedding.First, we present a method directly incorporating Hilbert geometry into the standard Euclidean contrastive learning framework.Additionally, we propose a multi-view hyperbolic contrastive learning framework contrasting both documents and paragraphs.Our findings demonstrate that contrasting only paragraphs, rather than entire documents, can lead to superior efficiency and effectiveness.

Original languageEnglish
Title of host publicationECAI 2024 - 27th European Conference on Artificial Intelligence, Including 13th Conference on Prestigious Applications of Intelligent Systems, PAIS 2024, Proceedings
EditorsUlle Endriss, Francisco S. Melo, Kerstin Bach, Alberto Bugarin-Diz, Jose M. Alonso-Moral, Senen Barro, Fredrik Heintz
Number of pages8
PublisherIOS Press BV
Publication date2024
Pages1656-1663
ISBN (Electronic)9781643685489
DOIs
Publication statusPublished - 2024
Event27th European Conference on Artificial Intelligence, ECAI 2024 - Santiago de Compostela, Spain
Duration: 19 Oct 202424 Oct 2024

Conference

Conference27th European Conference on Artificial Intelligence, ECAI 2024
Country/TerritorySpain
CitySantiago de Compostela
Period19/10/202424/10/2024
Sponsoret al., Google Cloud, Huawei, IBM, INDITEXTECH, sdg group
SeriesFrontiers in Artificial Intelligence and Applications
Volume392
ISSN0922-6389

Bibliographical note

Publisher Copyright:
© 2024 The Authors.

Cite this