Abstract
Self-supervised learning (SSL) has gained prominence due to the increasing availability of unlabeled data and advances in computational efficiency, leading to revolutionized natural language processing with pre-trained language models like BERT and GPT.Representation learning, a core concept in SSL, aims to reduce data dimensionality while preserving meaningful aspects.Conventional SSL methods typically embed data in Euclidean space.However, recent research has revealed that alternative geometries can hold even richer representations, unlocking more meaningful insights from the data.Motivated by this, we propose two novel methods for integrating Hilbert geometry into self-supervised learning for efficient document embedding.First, we present a method directly incorporating Hilbert geometry into the standard Euclidean contrastive learning framework.Additionally, we propose a multi-view hyperbolic contrastive learning framework contrasting both documents and paragraphs.Our findings demonstrate that contrasting only paragraphs, rather than entire documents, can lead to superior efficiency and effectiveness.
Original language | English |
---|---|
Title of host publication | ECAI 2024 - 27th European Conference on Artificial Intelligence, Including 13th Conference on Prestigious Applications of Intelligent Systems, PAIS 2024, Proceedings |
Editors | Ulle Endriss, Francisco S. Melo, Kerstin Bach, Alberto Bugarin-Diz, Jose M. Alonso-Moral, Senen Barro, Fredrik Heintz |
Number of pages | 8 |
Publisher | IOS Press BV |
Publication date | 2024 |
Pages | 1656-1663 |
ISBN (Electronic) | 9781643685489 |
DOIs | |
Publication status | Published - 2024 |
Event | 27th European Conference on Artificial Intelligence, ECAI 2024 - Santiago de Compostela, Spain Duration: 19 Oct 2024 → 24 Oct 2024 |
Conference
Conference | 27th European Conference on Artificial Intelligence, ECAI 2024 |
---|---|
Country/Territory | Spain |
City | Santiago de Compostela |
Period | 19/10/2024 → 24/10/2024 |
Sponsor | et al., Google Cloud, Huawei, IBM, INDITEXTECH, sdg group |
Series | Frontiers in Artificial Intelligence and Applications |
---|---|
Volume | 392 |
ISSN | 0922-6389 |
Bibliographical note
Publisher Copyright:© 2024 The Authors.