Sentiment Classification of Historical Danish and Norwegian Literary Texts

Publication: Contribution to book/anthology/report; Conference contribution in proceedings; Research; Peer-reviewed


Abstract

Sentiment classification is valuable for literary analysis, as sentiment is crucial in literary narratives. It can, for example, be used to investigate a hypothesis in the literary analysis of 19th-century Scandinavian novels that the writing of female authors in this period was characterized by negative sentiment, as this paper shows. In order to enable a data-driven analysis of this hypothesis, we create a manually annotated dataset of sentence-level sentiment annotations for novels from this period and use it to train and evaluate various sentiment classification methods. We find that pre-trained multilingual language models outperform models trained on modern Danish, as well as classifiers based on lexical resources. Finally, in the classifier-assisted corpus analysis, we both confirm and contest the literary hypothesis and further shed light on the temporal development of the trend. Our dataset and trained models will be useful for future analysis of historical Danish and Norwegian literary texts.
Original language: English
Title of host publication: Proceedings of the 24th Nordic Conference on Computational Linguistics (NoDaLiDa)
Publisher: Association for Computational Linguistics (ACL)
Publication date: May 2023
Pages: 324–334
Status: Published - May 2023
Event: NoDaLiDa 2023: The 24th Nordic Conference on Computational Linguistics - Faroe Islands, Tórshavn, Denmark
Duration: 22 May 2023 to 24 May 2023
Conference number: 24
https://www.nodalida2023.fo/

Conference

Conference: NoDaLiDa 2023
Number: 24
Location: Faroe Islands
Country/Territory: Denmark
City: Tórshavn
Period: 22/05/2023 to 24/05/2023