Automatic recognition of the function of singular neuter pronouns in texts and spoken data

Costanza Navarretta*

*Corresponding author for this work

Publication: Contribution to journal; Conference article; Research; peer-reviewed

3 Citations (Scopus)

Abstract

We describe the results of unsupervised (clustering) and supervised (classification) learning experiments aimed at recognising the function of singular neuter pronouns in Danish corpora of written and spoken language. Danish singular neuter pronouns comprise personal and demonstrative pronouns. They are very frequent and have many functions, such as non-referential, cataphoric, deictic and anaphoric. The antecedents of discourse anaphoric singular neuter pronouns can be nominal phrases of different gender and number, verbal phrases, adjectival phrases, clauses or discourse segments of different size, and the pronouns can refer to individual and abstract entities. Danish neuter pronouns occur in more constructions than the corresponding English pronouns it, this and that, and are distributed differently. The results of the classification experiments show a significant improvement in performance over the baseline in all types of data. The best results were obtained on text data, while the worst results were achieved on free-conversational, multi-party dialogues.

Conference

Conference: 7th Discourse Anaphora and Anaphor Resolution Colloquium, DAARC 2009
Country/Territory: India
City: Goa
Period: 05/11/2009 to 06/11/2009
