CLARIN-DK – status and challenges

    Research output: Chapter in Book/Report/Conference proceedingArticle in proceedingsResearchpeer-review

    Abstract

    The initiative CLARIN-DK (starting as a Danish preparatory DK-CLARIN project) is a part of the Danish research infrastructure initiative, DIGHUMLAB. In this paper the aims, status, and the current challenges for CLARIN-DK are presented. CLARIN-DK focuses on written and spoken language resources, multimodal resources and tools, and involving users is a core issue. Users involved in a preparatory project gave input that led to the current user interface of the resource repository website, clarin.dk. Clarin.dk is now in the transition phase from a repository to a research infrastructure, where researchers and students can be supported in their research, education and studies. Clarin.dk works with a Service-Oriented Architecture (SOA), uses eSciDoc and Fedora Commons, and is primarily based on open source solutions. A key issue in CLARIN-DK is using standards such as TEIP5, IMDI, OLAC, and CMDI for resource metadata. Optional metadata fields suggested by users have been included when it could comply with the standards, allowing for the diversity needed when describing the research material. Current work includes normalising metadata naming in the search pages, and making search more user-friendly by adding selectable pick-lists for query values. Also a consolidation of metadata quality is currently performed by changing some metadata values to a more harmonized set of values. All deposited metadata are maintained. Clarin.dk will apply for assessment as a CLARIN ERIC B centre in 2013 enforcing the sustainability and persistency of the infrastructure. Clarin.dk has already joined the national identity federation WAYF, implemented SSL-certificates, and offers harvesting of metadata via OAI-PMH as part of the CLARIN centre requirements.
    Original languageEnglish
    Title of host publicationProceedings of the workshop on Nordic language research infrastructure at NODALIDA 2013
    Number of pages12
    Place of PublicationLinköpings universitet
    PublisherLinköping University Electronic Press
    Publication date2013
    Pages21-32
    ISBN (Electronic)1650-3740
    Publication statusPublished - 2013
    EventNODALIDA 2013 Workshop on Nordic language research infrastructure - University of Oslo, Oslo, Norway
    Duration: 22 May 201322 May 2013

    Workshop

    WorkshopNODALIDA 2013 Workshop on Nordic language research infrastructure
    LocationUniversity of Oslo
    Country/TerritoryNorway
    CityOslo
    Period22/05/201322/05/2013
    SeriesNEALT Proceedings Series
    Number20
    ISSN1736-6305

    Cite this