CLARIN Café: Data Citation With CLARIN (Part I)

4.6.2026 15.00-16.30

General Information

In this CLARIN Café, we present the recent CLARIN Dataset Citation Guidelines on how CLARIN language resources and technologies, such as language corpora, datasets, and tools, should be properly cited in scholarly work. We discuss how the CLARIN guidelines were developed and how they differ from other existing recommendations, such as FORCE11’s Joint Declaration of Data Citation Principles. We will provide use cases showing how the citations of certain CLARIN corpora and tools can be made to conform to the guidelines, with a focus on potentially problematic examples. We then present how CLARIN repositories support dataset citation in the form of citation instructions and discuss the citation of dynamic datasets. We then survey how members of CLARIN have thus far cited resources in abstracts published in past CLARIN proceedings. We discuss how citation instructions are presented in the author guidelines of prominent journals that focus on the creation of language resources.

Finally, we open the floor to discussion with the audience. Drawing on participants' backgrounds and work experience, we will exchange views on use cases and bottlenecks.

We plan to organise a follow-up Café on Data Citation later in the year with representatives from scientific journals and sister research infrastructures, to continue the discussion and gain insights into best practices and limiting factors in data citation.

Target Audiences

All are welcome, though the Café may be particularly relevant to researchers in linguistics and language sciences, digital humanities, LT, social sciences, speech technology, and history, as well as practitioners in the sector and in data science.

How to Join

Registration is free. Please register using this link to receive the meeting room details.

To help us tailor the Café, please provide details about your own background and your questions upon registration.

Programme

  • Welcome
  • Presentation of CLARIN ERIC citation guidelines (Jakob Lenardič, 15 min)
  • The (in)visibility of citation instructions across CLARIN repositories (Jakob Lenardič, 5 min)
  • The citation of CLARIN datasets in practice (Jakob Lenardič, 10 min)
  • Citation instructions in big name journals and conferences (Jakob Lenardič, 5 min)
  • Support for dynamic dataset citation (Martin Matthiesen, 10 min)
  • Audience Q&A (all, 40 min)
  • Wrap-Up

Details

Organiser

Venue

  • Online event