FIN-CLARIAH Kick-off 3.6.2022

<< FIN-CLARIAH Overview

Posters presented in the kick-off event – Avajaistilaisuudessa esiteltävät posterit

To view or download the PDF version, click on the image.

W1.1 Text processing and annotation environments

Image of the poster W1.1 Text processing and annotation environments
Top of page

W1.2 Speech processing and annotation

Image of the poster W1.2 Speech processing and annotation
Top of page

W1.3 Noise-tolerant NLP

Image of the poster W1.3 Noise-tolerant NLP
Top of page

W2.1 Social Data Science

Image of the poster W2.1 Social Data Science
Top of page

W2.2 Learners’ Assessment Environments

Image of the poster W2.2 Learners' Assessment Environments
Top of page

W2.3 Translation and Interpretation

Image of the poster W2.3 Translation and Interpretation
Top of page

W2.4 Terminology

Image of the poster W2.4 Terminology
Top of page

W2.5 Solutions for better use of language learner performances in research

Image of the poster W2.5 Solutions for better use of language learner performances in research
Top of page

W3.1 Increasingly automated ingestion of material

Image of the poster W3.1 Increasingly automated ingestion of material
Top of page

W3.2 AI solutions to better use of National Archives mass digitisation services

Image of the poster W3.2 AI solutions to better use of National Archives mass digitisation services
Top of page

W3.3 Qualitative survey data

Image of the poster W3.3 Qualitative survey data
Top of page

W3.4 Analysis tools for real-time chats in gameplay streams

Image of the poster W3.4 Analysis tools for real-time chats in gameplay streams
Top of page

W3.5 Text network analysis of political texts

Image of the poster W3.5 Text network analysis of political texts
Top of page

W4.1 Metadata harmonization and analysis

Image of the poster W4.1 Metadata harmonization and analysis
Top of page

W4.2 Linked Open Data Services

Image of the poster W4.2 Linked Open Data Services
Top of page

W4.3 Subsetting and evaluating data

Image of the poster W4.3 Subsetting and evaluating data
Top of page

W4.4 Social media noise (and how to tackle it?)

Image of the poster W4.4 Social media noise (and how to tackle it?)
Top of page

W5.1 & 5.2 Information interaction

Image of the poster W5.1 & 5.2 Information interaction

Top of page

Posters about FIN-CLARIAH

The kick-off get-together of the FIN-CLARIAH infrastructure project is held in the premises of the National Library on 3.6.2022. You can see the posters online on the event page.

Register now for the online course Corpus Linguistics and Statistical Methods

The online course  Corpus Linguistics and Statistical Methods (Korpuslingvistiikka ja tilastolliset menetelmät, 5 credits) will be offered again during 17.1.–6.3.2022. This course can be taken either in Finnish or in English.

The total number of participants will be restricted, but it will be possible to participate the course from outside the University of Helsinki and even from outside Finland. If you are a student from outside the University of Helsinki, please find further details and the link for joining the Moodle area on the course home page (see below). Students from the University of Helsinki should first register via Sisu.

Registration for the course is open until 28.1.2022 (unless the maximum number of participants is exceeded before then).

Home page of the course

 

Find more courses and training by Kielipankki

Donate Speech awarded with Prix Europa: Best European Digital Audio Project of the Year 2021

The Donate Speech campaign, where the Language Bank of Finland has been involved, was awarded with PRIX EUROPA: Best European Digital Audio Project of the Year 2021 (see https://www.prixeuropa.eu/news/2021/10/15winners-y4emh). The award ceremony took place in Potsdam, Germany on 15th October, 2021.

Earlier this year, Donate Speech also won the national Grand One award for Best Mobile Service of the Year, including a distinction for Best Use of Data.

Donate Speech is a joint project of Yle – the Finnish Broadcasting Company, Vake Oy (current Ilmastorahasto), Solita, Aalto University and the University of Helsinki.

 

If you speak and understand Finnish, you can donate your speech here!

University of Helsinki Open Science Award 2021 was granted to the Language Bank of Finland and the Donate Speech campaign

On 29th October 2021, the Language Bank of Finland and the Donate Speech campaign (Lahjoita puhetta) were awarded by the University of Helsinki in recognition of exceptional work in promoting the accessibility and reusability of research data. In addition to the Language Bank, the award was given to Research Coordinator Kati Lassila-Perini.

In the award ceremony, Research Director Krister Lindén gave a presentation that is now available on YouTube with English subtitles. Read more about the award on the website of the University of Helsinki.

Open online course Introduction to Speech Analysis 1.11.-17.12.2021

In this online course, you get a grip of special tools that are available for transcribing and studying speech samples. You also learn about collecting and managing a speech corpus of your own. During the course, you will actively use the Praat program and get familiar with ELAN, too.

The course is open to students in all universities and you can take it either in Finnish or in English. The number of participants may be restricted if required. The course will be taught by Mietta Lennes and Juraj Šimko at the University of Helsinki.

Join the course by 12th November!

Further information and link to the course on Moodle

Join the online course Natural Language Processing for Linguists

The online course Natural Language Processing for Linguists will be taught by Tuomo Hiippala at the University of Helsinki during 15.3.–10.5.2021.

The course is also open to students from universities outside Helsinki, if space allows. Registration is open until 16th March.

Note also that all the course materials will be available online and you can use them even if you cannot make it to the course this time!

Read more and register

 

Welcome to the next Kielipankki Live 14.12.2020 at 13:00

The next Kielipankki Live event will be held on Monday 14th December starting at 13:00 via Zoom. The event will be  in English, but questions are welcome in Finnish as well! The main themes are speech corpora and personal data practices. Join us for the interviews and presentations of special guests and for good discussions! Register preferably by 11the December.

Program and registration details

Suomeksi

European Language Grid (ELG): Introduction and overview

4th Regional ELG Workshop: Finland

15th December 2020, 14.00-16.30
Online event

 

The European Language Grid (ELG) aims to provide a digital marketplace where European companies, organizations and citizens can both offer and efficiently use language technologies, data sets and services. The ELG workshop presents an overview of the ELG platform and the ELG pilot projects. Welcome to see what ELG has to offer for you!

Registration

The workshop is a free online event, but registration is required. Please register via the ELRC website by 10th December. NB: In case you wish to participate in the ELG tutorial session that may be arranged after the workshop, please indicate this in the field for additional information on the registration form. Thanks!

Note that the third ELRC Workshop in Finland will also take place online, in the same virtual room, on the same day at 9.30-12.40. Welcome to participate in both events!

Program (provisional)

14:00Welcome and introduction
14:05ELG Overview
Katrin Marheinecke
14:30ELG online demo
Nils Feldhus
14:50Presentations of Finnish Pilot Projects funded in ELG: PARA4DLM (University of Turku), LSDISCO (Lingsoft); OPUS-MT (University of Helsinki)
15:20Expectations/requirements of Finnish Language Technology providers
Marko Turpeinen, 1001Lakes
15:40Summary and discussion
16:00End of workshop
16:15Tutorial: How to integrate a service into ELG
This tutorial may be organized according to requests from the participants. Please indicate your interest in the registration form.

Last updated: December 7, 2020

The online course Data Clinic 2020-21 will begin soon

This online course can support you with practical issues in managing the research data you need for your MA thesis or PhD project. You can join the course from any university, given that you fulfil the criteria. There is plenty of room left at the moment. Note, however, that the number of participants is restricted and students in the LingDig MA programme at the University of Helsinki have priority.

Read more and register…

 

See all online courses and training

 

Workshop day to be organized together with ELRC and ELG on 15th December 2020

FIN-CLARIN is planning an online event together with ELRC (European Language Resource Coordination) and ELG (European Language Grid), to be organized on 15th December 2020.

Mark your calendars! Further information will be updated on the event page.

Suomeksi

3rd European Language Resource Coordination (ELRC) workshop in Finland

15.12.2020 at 9:30-12:40

Organizers:
The European Language Resource Coordination (ELRC) consortium
Department of Digital Humanities, University of Helsinki

Welcome to the third ELRC workshop in Finland!

Language Technology is shaping our multilingual future. It has already been transforming the way we interact with our devices and with each other, the way we shop, work and travel. More and more it reshapes our interaction with service providers, either public or private. Programs that automatically correct spelling errors and aid sophisticated writing, digital assistants that transform our voices to text messages on mobile phones, bots that answer our calls to the bank or to our social security organisation, systems that automatically translate from a foreign language, and much more, are already empowering our everyday lives, our businesses and our administrations. But can we fully use our own language in our digital interactions? Is our language adequately supported and ready to keep pace with the technological advancements of the AI era?

The third Finnish European Language Resource Coordination (ELRC) workshop will address these questions and it will seek to engage participants in a fruitful discussion on the status and prospects of Language Technology for Finnish. Developers, integrators and users of Language Technology, both from the private and public sector will share experiences, requirements and ways for transforming digital interaction in our multilingual Europe with Language Technologies. Finally, we will discuss how language data, i.e. texts and speech, can fuel development in Artificial Intelligence.

This workshop continues the series of previous ELRC workshops that were organized in Finland on 19.2.2016 and 24.10.2018.

Now in collaboration with European Language Grid (ELG)

This ELRC workshop is organized in collaboration with the European Language Grid (ELG). The 4th Regional ELG Workshop will take place in the afternoon, starting at 14:00. For details, see the ELG workshop page. Welcome to register and attend both events!

Registration

The ELRC workshop is a free event, but registration is required. You can use the same form to register to both the ELRC workshop (morning sessions) and the ELG workshop (afternoon sessions).

Please register via the ELRC website by 10th December. Welcome!

Program (provisional)

09:30 – 09:40

Welcome and introduction (video, pdf)
Krister Lindén, University of Helsinki / FIN-CLARIN

09:40 – 10:00

The potential of Language Technology and AI – where we are, where we should be heading (video, pdf)
Jörg Tiedemann, University of Helsinki

10:00 – 10:30

Language Technologies for the Languages of Finland – Panel session (video, pdf)
Filip Ginter, University of Turku (Moderator)
Sebastian Andersson, Lingsoft
Jörg Tiedemann, University of Helsinki
Sampo Pyysalo, University of Turku
Pasi Tapanainen, Etuma
Kaarina Hyvönen, Kielikone

10:30 – 10:45

Coffee Break

10:45 – 11:15 

The CEF AT Platform (video, pdf)
Vilmantas Liubinas, 
European Commission

11:15 – 11:45

Language technologies by/for the public sector – Panel session (video, pdf)
Jouko Salonen, Finnish Immigration Service (Moderator)
Osma Suominen, National Library of Finland
Ville Viitasaari, Kela
Kaisamari Kuhmonen, Prime Minister’s Office

11:45 – 12:15

Language data creation, management and sharing: existing practices and challenges – Panel session (video)
Aleksi Rossi, YLE (Moderator)
Krister Lindén, University of Helsinki / FIN-CLARIN
Mikko Kurimo, Aalto University
Tommi Kurki, University of Turku

12:15 – 12:30

The EU Council Presidency Translator – Finnish presidency success story and what’s beyond (video, pdf)
Pekka Myllylä, Managing Director at Tilde Eesti OÜ

12:30 – 12:40

Conclusions (video, pdf)
Krister Lindén, University of Helsinki / FIN-CLARIN

12:40 – 14:00

Break

14:00 – 16:30

European Language Grid (ELG): Introduction and overview.
4th Regional European Language Grid (ELG) Workshop in Finland

The ELG workshop is organized in collaboration with the European Language Grid (ELG) and it will take place in the same online meeting room as the ELRC workshop. Please note that the ELG workshop will be held in English only. Welcome to register and participate in both events!

The detailed program for the ELG workshop is updated at https://www.kielipankki.fi/elg-workshop-2020/.

Please register via the ELRC website by 10th December. Welcome!

Contact the local organizers for further details:

Mietta Lennes and Tommi Jauhiainen
University of Helsinki / FIN-CLARIN
fin-clarin [ATT] helsinki.fi

Last updated: December 8, 2020

Join the online course Introduction to Speech Analysis!

The open online course Introduction to Speech Analysis (5 ECTS) has just started. The course is now offered for the first time in both Finnish and in English. Within the group size limits, you can join in from any university until 6th November 2020. See the course home page for instructions on how to enrol the course area on Moodle.

During the course, you learn to transcribe and to annotate speech and to understand some of the most important acoustic displays and measurement methods that can be used in speech research. The main tool of the course is the Praat analysis program, but we will also take a look at ELAN. The course can be relevant for students in phonetics, linguistics and languages, but also in other fields where audio recordings of speech are used for research.

All the courses offered by FIN-CLARIN can be found on the Training page.

Kielipankki Live

Suomeksi

Kielipankki Live is a new series of online events where researcher guests are interviewed and current issues about text and speech resources and the Language Bank of Finland are discussed. After each event, the recorded presentations are made available on YouTube (for materials and links, see the list of past events below). In order to stay tuned on future events, please subscribe to the newsletter of the Language Bank of Finland.

The next Kielipankki Live event: 14.12.2020 at 13.00-15.00


kuva äänen aaltomuodosta

 

Theme: Speech corpora and personal data
Look forward to expert guests and good discussions! The brief presentations will be held in English, but you can also ask questions in Finnish. The event starts at 13:00 and it is expected to finish by 15:00.

Program

  • Mietta Lennes: Introduction and current affairs in the Language Bank of Finland
  • Krister Lindén: A brief introduction to legal issues regarding language data
  • Interview of Rosa González Hautamäki and Tomi Kinnunen: Experiences from collecting and sharing the Corpus of Age-related Voice Disguise (AVOID) and other speech corpora for research in speech technology. 
  • Satu Saalasti: The DELAD initiative for sharing corpora of disordered speech
  • Aleksi Rossi: A brief status report of the Donate Speech campaign (Lahjoita puhetta)
  • Questions & Answers: Ask the staff and experts of the Language Bank of Finland
  • General discussion

Registration

Please register for the event here by 11.12.2020. When registering, you can already ask questions from the researcher guests and from the experts of the Language Bank of Finland. It will also be possible to ask questions during the event.

By registering, you will receive a link for joining Zoom before the event begins. If the event has already started, you can also email fin-clarin [AT] helsinki.fi if you want to join late.

The event will be recorded and the main content will be published online. In case you do not wish your face or voice to be recorded, please keep your camera and microphone off during the event. You may also participate in the discusssion via the chat. The names and contact details of the participants will not be published.


All Kielipankki Live events

  • 14.12.2020 at 13-15, in English (register here)
  • 24.8.2020 (in Finnish)

CLARIN Café II: How to use CLARIN in (online) higher education

In this virtual CLARIN event, Mietta Lennes from FIN-CLARIN gave a short presentation about her online teaching.

YouTube playlist of all the presentation videos in CLARIN Café II, 10.6.2020
Mietta’s presentation Developing online teaching with materials, exercises and videos in two languages

 

 

CLARIN Café II: How to use CLARIN in (online) higher education

Toisen CLARIN ERICin järjestämän CLARIN Café -virtuaalitapahtuman aiheena oli CLARIN-aineistojen ja työkalujen hyödyntäminen korkeakouluopetuksessa. Myös Mietta Lennes FIN-CLARINista piti lyhyen esityksen omasta verkko-opetuksestaan.

YouTube-soittolista CLARIN Café II -tapahtuman kaikista esityksistä 10.6.2020
Mietan esitelmä Developing online teaching with materials, exercises and videos in two languages

The Donate Speech (Lahjoita puhetta) campaign kicks off 16.6.2020

The Language Bank of Finland is working together with the Finnish Broadcasting Company (Yle) and the Finnish State Development Company (Vake Oy) in the Donate Speech campaign (Lahjoita puhetta) launching on 16th June 2020. The aim of this project is to collect all kinds of Finnish speech from all kinds of people, from all over Finland and abroad.

By donating your speech, you can help researchers and companies to study language and to develop technology and services that can be used in Finnish more fluently in the future. All variants of spoken Finnish are welcome – including the speech of second-language Finnish learners. As long as you speak some Finnish and can understand the Finnish instructions in the app, you can donate!

Read more about the contribution of the Language Bank of Finland (in Finnish)

Go and donate your speech!

See also:

Interview of Mietta Lennes in the morning radio program Puheen Aamu on Yle Puhe 16.6.2020 (the program is available in Finnish for one week after time of broadcast)

Kielipankki – The Language Bank of Finland presented at COST Action 18209 ”NexusLinguarum” plenary meeting in Prague

Finland is currently participating in the COST Action 18209 ”NexusLinguarum” that aims to build an European network for web-centred linguistic data science. The first plenary meeting of the COST Action was held in Prague on 27-28 January, 2020. During the poster session of the meeting, FIN-CLARIN and the Language Bank of Finland were presented by Mietta Lennes with this poster:

 

Poster of the Language Bank of Finland (Kielipankki)

Introduction to the Language Bank of Finland at the workshop “Digital Parliamentary data and research”

Friday 3 May at 12.00
Aalto University (Otaniemi), CS-Building, Room T4 / A238 (Konemiehentie 2)

The aim of the workshop was to discuss the novel digital parliamentary datasets—in particular those of Parliament of Finland—their use in research, the related research resources and tools, and their future development for researchers, but also for citizens and the media. FIN-CLARIN and the Korp version 1.1 of the Plenary Sessions of the Parliament of Finland, available in the Language Bank of Finland, was also presented during the afternoon.

Mietta Lennes: FIN-CLARIN and Parliamentary Data in Kielipankki – the Language Bank of Finland (PowerPoint / PDF slides)

Further information including the programme of the workshop can be found at https://www.helsinki.fi/en/helsinki-centre-for-digital-humanities/workshop-digital-parliamentary-data-and-research.

Learn to process your own data in the Corpus Clinic

The registration deadline of the online course Corpus Clinic has been extended to 23rd November, until when it is possible to join the course area on Moodle. Students from the University of Helsinki as well as from other universities can enrol.  Please note, however, that a limited number of participants can be accepted. See further instructions on the course page.

In the Corpus Clinic, you will learn about the various methods and tools that are available for managing, processing and analyzing your data. You will also learn to write a data management plan. If required, it is possible to complete the course fully online.

This year, the course is jointly organized by FIN-CLARIN and HELDIG. During the spring term – after passing the initial stage of the course – each participant will have the opportunity to meet with a member of the supporting group of digital humanities experts who can help you with more specific questions about your data analysis. More information about this will be provided during the course.