The online course Natural Language Processing for Linguists will be taught by Tuomo Hiippala at the University of Helsinki during 15.3.–10.5.2021.
The course is also open to students from universities outside Helsinki, if space allows. Registration is open until 16th March.
Note also that all the course materials will be available online and you can use them even if you cannot make it to the course this time!
The next Kielipankki Live event will be held on Monday 14th December starting at 13:00 via Zoom. The event will be in English, but questions are welcome in Finnish as well! The main themes are speech corpora and personal data practices. Join us for the interviews and presentations of special guests and for good discussions! Register preferably by 11th December.
The European Language Grid (ELG) aims to provide a digital marketplace where European companies, organizations and citizens can both offer and efficiently use language technologies, data sets and services. The ELG workshop presents an overview of the ELG platform and the ELG pilot projects. Welcome to see what ELG has to offer for you!
The workshop is a free online event, but registration is required. Please register via the ELRC website by 10th December. NB: In case you wish to participate in the ELG tutorial session that may be arranged after the workshop, please indicate this in the field for additional information on the registration form. Thanks!
Note that the third ELRC Workshop in Finland will also take place online, in the same virtual room, on the same day at 9.30-12.40. Welcome to participate in both events!
|14:00||Welcome and introduction|
|14:30||ELG online demo|
|14:50||Presentations of Finnish Pilot Projects funded in ELG: PARA4DLM (University of Turku), LSDISCO (Lingsoft), OPUS-MT (University of Helsinki)|
|15:20||Expectations/requirements of Finnish Language Technology providers (Marko Turpeinen, 1001Lakes)|
|15:40||Summary and discussion|
|16:00||End of workshop|
|16:15||Tutorial: How to integrate a service into ELG|
This tutorial may be organized according to requests from the participants. Please indicate your interest in the registration form.
Last updated: December 7, 2020
This online course can support you with practical issues in managing the research data you need for your MA thesis or PhD project. You can join the course from any university, provided that you fulfil the criteria. There is plenty of room left at the moment. Note, however, that the number of participants is restricted and students in the LingDig MA programme at the University of Helsinki have priority.
Mark your calendars! Further information will be updated on the event page.
The European Language Resource Coordination (ELRC) consortium
Department of Digital Humanities, University of Helsinki
Language Technology is shaping our multilingual future. It has already been transforming the way we interact with our devices and with each other, the way we shop, work and travel. More and more it reshapes our interaction with service providers, either public or private. Programs that automatically correct spelling errors and aid sophisticated writing, digital assistants that transform our voices to text messages on mobile phones, bots that answer our calls to the bank or to our social security organisation, systems that automatically translate from a foreign language, and much more, are already empowering our everyday lives, our businesses and our administrations. But can we fully use our own language in our digital interactions? Is our language adequately supported and ready to keep pace with the technological advancements of the AI era?
The third Finnish European Language Resource Coordination (ELRC) workshop will address these questions and seek to engage participants in a fruitful discussion on the status and prospects of Language Technology for Finnish. Developers, integrators and users of Language Technology, from both the private and public sectors, will share experiences, requirements and ways of transforming digital interaction in our multilingual Europe with Language Technologies. Finally, we will discuss how language data, i.e. texts and speech, can fuel development in Artificial Intelligence.
This ELRC workshop is organized in collaboration with the European Language Grid (ELG). The 4th Regional ELG Workshop will take place in the afternoon, starting at 14:00. For details, see the ELG workshop page. Welcome to register and attend both events!
The ELRC workshop is a free event, but registration is required. You can use the same form to register for both the ELRC workshop (morning sessions) and the ELG workshop (afternoon sessions).
Please register via the ELRC website by 10th December. Welcome!
09:30 – 09:40
09:40 – 10:00
10:00 – 10:30
Language Technologies for the Languages of Finland – Panel session (video, pdf)
10:30 – 10:45
10:45 – 11:15
11:15 – 11:45
Language technologies by/for the public sector – Panel session (video, pdf)
11:45 – 12:15
Language data creation, management and sharing: existing practices and challenges – Panel session (video)
12:15 – 12:30
12:30 – 12:40
|12:40 – 14:00|| |
14:00 – 16:30
European Language Grid (ELG): Introduction and overview.
The ELG workshop is organized in collaboration with the European Language Grid (ELG) and it will take place in the same online meeting room as the ELRC workshop. Please note that the ELG workshop will be held in English only. Welcome to register and participate in both events!
The detailed program for the ELG workshop is updated at https://www.kielipankki.fi/elg-workshop-2020/.
Please register via the ELRC website by 10th December. Welcome!
Mietta Lennes and Tommi Jauhiainen
University of Helsinki / FIN-CLARIN
fin-clarin [AT] helsinki.fi
Last updated: December 8, 2020
The open online course Introduction to Speech Analysis (5 ECTS) has just started. The course is now offered for the first time in both Finnish and English. Within the group size limits, you can join in from any university until 6th November 2020. See the course home page for instructions on how to enrol in the course area on Moodle.
During the course, you will learn to transcribe and annotate speech and to understand some of the most important acoustic displays and measurement methods that can be used in speech research. The main tool of the course is the Praat analysis program, but we will also take a look at ELAN. The course can be relevant for students in phonetics, linguistics and languages, but also in other fields where audio recordings of speech are used for research.
All the courses offered by FIN-CLARIN can be found on the Training page.
Kielipankki Live is a new series of online events where researcher guests are interviewed and current issues about text and speech resources and the Language Bank of Finland are discussed. After each event, the recorded presentations are made available on YouTube (for materials and links, see the list of past events below). To stay informed about future events, please subscribe to the newsletter of the Language Bank of Finland.
Theme: Speech corpora and personal data
Look forward to expert guests and good discussions! The brief presentations will be held in English, but you can also ask questions in Finnish. The event starts at 13:00 and it is expected to finish by 15:00.
Please register for the event here by 11.12.2020. When registering, you can already submit questions to the researcher guests and to the experts of the Language Bank of Finland. It will also be possible to ask questions during the event.
After registering, you will receive a link to the Zoom meeting before the event begins. If the event has already started, you can also email fin-clarin [AT] helsinki.fi if you want to join late.
The event will be recorded and the main content will be published online. In case you do not wish your face or voice to be recorded, please keep your camera and microphone off during the event. You may also participate in the discussion via the chat. The names and contact details of the participants will not be published.
In this virtual CLARIN event, Mietta Lennes from FIN-CLARIN gave a short presentation about her online teaching.
The topic of the second CLARIN Café virtual event organized by CLARIN ERIC was the use of CLARIN resources and tools in higher education teaching. Mietta Lennes from FIN-CLARIN also gave a short presentation about her own online teaching.
The Language Bank of Finland is working together with the Finnish Broadcasting Company (Yle) and the Finnish State Development Company (Vake Oy) in the Donate Speech campaign (Lahjoita puhetta) launching on 16th June 2020. The aim of this project is to collect all kinds of Finnish speech from all kinds of people, from all over Finland and abroad.
By donating your speech, you can help researchers and companies to study language and to develop technology and services that can be used in Finnish more fluently in the future. All variants of spoken Finnish are welcome – including the speech of second-language Finnish learners. As long as you speak some Finnish and can understand the Finnish instructions in the app, you can donate!
Finland is currently participating in the COST Action 18209 “NexusLinguarum”, which aims to build a European network for web-centred linguistic data science. The first plenary meeting of the COST Action was held in Prague on 27-28 January, 2020. During the poster session of the meeting, FIN-CLARIN and the Language Bank of Finland were presented by Mietta Lennes with this poster:
Introduction to the Language Bank of Finland at the workshop “Digital Parliamentary data and research”
Friday 3 May at 12.00
Aalto University (Otaniemi), CS-Building, Room T4 / A238 (Konemiehentie 2)
The aim of the workshop was to discuss the novel digital parliamentary datasets, in particular those of the Parliament of Finland, their use in research, the related research resources and tools, and their future development for researchers, but also for citizens and the media. FIN-CLARIN and the Korp version 1.1 of the Plenary Sessions of the Parliament of Finland, available in the Language Bank of Finland, were also presented during the afternoon.
Further information including the programme of the workshop can be found at https://www.helsinki.fi/en/helsinki-centre-for-digital-humanities/workshop-digital-parliamentary-data-and-research.
The registration deadline of the online course Corpus Clinic has been extended to 23rd November. Until then, it is possible to join the course area on Moodle. Students from the University of Helsinki as well as from other universities can enrol. Please note, however, that a limited number of participants can be accepted. See further instructions on the course page.
In the Corpus Clinic, you will learn about the various methods and tools that are available for managing, processing and analyzing your data. You will also learn to write a data management plan. If required, it is possible to complete the course fully online.
This year, the course is jointly organized by FIN-CLARIN and HELDIG. During the spring term – after passing the initial stage of the course – each participant will have the opportunity to meet with a member of the supporting group of digital humanities experts who can help you with more specific questions about your data analysis. More information about this will be provided during the course.
Digital resources and technology are used more and more within the humanities and the social sciences. Researchers in digital humanities gather, administer and share rapidly accumulating digital resources. They also need various research methods and tools in working with these resources. The conference Research Data and Humanities (RDHum) seeks to gather researchers around these themes. In addition to researchers, we invite teachers, graduate and postgraduate students as well as other interested parties to participate and to contribute.
RDHum 2019 is jointly organised by the University of Oulu and the University of Jyväskylä, in collaboration with FIN-CLARIN and Kielipankki, The Language Bank of Finland. The event is the first in the series of conferences taking place every other year in one of the universities within the FIN-CLARIN Consortium. The first RDHum Conference is hosted by the University of Oulu where the Oulu Corpus, a comprehensive digital research resource at the time, was collected and compiled 50 years ago. The working languages in the conference are Finnish, Swedish and English.
For more information, please send an inquiry to: RDHum2019 [AT] oulu.fi
Welcome! Please register in advance via the ELRC website.
*in collaboration with the Finnish Language Cluster Kites http://www.kites.fi
09:00 – 10:00 Registration and coffee
10:00 – 10:10 Welcome and introduction
Krister Lindén, ELRC Technology National Anchor Point in Finland, FIN-CLARIN, University of Helsinki
Taru Virtanen, ELRC Public Services National Anchor Point in Finland, Prime Minister’s Office
Mikael Reiman, European Commission Representation in Finland
Session 1. Connecting a multilingual Europe: European context and local needs
10:10 – 10:25 The European Language Resource Coordination (ELRC)
Aivars Bērziņš, European Language Resource Coordination, Tilde
10:25 – 11:30 Multilingual Finland
Challenges in multilingualism
Christoffer Forssell, The Finnish Broadcasting Company YLE
Carola Grönholm, Kela
Government language and translation guidelines
Taru Virtanen, Prime Minister’s Office; ELRC PS-NAP
Translation procurement in the public sector, Hansel as a case study
Anni Airaksinen, Hansel
11:30 – 12:00 Panel session: How MT can help, an outlook into current and future challenges
Moderator: Mikael Reiman, European Commission Representation in Finland
Panelists: Christoffer Forssell/YLE, Simo Kankkunen/Prime Minister’s Office and Jörg Tiedemann/University of Helsinki
12:00 – 13:00 Lunch Break
Session 2. Engage: hands-on data
13:00 – 13:30 The CEF eTranslation platform @ work *
Erkka Vuorinen, European Commission’s Directorate-General for Translation (DGT)
13:30 – 14:00 eTranslation Termbank *
Simon Dahlberg, Language Council of Sweden (Språkrådet)
14:00 – 14:30 Governments, NGOs, MT and accessibility *
Mary Nurminen, University of Tampere and Maarit Koponen, University of Turku
14:30 – 15:00 Coffee Break
15:00 – 15:30 Data sharing myths and challenges *
Jarkko Reittu, National Institute for Health and Welfare
15:30 – 16:00 Identifying and managing your data and how ELRC can assist and help *
Aivars Bērziņš, European Language Resource Coordination, Tilde
16:00 – 16:30 Questions and Answers/Open Discussion/Conclusion
Updated: October 23, 2018
The online course Corpus Linguistics and Statistical Methods (Korpuslingvistiikka ja tilastolliset menetelmät, 5 credits) will be offered by FIN-CLARIN during 15.1.-4.3.2018. This course can be taken either in Finnish or in English.
The total number of participants will be restricted, but it will be possible to participate in the course from outside the University of Helsinki and even from outside Finland. For students from outside the University of Helsinki, a registration form is available on the course web page. However, note that students from the University of Helsinki should register via WebOodi as usual.
Registration period: 12.12.2017–14.1.2018
FIN-CLARIN offers several methodological courses online every year. The courses are designed especially for graduate and postgraduate students in Humanities, Social Sciences and Behavioural Sciences who use research materials or corpora containing text or speech.
Students from all Finnish universities can participate for free. If space allows, participants can also be accepted from universities outside Finland. The courses are offered via the Moodle learning environment of the University of Helsinki. It is possible to log in directly with the user credentials of any member organization of the Haka and eduGAIN networks. Please try to log in to Moodle well in advance. If you discover that you have no access, temporary user accounts are available for course participation upon timely request from the teacher.
The CLARIN PLUS Workshop on User Involvement was organized by FIN-CLARIN in Helsinki on 8.-9.6.2017. A total of 23 participants from all over Europe gathered to discuss new ideas and good practices for engaging more users of CLARIN services.
The joint FIN-CLARIN Jubilee Seminar and Nordic CLARIN Network Seminar (9th June 12 noon – 4.10 p.m. and 10th June 9 a.m. – 3 p.m.) will be held at Unioninkatu 40, 1st floor, room 1 (B116), Helsinki. See program.