Workshop: Large Language Models and Speech-Centric AI


Wednesday 09.10.2024 at 8:30-14:00, Clarion Hotel Helsinki


Department of Digital Humanities, University of Helsinki
LAREINA project
Kites ry


Clarion Hotel Helsinki, Tyynenmerenkatu 2, Helsinki

Welcome to the Workshop!

The development of language-centric AI during the past few years has been remarkable. It poses challenges but also creates opportunities for organizations in both the private and the public sector. Many of us are curious about how to harness the power of AI in our own business.

Our workshop on Large Language Models and Speech-Centric AI will showcase various use cases and applications in both the public and the private sector. Our objective is to introduce the current state of language-centric AI in Finland, and share information about the future of access to language data and modules. The demo presentations and industry talks will illustrate the potential use of language-centric AI.

This workshop is aimed at developers, integrators and users of language technologies and AI solutions in Finland. The workshop will be held in English and on-site only.


Participation is free of charge, but registration is required. The workshop has 50 seats. Registration has now ended and the event is fully booked. A waiting list is kept for any seats that become available.



Programme for the Workshop on Wednesday 09.10.2024


08:30 – 09:00

Registration and Coffee

09:00 – 10:30

LLMs and Speech-Interfaces in Private and Public Sector

Krister Lindén, University of Helsinki
Tomi Paavola, Ministry of Transport and Communications
Jörg Tiedemann, University of Helsinki
Tommi Lehtonen, KAVI & Mikko Kurimo, Aalto University
Markus Koskela, CSC – IT Center for Science

10:30 – 11:30

Demo Presentations and Coffee

11:30 – 13:00

AI and Speech-Interfaces

Antti ’Jogi’ Poikola, Teknologiateollisuus ry
Iftikhar Ahmad, Tietoevry
Manu Setälä, Solita Oy
Iikka Hauhio, Kielikone Oy
Michael Stormbom, Lingsoft Language Services Oy
Peter Smit, Inscripta Oy

13:00 – 14:00



Contact the organizers for further details:

lareina-office [ATT]

Last updated: October 8, 2024

Introducing: LAREINA project (funded by Business Finland)

An article presenting the LAREINA – Language Resource Infrastructure for AI (2023–25) project has been published on the website of the University of Helsinki. The LAREINA project is funded by Business Finland and implemented by Aalto University and the University of Helsinki as part of Tietoevry’s Veturi programme. The project involves companies and public sector organisations as partners.

The LAREINA project develops speech recognition and speech synthesis for Finnish, Finnish-Swedish and the Sámi languages. The project partners will test the components in different tasks and in areas such as call centres and machine translation. The LAREINA project aims to ensure that high-quality speech interfaces and speech-based AI services are also available for speakers of small languages.

The outputs of the LAREINA project will be published under an open licence that also permits commercial use, and they will be available through the Language Bank of Finland (Kielipankki).

Read more about the LAREINA project on the University of Helsinki website: ”Speech-based AI services needed for small languages as well – researchers support companies in product development” (Published on 11.04.2024)

Visit the LAREINA project webpage:

Introducing: the LAREINA project, funded by Business Finland

An article presenting the LAREINA – Language Resource Infrastructure for AI project (2023–25) has been published on the University of Helsinki website. The project is funded by Business Finland and carried out by Aalto University and the University of Helsinki as part of Tietoevry’s Veturi programme. Companies and public sector organisations take part in the project as partners.

The LAREINA project develops speech recognition and speech synthesis for Finnish, Finland-Swedish and the Sámi languages. The project partners test them in, for example, telephone services and translation. The aim of the LAREINA project is to ensure that high-quality speech interfaces and speech-based AI services can also be produced for speakers of small languages.

The outputs of the LAREINA project will be published under an open licence that also permits commercial use, and later also through the Language Bank of Finland (Kielipankki).

Read more about the LAREINA project on the University of Helsinki website (article in Finnish): ”Puheella toimivia tekoälypalveluja tarvitaan myös pienille kielille – tutkijat vauhdittavat yritysten tuotekehitystä” (published on 11.4.2024).

Visit the LAREINA project website:

European Language Data Space workshop

Unleashing the potential of data for businesses and citizens in the EU


Wednesday 10.04.2024 at 9:00-15:15, Clarion Hotel Helsinki

European Language Data Space
Department of Digital Humanities, University of Helsinki

Welcome to the European Language Data Space workshop!

The European Language Data Space (LDS) and the University of Helsinki are bringing together experts from Finnish industry, public administration and research to discuss the importance of language data for the development of language technologies and AI-based tools in Finland. The event takes place on 10.04.2024 at Clarion Hotel Helsinki.

Since the beginning of 2023, the European Commission has guided and supported a new way of sharing language data through the European Language Data Space (LDS). This new approach extends beyond language data and covers many sectors and operating environments through their own data spaces. With the establishment of the Common European Data Spaces, communication and exchange between different forms of data description and availability are becoming a reality in all European countries.

Against this background, the European Language Data Space aims to build a trustworthy and effective data market for sharing language resources in the public and private sectors, in line with the EU Data Strategy.

The European Language Data Space (LDS) is organising a series of country workshops intended to help local companies, research groups and public administrations adopt the new language data exchange space and join relevant local and European networks. At the same time, they can make use of trustworthy infrastructures that already exist. As a European platform for sharing language data, the LDS can help local actors monetise their language data in a multilingual Europe where the importance of language technologies and AI-based applications keeps growing.


The LDS workshop in Finland

The Finnish workshop addresses the needs of domestic private and public sector stakeholders as providers, integrators and/or consumers of language data. The event shares their experiences and requirements and explores how to achieve the desired technological growth and improve competitiveness at both the national and the European level. The workshop discusses how the LDS can help Finnish actors and support their efforts to produce, monetise or acquire language data to power language technologies and AI-based tools in Finland.

The workshop is aimed at data holders and providers, developers and integrators of language technologies, SMEs, and representatives of public administration, authorities and partners. The workshop is held in English.


Participation is free of charge, but advance registration is required. Registration closed on 03.04.2024. Contact the organisers to check whether seats are still available: lareina-office [ATT]


LDS workshop in Finland, 10.4.2024: programme

09:00 – 09:45


09:55 – 10:05

Welcome and introduction
Krister Lindén, University of Helsinki

10:05 – 10:35

Welcome by the European Commission: The Digital Europe Programme and the Common European Language Data Space
Philippe Gelin, European Commission

10:35 – 11:05

The importance of language data for the development of LT solutions future steps
Aleksander Alafuzoff, Yle

11:05 – 11:30

Coffee Break

11:30 – 11:40

Welcome by the Ministry of Finance
Olli-Pekka Rissanen, Ministry of Finance

11:40 – 12:30

Language Data and Language Technologies in Finland and for Finnish
– Panel session

Krister Lindén, University of Helsinki (Moderator)
Mikko Kurimo, Aalto University
Iftikhar Ahmad, Tietoevry
Peter Smit, Inscripta Oy
Riikka Lindroos-Järvitalo, KELA
Patrik Gayer, SiloAI
Kirsi Salmela, Kopiosto

12:30 – 13:00

European Language Data Space: developing a market for language data and services and benefitting from a joint European effort
Georg Rehm, LDS Consortium, German Research Center for Artificial Intelligence (DFKI)

13:00 – 13:50

Lunch Break

13:50 – 14:50

Language data production, management, and market development: overcoming obstacles – Panel session
Krister Lindén, University of Helsinki (Moderator)
Manu Setälä, Solita Oy
Kaarina Hyvönen, Kielikone Oy
Tiina Lindh-Knuutila, Lingsoft Language Services Oy
Tommi Lehtonen, KAVI
Ilkka Lavas, City Digital Group
Jörg Tiedemann, University of Helsinki

14:50 – 15:05

Krister Lindén, University of Helsinki

15:05 – 15:15

Coffee Break and Networking

15:15 – 16:15

Coffee Break and Networking continue at Sitra’s Nordic Data Festival 2024 event (co-located at Clarion Hotel Helsinki)



Contact the local organisers:

Krister Lindén and Wilhelmina Dyster
University of Helsinki
lareina-office [ATT]

Last updated: 05.04.2024

European Language Data Space (LDS) workshop in Finland

Unleashing the potential of data – for EU businesses and citizens


Wednesday 10.04.2024 at 9:00-15:15, Clarion Hotel Helsinki

European Language Data Space
Department of Digital Humanities, University of Helsinki

Welcome to the European Language Data Space workshop in Finland!

The European Language Data Space and the University of Helsinki are bringing together experts from the Finnish Industry, Public Administration and Research to discuss the importance of language data for the development of Language Technologies and AI-based tools in Finland. The event is taking place on 10.04.2024 at Clarion Hotel Helsinki.

Since early 2023, the European Commission has been providing guidance and support towards a new dimension in language data sharing, implemented through the European Language Data Space (LDS). This new dimension goes beyond language data and addresses many areas and fields through their specific Data Spaces. With the establishment of the Common European Data Spaces, communication and exchange between different modalities of data description and availability are becoming a reality for all European countries.

In this context, the European Language Data Space aims at building a trustworthy and effective data market for the exchange of language resources in the public and – even more importantly – in the private sector, in line with the EU Data Strategy.

For that purpose, the European Language Data Space (LDS) is organising a series of Country Workshops to help local industries, research groups and public administrations join this new language data exchange space and connect with relevant local and European networks, while benefiting from the trustworthy infrastructures already available. As a European language data sharing platform, the LDS can help local industry stakeholders to monetise their language data in a multilingual Europe where Language Technologies and AI-based applications play an increasingly important role.


The LDS workshop in Finland

The Finnish LDS workshop will address the needs of Finnish stakeholders in both the private and public sectors, whether they are providers, integrators or consumers of language data. Participants will share their experiences and requirements and explore how to achieve the technological growth needed to enhance their competitiveness at both national and European levels. The LDS will present and discuss how it can help Finnish stakeholders and support their efforts to produce, monetise or obtain language data to power LT and AI-based tools in Finland.

The workshop is addressed to data owners and data providers, LT developers and integrators and SMEs, as well as to public administration executives, officers and partners. The workshop will be held in English.


Participation is free of charge, but registration is required. Registration closed on 03.04.2024. Please contact the organisers to check whether seats are still available: lareina-office [ATT]


European Language Data Space (LDS) workshop in Finland on April 10th, 2024: Programme

09:00 – 09:45


09:55 – 10:05

Welcome and introduction
Krister Lindén, University of Helsinki

10:05 – 10:35

Welcome by the European Commission: The Digital Europe Programme and the Common European Language Data Space
Philippe Gelin, European Commission

10:35 – 11:05

The importance of language data for the development of LT solutions future steps
Aleksander Alafuzoff, Yle

11:05 – 11:30

Coffee Break

11:30 – 11:40

Welcome by the Ministry of Finance
Olli-Pekka Rissanen, Ministry of Finance

11:40 – 12:30

Language Data and Language Technologies in Finland and for Finnish
– Panel session

Krister Lindén, University of Helsinki (Moderator)
Mikko Kurimo, Aalto University
Iftikhar Ahmad, Tietoevry
Peter Smit, Inscripta Oy
Riikka Lindroos-Järvitalo, KELA
Patrik Gayer, SiloAI
Kirsi Salmela, Kopiosto

12:30 – 13:00

European Language Data Space: developing a market for language data and services and benefitting from a joint European effort
Georg Rehm, LDS Consortium, German Research Center for Artificial Intelligence (DFKI)

13:00 – 13:50

Lunch Break

13:50 – 14:50

Language data production, management, and market development: overcoming obstacles – Panel session
Krister Lindén, University of Helsinki (Moderator)
Manu Setälä, Solita Oy
Kaarina Hyvönen, Kielikone Oy
Tiina Lindh-Knuutila, Lingsoft Language Services Oy
Tommi Lehtonen, KAVI
Ilkka Lavas, City Digital Group
Jörg Tiedemann, University of Helsinki

14:50 – 15:05

Krister Lindén, University of Helsinki

15:05 – 15:15

Coffee Break and Networking

15:15 – 16:15

Coffee Break and Networking continue at Sitra’s Nordic Data Festival 2024 event (co-located at Clarion Hotel Helsinki)



Contact the local organizers for further details:

Krister Lindén and Wilhelmina Dyster
University of Helsinki
lareina-office [ATT]

Last updated: April 5, 2024

LAREINA – Language Resource Infrastructure for AI (2023–25)

LAREINA Research Organizations

University of Helsinki, Coordinator, Speech synthesis
Aalto University, Automatic speech recognition (ASR)

Contact the LAREINA project Coordinator and the LAREINA Research Organizations via email:

LAREINA Project Partners

Industry Partners

Tietoevry Finland Oy
Inscripta Oy
Kielikone Oy
Lingsoft Language Services Oy
Solita Oy

Public sector Partners

National Audiovisual Institute (KAVI)
Kansaneläkelaitos (KELA)


Business Finland, Funding decision no. 7388/31/2022


LAREINA – Language Resource Infrastructure for AI

LAREINA – Language Resource Infrastructure for AI is a Business Finland-funded project (2023–25) in which the University of Helsinki and Aalto University collaborate with companies to research, produce, test and pilot speech technology components. The goal of the LAREINA project is to develop a commercially replicable model for building speech interfaces for small and medium-sized languages.

During the LAREINA project, speech synthesis and automatic speech recognition (ASR) will be developed for Finnish, Finland-Swedish and the Sámi languages. The research methods and outputs will be applicable to other small and medium-sized languages as well.

The results of the LAREINA project will be published under a license which also allows for commercial use.

LAREINA organization

The University of Helsinki is the coordinator and a research organization in the LAREINA project. Area of research: Speech synthesis.

Aalto University is a research organization in the LAREINA project. Area of research: Automatic Speech Recognition (ASR).

The LAREINA project has seven companies and organizations in Finland as project partners. The LAREINA project has received funding from Business Finland for 2023-25 (Funding decision no. 7388/31/2022).

Contact information

Contact information and more details about the LAREINA Project Partners can be found here.

LAREINA in Media


No upcoming events. Please read about the past events below.

Past events

Workshop: ”Large Language Models and Speech-Centric AI”, 09-10-2024, Helsinki
European Language Data Space workshop in Finland, 10-04-2024 @ Hotel Clarion Helsinki

LAREINA Newsletter

Keep updated about the news and events related to the LAREINA project. Sign up for the LAREINA Newsletter here.


LAREINA – Language Resource Infrastructure for AI (2023–25)


University of Helsinki, Coordinator, Speech synthesis
Aalto University, Automatic speech recognition (ASR)

Contact the LAREINA project Coordinator and the LAREINA Research Organizations:


Industry Partners

Tietoevry Finland Oy
Inscripta Oy
Kielikone Oy
Lingsoft Language Services Oy
Solita Oy

Public sector Partners

National Audiovisual Institute (KAVI)
Kansaneläkelaitos (KELA)


Business Finland, public research projects, Funding decision no. 7388/31/2022


LAREINA – Language Resource Infrastructure for AI

LAREINA – Language Resource Infrastructure for AI is a project funded by Business Finland (2023–25) in which the University of Helsinki and Aalto University, in collaboration with companies, research, produce, test and pilot speech technology components. The goal of the project is to develop a commercially replicable model for building speech interfaces for small and medium-sized languages.

During the LAREINA project, speech synthesis and automatic speech recognition are developed for Finnish, Finland-Swedish and the Sámi languages. The methods developed and the results achieved during the project can also be applied to other small and medium-sized languages.

The results of the LAREINA project will be published openly, under a licence that also permits commercial use.

LAREINA project organization

The University of Helsinki acts as the coordinator and a research organization of the LAREINA project, with speech synthesis as its area of research.

Aalto University acts as a research organization of the LAREINA project and develops automatic speech recognition during the project.

The LAREINA project has seven companies and organizations operating in Finland as partners. The project has received funding from Business Finland for 2023–25 (funding decision 7388/31/2022).


Contact information and more details about the LAREINA project partners can be found here.

LAREINA in the media


No upcoming events. See the past events below.

Past events

Workshop: ”Large Language Models and Speech-Centric AI”, 09-10-2024, Helsinki

European Language Data Space (LDS) workshop in Finland, 10-04-2024 @ Hotel Clarion Helsinki

Subscribing to the LAREINA newsletter

We share news and events related to the LAREINA project in the LAREINA newsletter. Subscribe to the newsletter here.

General Terms of Speech Material Use (commercial use)



The text below on this page is a copy of the ”General Terms of Speech Material Use” that are included in the agreement made with a company or an organization regarding the use of the ”Speech Material”.

Definitions used in the text:

Speech Material = According to the agreement, one of the following:
Donate Speech: Complete dataset, version 1, URN: urn:nbn:fi:lb-2020090321
Donate Speech: Annotated dataset, URN: urn:nbn:fi:lb-2022060128
Donate Speech: Selected dataset, URN: urn:nbn:fi:lb-2022060127
Donate Speech Corpus: Sample, URN: urn:nbn:fi:lb-2022060126

The Language Bank = University of Helsinki

Licensee = The company or the organization with which the Agreement was signed

General Terms of Speech Material Use

1.     Definitions

1.1.         “Speech Material” refers to material collected in the Lahjoita puhetta (Donate Speech) campaign, as defined in the signature part of this Agreement, which the Language Bank distributes for the purpose of research and development of applications and services that are capable of interpreting and producing speech, as well as for the purpose of language research.

“Data Protection Legislation” refers to the EU General Data Protection Regulation (2016/679) (“GDPR”) or any subsequent law that supersedes it, and the national data protection legislation applicable to the Licensee. This Agreement refers to terms that have been defined in the GDPR, including “personal data”, “data subject”, “controller”, “processing” and “processor”. In this Agreement, they are given the same meaning as in the GDPR.

2.     License

2.1.         The Language Bank hereby grants the Licensee a non-exclusive, non-transferable and non-sublicensable license to use the Speech Material for the purpose of research and development of applications and services that are capable of interpreting and producing speech in accordance with the terms specified in this Agreement.

The license is valid for as long as this Agreement remains in force.

2.2.         The license only applies to the Licensee defined in the signature part of this Agreement. Disclosing or transferring the Speech Material to a third party (including to a company affiliated with the Licensee) is prohibited, except as set forth in Section 4.3.

The Language Bank retains proprietary rights to the Speech Material.

2.3.         For the avoidance of doubt, any results created by the Licensee (such as software and models) from which the Speech Material, the personal data included in the Speech Material, and the voices of the speakers cannot be restored shall belong to the Licensee, and the Licensee may continue to use such results after the term of the license.

The Language Bank shall provide a copy of the Speech Material to the Licensee after the Licensee has paid the fee set forth in the signature part of this Agreement.

2.4.         The Licensee must comply with the data protection terms and conditions presented in Section 4 of this Agreement when processing the Speech Material.

The Licensee must use the Speech Material in accordance with good practice while respecting equality and human rights. The use of the Speech Material for discriminatory purposes or purposes that are derogatory to a specific group of people is prohibited.

3.     License Fee

3.1.         The Licensee shall pay the license fee set forth in the signature part of the Agreement.

4.     Confidentiality and Data Protection

4.1.         The Licensee understands that the Speech Material includes personal data subject to Data Protection Legislation. When processing the Speech Material, the Licensee is considered the controller. The Licensee undertakes to process the Speech Material in accordance with Data Protection Legislation solely for the purpose determined in Section 2.1 of this Agreement. The Licensee shall comply with any obligations imposed on the controller by Data Protection Legislation in the processing of the Speech Material.

The data included in the Speech Material are confidential. The Licensee must implement any technical and organisational measures required to ensure that only the relevant persons have access to the Speech Material. Employees of the Licensee who process the Speech Material must be bound by an obligation of confidentiality pertaining to the content of the Speech Material. The confidentiality obligation must remain in effect after the end of the employment relationship.

4.2.         The Licensee shall not disclose or provide access to the Speech Material to any third party. Publication of the Speech Material is prohibited. Notwithstanding the aforementioned, the Licensee may transfer the Speech Material to its subcontractors or service providers that act as processors of personal data for purposes consistent with this Agreement. When employing processors, Licensee shall comply with the requirements of Data Protection Legislation regarding processors of personal data, and conclude an agreement on the processing of personal data in accordance with Article 28 of the GDPR with the processors. If the processors process personal data outside the European Economic Area, the Licensee shall comply with the provisions of Chapter V of the GDPR on the transfer of personal data to third countries or international organisations. The processing of the Speech Material in cloud-based services aimed at consumers is prohibited.

The Speech Material shall not be used to identify speech donors. Recordings included in the Speech Material must not be combined with personal data available elsewhere, nor may recordings included in the Speech Material be compared to recordings available elsewhere to determine whether the speaker is the same person in both.

4.3.         The Licensee shall maintain an up-to-date data protection statement online on the use of the Speech Material. The data protection statement must contain all information which must be supplied to data subjects according to Data Protection Legislation. The Licensee shall submit to the Language Bank the URL address of the statement before commencing the processing of the Speech Material. The Language Bank will publish the URL address on its website.

If the Licensee processes personal data outside the European Economic Area, the Language Bank and the Licensee shall put in place the safeguards required by Chapter V of the GDPR before the disclosure of the Speech Material to the Licensee. The Language Bank has the right to refrain from transferring the Speech Material for processing outside the European Economic Area if it deems that such transfer in accordance with Chapter V of the GDPR is not possible by reasonable means.

4.4.         The Licensee shall notify the Language Bank without undue delay if the Speech Material is subjected to a personal data breach which results in the accidental or unlawful destruction, loss, alteration, unauthorised disclosure of, or access to, transferred, stored, or otherwise processed personal data.

The Licensee shall securely delete the Speech Material when it no longer has grounds based on Data Protection Legislation to process the Speech Material. In any event, the Licensee shall delete the Speech Material upon expiration or termination of the license granted in this Agreement. The Licensee shall document the deletion of the Speech Material. The Language Bank has the right to request and receive this documentation and an assurance given by the Licensee indicating that the Speech Material has been deleted.

5.     Updates to the Speech Material, Obligation to Notify

5.1.         The Language Bank may produce new versions of the Speech Material to ensure, for example, that the rights of data subjects in accordance with Data Protection Legislation are fulfilled and that there is no unlawful content in the Speech Material. When the Language Bank produces a new version of the Speech Material and notifies the contact person of the Licensee by email, the Licensee shall, without delay, delete the old version of the Speech Material and replace it with the new version. In accordance with Section 8, the Licensee must submit a functional and valid email address to the Language Bank to which notifications of updates to the Speech Material are to be sent.

The Licensee shall notify the Language Bank without delay if it identifies or suspects the presence of the following content in the Speech Material:

(a)   Unauthorised, inaccurate, unnecessary or outdated personal data (such as direct identifiers, including names and contact details, information pertaining to the private life of individuals, rumours or defamatory speech),

(b)  Unauthorised copies of works or other objects protected by copyright or related rights,

(c)   Trade secrets,

(d)  Data whose disclosure would constitute an offence against privacy, public peace or personal reputation (Chapter 24 of the Criminal Code of Finland), incitement to hatred or ethnic agitation (Sections 10 and 10a, Chapter 11 of the Criminal Code of Finland) or another offence, or

(e)   Recordings where speech has been recorded without the speaker’s knowledge, or the recording has been started by accident.

The notification made by the Licensee shall include information that enables the Language Bank to identify the relevant recording.

6.     No Warranty

6.1.         The Language Bank provides the Speech Material to the Licensee “as is”. The Language Bank provides no warranty on the Speech Material and specifically disclaims any warranties of accuracy, completeness or fitness for a particular purpose, or non-infringement upon the rights of any third parties. The Licensee shall use the Speech Material at its own risk. The Language Bank is not responsible for any damage or losses incurred by the Licensee through the use of the Speech Material.

7.     Liability for Damages

7.1.         The Licensee is solely liable for ensuring that it uses the Speech Material in accordance with Data Protection Legislation and any other applicable legislation.

The Parties shall be liable towards each other for the damage they have caused by a breach of contract. The Language Bank shall not be liable for indirect or consequential damage. In all cases, the total liability of the Language Bank is limited to the amount of the license fee paid by the Licensee to the Language Bank. The above limitations of liability do not apply if the damage was caused wilfully or by gross negligence.

7.2.         Neither Party is liable towards the other Party if a failure to fulfil an obligation set out in this Agreement is caused by a force majeure event. Force majeure includes, but is not limited to, fires, floods, explosions, lightning, storms, earthquakes, landslides, shortages of energy supply, interventions by government, revolutions, riots, wars, strikes, labour disputes, transport disruptions, shortages of labour, or another factor beyond the reasonable control of the relevant Party.

8.     Notices

8.1.         Any notices relating to this Agreement shall be sent by post, courier or email to the relevant Party’s contact person indicated in the signature part of this Agreement. If the contact details of a Party change, the Party shall submit new contact details to the other Party without undue delay.

9.     Term and Termination

9.1.         The Agreement shall enter into force on the date of the last signature and shall remain in effect for 10 years from the effective date. The Licensee may terminate the Agreement by giving written notice of termination to the Language Bank, in which case the Agreement shall terminate after 30 days have passed from the date of the notice. The Language Bank shall have no obligation to return any fees paid by the Licensee for the license to use the Speech Material.

As stated in Section 4.8 above, the Speech Material must be deleted when there are no legal grounds under the Data Protection Laws for the processing of personal data. During the term of the license to use the Speech Material, the Language Bank shall provide a new copy of the Speech Material to the Licensee upon request and within a reasonable time if the Licensee wishes to resume processing of the Speech Material.

9.2.         If a Party materially breaches this Agreement and does not remedy the breach within thirty (30) days of receiving written notice concerning the breach, or if the nature of the breach makes it incapable of being remedied, the other Party may terminate this Agreement. For the avoidance of doubt, failure to pay compensation for the use of the Speech Material and a material breach of the Data Protection Laws shall be considered a material breach of contract.

If a Party is evidently insolvent or becomes subject to bankruptcy, composition, insolvency, administration, administrative receivership or other similar proceedings, the other Party may terminate the Agreement with immediate effect.

9.3.         In accordance with Section 4, the license to use the Speech Material ends immediately at the termination or expiry of the Agreement.

Sections 6, 7, 10 and 11.5 of the Agreement as well as provisions which are intended to remain in effect due to their nature shall remain in effect also after the termination or expiry of the Agreement.

10.  Governing Law and Dispute Resolution

10.1.      This Agreement is governed by the laws of Finland excluding its conflict of law provisions.

Any disputes relating to this Agreement that cannot be settled amicably shall be resolved by the Helsinki District Court.

10.2.      If the Licensee’s registered office is in a country in which a judgment from the courts of Finland would not be enforceable, any disputes arising from the Agreement shall be finally settled by arbitration under the rules of arbitration of the Central Chamber of Commerce of Finland. The arbitral tribunal is composed of a sole arbitrator. The seat of arbitration is Helsinki, Finland. The language of arbitration is Finnish.

11.  Miscellaneous

11.1.      The Language Bank has the right to amend these general terms on legitimate grounds, including but not limited to instructions given by authorities, best practice, or changes in the Data Protection Laws or other applicable laws. A notification of any changes made shall be sent by email to the contact address provided by the Licensee sixty (60) days before their entry into force. If the Licensee does not accept the changes made to the general terms, it may terminate the Agreement before the entry into force of the amendments by giving written notice to the Language Bank no later than thirty (30) days before their entry into force. The Language Bank shall have no obligation to return any fees paid by the Licensee for the license to use the Speech Material.

The Licensee shall not transfer this Agreement or parts thereof to third parties without the express written consent of the Language Bank.

11.2.      This Agreement cancels all prior agreements and together with its appendices represents the entire Agreement between the Parties relating to the subject matter thereof.

If any of the terms of this Agreement are or become invalid, the remainder of the Agreement shall remain valid. If any invalid, unenforceable or illegal provision of this Agreement would be valid, enforceable and legal if some part of it were deleted, the provision shall apply with the minimum modification necessary to make it legal, valid and enforceable.

11.3.      Neither Party may use the name or logo of the other Party in product marketing, media releases or for other similar purposes, unless specifically agreed between the Parties in writing. However, the Licensee has the right to refer, as appropriate, to the Lahjoita puhetta (Donate Speech) campaign and the Language Bank as the source of the Speech Material. For the avoidance of doubt, the Language Bank has the right to mention the Licensee as a recipient of the Speech Material on the Lahjoita puhetta data protection webpage, as well as in other contexts where mentioning recipients of the Speech Material is necessary for the Language Bank to comply with its obligations.





This page was last updated on 6.9.2022.

Persistent Identifier of this page: urn:nbn:fi:lb-2022060130


Kielipankki collaborates with companies by organizing events and joint projects. Certain resources and services may also be available for commercial use through Kielipankki.

Event: European Language Data Space (LDS) workshop in Finland, Clarion Hotel Helsinki, 10.4.2024
LAREINA – Language Resource Infrastructure for AI project (2023–25)
LUMI supercomputer and CSC's supercomputing services for companies
Researcher of the Month: Aku Rouhe


