100suom | Hundred Finnish Linguistic Life Stories | ![]() | ![]() | ![]() | ![]() | B | Hanna Lappalainen | https://blogs.helsinki.fi/100suomalaista/ | ||
Akkala | The Corpus of Spoken and Written Akkala Saami | ![]() | ![]() | ![]() | ![]() | Michael Riessler | ||||
amph-korp | amph-Corpus, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | Antti Arppe | ||||
coronavirus-2021-05-src | The Coronavirus Corpus - Kielipankki version 2021-05, source | ![]() | ![]() | B | FIN-CLARIN | |||||
DIALUKI | DIALUKI - Diagnosing reading and writing in a second or foreign language | ![]() | ![]() | ![]() | ![]() | ![]() | Ari Huhta | |||
digitala-autumn2021 | DigiTala: L2 Finnish data from upper secondary schools and university, autumn 2021 | ![]() | ![]() | ![]() | ![]() | ![]() | B | Anna von Zansen | https://zenodo.org/communities/digitala/about/ | |
digitala-spring2021 | DigiTala: L2 Finnish data from upper secondary schools, spring 2021 | ![]() | ![]() | ![]() | ![]() | ![]() | B | Anna von Zansen | https://zenodo.org/communities/digitala/about/ | |
digitala-yki | DigiTala's YKI data | ![]() | ![]() | ![]() | ![]() | ![]() | B | Heini Kallio | https://zenodo.org/communities/digitala/about/ | |
dma-v2 | Digital Morphology Archives, new version | ![]() | ![]() | VRT | ![]() | |||||
dma-wn-fn-src | The Word Notes of the Morphology Archives with field reports, source | ![]() | ![]() | |||||||
dma-wn-src | The Word Notes of the Digital Morphology Archives, source | ![]() | ![]() | ![]() | ![]() | |||||
DSPCON2013-2015-korp | Aalto University DSP Course Conversation Corpus 2013-2015, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | Mikko Kurimo, Seppo Enarvi | ||||
eduskunta-v2-dl | Plenary Sessions of the Parliament of Finland, Downloadable Version 2 | ![]() | ![]() | ![]() | ![]() | |||||
eduskunta-v2-korp | Plenary Sessions of the Parliament of Finland, Kielipankki Korp Version 2 | ![]() | ![]() | ![]() | ![]() | |||||
enets | Enets Corpus | ![]() | ![]() | ![]() | ![]() | Olesya Khanina | ||||
english-uhlcs-korp | English Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
erme-dl | ERME Erzya and Moksha Extended Corpora, full text/download version | ![]() | ![]() | ![]() | Jack Rueter | |||||
Ersä | Corpus of Colloquial Erzya | ![]() | ![]() | ![]() | ![]() | Riho Grünthal | ||||
erzya-moksha-komi-uhlcs-korp | Corpus of Erzya and Moksha Mordvin Literature and Journals and Komi Zyrian Literature (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
erzya-moksha-uhlcs-korp | Erzya and Moksha Mordvin Word List Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
estonian1-uhlcs-korp | Estonian Corpus 1 (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
estonian2-uhlcs-korp | Estonian Corpus 2 (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
fcaa | Finnish Conversation Analysis Archive | ![]() | ![]() | ![]() | Mari Siiroinen | https://metashare.csc.fi/repository/browse/finnish-conversation-analysis-archive/65669f5eb7e611eb9cdefa163ec5ae3e69c8f5f510064ad999f16144700b1156/ | ||||
fedidi | Citation Database of Fennistic Dialect Dissertations | ![]() | ![]() | ![]() | ![]() | |||||
findarc | Finnish Dark Web Marketplace Corpus | ![]() | ![]() | ![]() | ![]() | ![]() | Tuomas Harviainen | |||
finears | Finnish electroacoustic music interviews | ![]() | ![]() | ![]() | ![]() | Mikko Ojanen | https://blogs.helsinki.fi/finnish-electroacoustic-resources/ | |||
FinIntas | The FinINTAS Corpus of Spontaneous and Read-aloud Finnish Speech | ![]() | ![]() | ![]() | ![]() | Mietta Lennes | ||||
finlangus | Spoken language and linguistic tasks of Finnish-American immigrants and controls | ![]() | Nana Lehtinen | |||||||
finnish-bibles-uhlcs-korp | Finnish Corpus (Bibles) (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
finnish-literature-uhlcs-korp | Finnish Corpus (Literature) (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
FinnTreeBank1-korp | The Helsinki Korp Version of the Finnish TreeBank 1 | ![]() | ![]() | ![]() | ![]() | ![]() | ||||
ha-korp | Ha Language Corpus, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | ![]() | Lotta Aunio | |||
hanty-uhlcs-korp | Khanty Corpus (North Khanty, Corpora and Translations) (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
helpuhe-2010txt | The Longitudinal Corpus of Finnish Spoken in Helsinki (2010 in text form) | ![]() | ![]() | ![]() | ![]() | Hanna Lappalainen | ||||
helpuhe-v2-korp | The Longitudinal Corpus of Finnish Spoken in Helsinki (1970s, 1990s and 2010s), Helsinki Korp Version 2 | ![]() | ![]() | ![]() | ![]() | ![]() | Hanna Lappalainen | |||
helpuhe-v2-lat | The Longitudinal Corpus of Finnish Spoken in Helsinki (1970s, 1990s and 2010s), Helsinki LAT Version 2 | ![]() | ![]() | ![]() | ![]() | ![]() | B | Hanna Lappalainen | ||
HS | The Helsingin Sanomat Archive Corpus | ![]() | ![]() | ![]() | ![]() | Jarkko Rahkonen | ||||
ingrian-uhlcs-korp | Ingrian Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
Inkerin murteet | The Corpus of Ingrian Finnish | ![]() | ![]() | ![]() | ![]() | Marjatta Palander | www, muuta | |||
iweb-src | The Intelligent Web Corpus - Kielipankki version, source | ![]() | B | FIN-CLARIN | ||||||
kikosa-haa | University of Oulu Kikosa Collection: Group interviews | ![]() | ![]() | ![]() | ![]() | Maria Frick | ||||
kikosa-kok | University of Oulu Kikosa Collection: Student meetings | ![]() | ![]() | ![]() | ![]() | Maria Frick | ||||
Kiltinänsaame | The Corpus of Written Kildin Saami | ![]() | ![]() | ![]() | ![]() | ![]() | Mikael Riessler | |||
Kiltinänsaame (UHLCS) | Kildin Saami Corpus (UHLCS) | ![]() | ![]() | ![]() | ![]() | Pirkko Suihkonen | ||||
komi-ikdp | Spoken Komi Corpus: IKDP | ![]() | ![]() | ![]() | ![]() | Niko Partanen | ||||
komi-uhlcs-korp | Komi Zyrian Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
kra-korp | Jyväskylä Corpus of Middle French, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
latin-uhlcs-korp | Latin Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
long-second | The Long Second Corpus: LONGitudinal Classroom Data about Children’s Development in Finnish as a SECOND Language | ![]() | ![]() | ![]() | ![]() | Maria Ahlholm | ||||
Lönnrot | Elias Lönnrot Letters Online | ![]() | ![]() | ![]() | ![]() | ![]() | Kirsi Keravuori | www | ||
lude-uhlcs-korp | Lude (Ludian) Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
medievalturku | Corpus of landscapes in medieval documents from Turku, source | ![]() | ![]() | ![]() | ![]() | ![]() | B | Hanna-Mari Kupari | ||
mepu-src | Corpus of Spoken Meänkieli, source | ![]() | ![]() | ![]() | ![]() | ![]() | B | Niina Kunnas | ||
mlcca | MLCCA, Multilingual Corpus of Contracts and Agreements | ![]() | ![]() | ![]() | ![]() | ![]() | A | Mikhail Mikhailov | ||
movie-src | The Movie Corpus - Kielipankki version, source | ![]() | B | FIN-CLARIN | ||||||
mutable-src | Multimodal Translation with the Blind | ![]() | ![]() | ![]() | ![]() | B | Maija Hirvonen | https://projects.tuni.fi/mutable/the-mutable-corpus/ | ||
nenets-uhlcs-korp | Nenets Corpus (Tundra Nenets) (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
Nganasan | Nganasan Speech Corpus | ![]() | ![]() | ![]() | ![]() | Larisa Leisiö | ||||
nmk-korp | Changes in Place Names Corpus, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | ![]() | Elisa Stenvall | |||
nmk-lat | Changes in Place Names Corpus, Helsinki LAT Version | ![]() | ![]() | ![]() | ![]() | ![]() | Elisa Stenvall | |||
NorDiga | The Nordica Digital Archive | ![]() | ![]() | ![]() | ![]() | Jan Lindström | www | |||
north-saami-literature-uhlcs-korp | North Saami Corpus (Literature) (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
north-saami-report-uhlcs-korp | North Saami Corpus (Sámikultuvradoaibmagotti smiehttamush) (UHLCS), Helsinki Korp Version Corpus | ![]() | ![]() | ![]() | ![]() | |||||
now-2021-05-src | News on the Web - Kielipankki version 2021-05, source | ![]() | B | FIN-CLARIN | ||||||
nzadi | Nzadi Corpus | ![]() | ![]() | ![]() | ![]() | Thera Marie Crane | ||||
ona | The Audio Recordings Archive of Oulu (ONA) | ![]() | ![]() | ![]() | ![]() | ![]() | Niina Kunnas | |||
Opus ECB | Opus ECB Corpus | ![]() | ![]() | ![]() | ![]() | Jörg Tiedemann | ||||
Opus EU | Opus EU Corpus | ![]() | ![]() | ![]() | ![]() | Jörg Tiedemann | ||||
Opus Localization | Opus Localization Corpus | ![]() | ![]() | ![]() | Jörg Tiedemann | |||||
Opus Subtitles | Opus Subtitles Corpus | ![]() | ![]() | ![]() | ![]() | Jörg Tiedemann | ||||
oulu-korp | Oulu Corpus, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
parole-fi-korp | The Finnish Parole Corpus, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
PERSO | PERSO Databases for Finnish Speech Synthesis | ![]() | ![]() | ![]() | ![]() | Martti Vainio, Heini Kallio | ||||
ProoF | ProoF - Pronunciation of Finnish by Immigrants in Finland | ![]() | ![]() | ![]() | ![]() | Mietta Lennes | ||||
Prosodiakorpus | Corpus of Prosodic Variation of Finnish | ![]() | ![]() | ![]() | ![]() | Tommi Kurki, Tommi Nieminen | ||||
puhelahjat-annotated | Donate Speech: Annotated dataset (for commercial use) | ![]() | ![]() | ![]() | ![]() | A | FIN-CLARIN | https://www.kielipankki.fi/lahjoita-puhetta/ | ||
puhelahjat-annotated | Donate Speech: Annotated dataset | ![]() | ![]() | ![]() | ![]() | ![]() | A | FIN-CLARIN | https://www.kielipankki.fi/lahjoita-puhetta/ | |
puhelahjat-dev | Donate Speech: Selected dataset, Development data (10h) | ![]() | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | |
puhelahjat-dev | Donate Speech, Selected dataset: Development data (10h) (commercial use) | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | ||
puhelahjat-korp | Donate Speech Corpus, Korp | ![]() | ![]() | ![]() | ![]() | ![]() | A | FIN-CLARIN | ||
puhelahjat-selected | Donate Speech: Selected dataset (for commercial use) | ![]() | ![]() | ![]() | ![]() | A | FIN-CLARIN | https://www.kielipankki.fi/lahjoita-puhetta/ | ||
puhelahjat-selected | Donate Speech: Selected dataset | ![]() | ![]() | ![]() | ![]() | ![]() | A | FIN-CLARIN | https://www.kielipankki.fi/lahjoita-puhetta/ | |
puhelahjat-test | Donate Speech: Selected dataset, Test data (10h) | ![]() | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | |
puhelahjat-test | Donate Speech, Selected dataset: Test data (10h) (commercial use) | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | ||
puhelahjat-test-mtr | Donate Speech: Selected dataset, Multi-transcriber test data (1h) | ![]() | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | |
puhelahjat-test-mtr | Donate Speech, Selected dataset: Multi-transcriber test data (1h) (commercial use) | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | ||
puhelahjat-test-mtrs | Donate Speech: Selected dataset, Test data from multi-transcriber speakers (10h) | ![]() | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | |
puhelahjat-test-mtrs | Donate Speech, Selected dataset: Test data from multi-transcriber speakers (10h) (commercial use) | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | ||
puhelahjat-train | Donate Speech: Selected dataset, Training data (100h) | ![]() | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | |
puhelahjat-train | Donate Speech, Selected dataset: Training data (100h) (commercial use) | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | ||
quantlang-uhlcs-korp | Quantifiers and Quantification in Finnish and Languages Spoken in the Central Volga–Kama Region (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
Saamen kielen korpus | Giellagas Corpus of Spoken Saami Languages | ![]() | ![]() | ![]() | ![]() | Marko Jouste | ||||
sapu | The Corpus of Sociolinguistic Variation in the Province of Satakunta | ![]() | ![]() | ![]() | ![]() | ![]() | Tommi Kurki | |||
sfnet-korp | SFNET Corpus, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
SignWiki | The SignWiki Project of the Sign Languages in Finland | ![]() | ![]() | ![]() | ![]() | Leena Savolainen | www | |||
skk-vrt | Classics of Finnish Literature, VRT | ![]() | ![]() | VRT | Petri Lauerma | |||||
soap-src | Corpus of American Soap Operas - Kielipankki version, source | ![]() | B | FIN-CLARIN | ||||||
stat-fi-en | Statistics Finland Translation Memory Finnish-English | ![]() | ![]() | ![]() | ||||||
stat-fi-sv | Statistics Finland's Finnish to Swedish Translation Memory | ![]() | ![]() | ![]() | ||||||
stt-fi-1992-2018-korp | Finnish News Agency Archive 1992-2018, Kielipankki Korp Version | ![]() | ![]() | ![]() | ![]() | Olli Viitala | ||||
sus-fieldwork | The Finno-Ugrian Society Fieldwork Corpus | ![]() | ![]() | ![]() | ![]() | ![]() | Jack Rueter | |||
Suvi | Suvi Finnish Sign Language Online Dictionary | ![]() | ![]() | ![]() | ![]() | Leena Savolainen | www | |||
TAITO | Written and Oral Data of the TAITO-project | ![]() | ![]() | ![]() | ![]() | Marjo Vesalainen | www | |||
tampuhe | Longitudinal data of Tampere spoken language | ![]() | ![]() | ![]() | ![]() | ![]() | Liisa Mustanoja | |||
tboneslim-src | T-Bone Slim Corpus, source | ![]() | ![]() | ![]() | ![]() | ![]() | A | Kirsti Salmi-Niklander | https://blogs.helsinki.fi/tboneslim | |
testipiste | Testipiste Corpus | ![]() | ![]() | ![]() | Janne Laitinen | |||||
Turjansaame | The Corpus of Spoken and Written Ter Saami | ![]() | ![]() | ![]() | ![]() | ![]() | Michael Riessler | |||
tv-src | The TV Corpus - Kielipankki version, source | ![]() | B | FIN-CLARIN | ||||||
tver-1980 | The Corpus of Tver Karelian 1957-1971 | ![]() | ![]() | ![]() | ![]() | B | Marjatta Palander | |||
tver-2020 | The Corpus of Tver Karelian 2016-2019 | ![]() | ![]() | ![]() | ![]() | B | Marjatta Palander | |||
ume-saami-uhlcs-korp | Ume Saami Corpus (UHLCS), Helsinki Korp Version Corpus | ![]() | ![]() | ![]() | ![]() | |||||
uralic-uhlcs-korp | Uralic, Turkic, Indo-Iranian and Mongol languages; languages of Siberia and Caucasia (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
uzbek-uhlcs-korp | Uzbek-English Dictionary (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
VVKS | Virtual Old Literary Finnish (VVKS) - Kielipankki Korp version | ![]() | ![]() | ![]() | ![]() | ![]() | Mari Siiroinen | |||
wikipedia-fi-2017-korp | Finnish Wikipedia 2017, Korp | ![]() | ![]() | ![]() | ![]() | ![]() | Tatu Huovilainen | |||
wordlists-uhlcs-korp | Lists of Words Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
Yle-subtitle | The Finnish Broadcasting Company Corpus of Subtitles | ![]() | ![]() | ![]() | ![]() | Jukka Mäkisalo | ||||
ylenews-fi-2019-2021-selko-korp | Yle News Archive Easy-to-read Finnish 2019-2021, Korp | ![]() | ![]() | A | ||||||
ylenews-fi-2019-2021-selko-s-korp | Yle News Archive Easy-to-read Finnish 2019-2021, scrambled, Korp | ![]() | ![]() | A |