Akkala | The Corpus of Spoken and Written Akkala Saami | ![]() | ![]() | ![]() | ![]() | Michael Riessler | ||||
amph-korp | amph-Corpus, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | Antti Arppe | ||||
DIALUKI | DIALUKI - Diagnosing reading and writing in a second or foreign language | ![]() | ![]() | ![]() | ![]() | ![]() | Ari Huhta | |||
dma-v2 | Digital Morphology Archives, new version | ![]() | ![]() | VRT | ![]() | |||||
dma-wn-fn-src | The Word Notes of the Morphology Archives with field reports, source | ![]() | ![]() | |||||||
dma-wn-src | The Word Notes of the Digital Morphology Archives, source | ![]() | ![]() | ![]() | ![]() | |||||
DSPCON2013-2015-korp | Aalto University DSP Course Conversation Corpus 2013-2015, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | Mikko Kurimo, Seppo Enarvi | ||||
eduskunta-v2-dl | Plenary Sessions of the Parliament of Finland, Downloadable Version 2 | ![]() | ![]() | ![]() | ![]() | |||||
eduskunta-v2-korp | Plenary Sessions of the Parliament of Finland, Kielipankki Korp Version 2 | ![]() | ![]() | ![]() | ![]() | |||||
enets | Enets Corpus | ![]() | ![]() | ![]() | ![]() | Olesya Khanina | ||||
english-uhlcs-korp | English Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
erme-dl | ERME Erzya and Moksha Extended Corpora, full text/download version | ![]() | ![]() | ![]() | Jack Rueter | |||||
Ersä | Corpus of Colloquial Erzya | ![]() | ![]() | ![]() | ![]() | Riho Grünthal | ||||
erzya-moksha-komi-uhlcs-korp | Corpus of Erzya and Moksha Mordvin Literature and Journals and Komi Zyrian Literature (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
erzya-moksha-uhlcs-korp | Erzya and Moksha Mordvin Word List Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
estonian1-uhlcs-korp | Estonian Corpus 1 (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
estonian2-uhlcs-korp | Estonian Corpus 2 (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
fcaa | Finnish Conversation Analysis Archive | ![]() | ![]() | ![]() | Mari Siiroinen | https://metashare.csc.fi/repository/browse/finnish-conversation-analysis-archive/65669f5eb7e611eb9cdefa163ec5ae3e69c8f5f510064ad999f16144700b1156/ | ||||
fedidi | Citation Database of Fennistic Dialect Dissertations | ![]() | ![]() | ![]() | ![]() | |||||
finchat-src | Finnish conversational chat corpus, source | Mikko Kurimo | ||||||||
findarc | Finnish Dark Web Marketplace Corpus | ![]() | ![]() | ![]() | ![]() | ![]() | Teemu Ruokolainen | |||
finears | Finnish electroacoustic music interviews | ![]() | ![]() | ![]() | ![]() | Mikko Ojanen | https://blogs.helsinki.fi/finnish-electroacoustic-resources/ | |||
FinIntas | The FinINTAS Corpus of Spontaneous and Read-aloud Finnish Speech | ![]() | ![]() | ![]() | ![]() | Mietta Lennes | ||||
finlangus | Spoken language and linguistic tasks of Finnish-American immigrants and controls | ![]() | Nana Lehtinen | |||||||
finnish-bibles-uhlcs-korp | Finnish Corpus (Bibles) (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
finnish-literature-uhlcs-korp | Finnish Corpus (Literature) (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
FinnTreeBank1-korp | Finnish TreeBank 1, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | ![]() | ||||
ha-korp | Ha Language Corpus, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | ![]() | Lotta Aunio | |||
hanty-uhlcs-korp | Khanty Corpus (North Khanty, Corpora and Translations) (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
helpuhe-2010txt | The Longitudinal Corpus of Finnish Spoken in Helsinki (2010 in text form) | ![]() | ![]() | ![]() | ![]() | Hanna Lappalainen | ||||
helpuhe-v2-korp | The Longitudinal Corpus of Finnish Spoken in Helsinki (1970s, 1990s and 2010s), Helsinki Korp Version 2 | ![]() | ![]() | ![]() | ![]() | ![]() | Hanna Lappalainen | |||
helpuhe-v2-lat | The Longitudinal Corpus of Finnish Spoken in Helsinki (1970s, 1990s and 2010s), Helsinki LAT Version 2 | ![]() | ![]() | ![]() | ![]() | ![]() | B | Hanna Lappalainen | ||
HS | The Helsingin Sanomat Archive Corpus | ![]() | ![]() | ![]() | ![]() | Jarkko Rahkonen | ||||
ingrian-uhlcs-korp | Ingrian Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
Inkerin murteet | The Corpus of Ingrian Finnish | ![]() | ![]() | ![]() | ![]() | Marjatta Palander | www, muuta | |||
Kiltinänsaame | The Corpus of Written Kildin Saami | ![]() | ![]() | ![]() | ![]() | ![]() | Mikael Riessler | |||
Kiltinänsaame (UHLCS) | Kildin Saami Corpus (UHLCS) | ![]() | ![]() | ![]() | ![]() | Pirkko Suihkonen | ||||
komi-ikdp | Spoken Komi Corpus: IKDP | ![]() | ![]() | ![]() | ![]() | Niko Partanen | ||||
komi-uhlcs-korp | Komi Zyrian Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
kra-korp | Jyväskylä Corpus of Middle French, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
latin-uhlcs-korp | Latin Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
long-second | The Long Second Corpus: LONGitudinal Classroom Data about Children’s Development in Finnish as a SECOND Language | ![]() | ![]() | ![]() | ![]() | Maria Ahlholm | ||||
lonnrot-src | Elias Lönnrot Letters Online, source | ![]() | XML | ![]() | ||||||
lude-uhlcs-korp | Lude (Ludian) Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
Lönnrot | Elias Lönnrot Letters Online | ![]() | ![]() | ![]() | ![]() | ![]() | Kirsi Keravuori | www | ||
nenets-uhlcs-korp | Nenets Corpus (Tundra Nenets) (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
Nganasan | Nganasan Speech Corpus | ![]() | ![]() | ![]() | ![]() | Larisa Leisiö | ||||
nmk-korp | Changes in Place Names Corpus, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | ![]() | Elisa Stenvall | |||
nmk-lat | Changes in Place Names Corpus, Helsinki LAT Version | ![]() | ![]() | ![]() | ![]() | ![]() | Elisa Stenvall | |||
NorDiga | The Nordica Digital Archive | ![]() | ![]() | ![]() | ![]() | Jan Lindström | www | |||
north-saami-literature-uhlcs-korp | North Saami Corpus (Literature) (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
north-saami-report-uhlcs-korp | North Saami Corpus (Sámikultuvradoaibmagotti smiehttamush) (UHLCS), Helsinki Korp Version Corpus | ![]() | ![]() | ![]() | ![]() | |||||
nzadi | Nzadi Corpus | ![]() | ![]() | ![]() | ![]() | Thera Marie Crane | ||||
ona | The Audio Recordings Archive of Oulu (ONA) | ![]() | ![]() | ![]() | ![]() | ![]() | Niina Kunnas | |||
Opus ECB | Opus ECB Corpus | ![]() | ![]() | ![]() | ![]() | Jörg Tiedemann | ||||
Opus EU | Opus EU Corpus | ![]() | ![]() | ![]() | ![]() | Jörg Tiedemann | ||||
Opus Localization | Opus Localization Corpus | ![]() | ![]() | ![]() | Jörg Tiedemann | |||||
Opus Subtitles | Opus Subtitles Corpus | ![]() | ![]() | ![]() | ![]() | Jörg Tiedemann | ||||
oracc-korp-2021-06 | Open Richly Annotated Cuneiform Corpus, Korp Version, June 2021 | ![]() | ![]() | |||||||
oulu-korp | Oulu Corpus, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
parole-fi-korp | The Finnish Parole Corpus, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
PERSO | PERSO Databases for Finnish Speech Synthesis | ![]() | ![]() | ![]() | ![]() | Martti Vainio, Heini Kallio | ||||
ProoF | ProoF - Pronunciation of Finnish by Immigrants in Finland | ![]() | ![]() | ![]() | ![]() | Mietta Lennes | ||||
Prosodiakorpus | Corpus of Prosodic Variation of Finnish | ![]() | ![]() | ![]() | ![]() | Tommi Kurki, Tommi Nieminen | ||||
puhelahjat | Donate Speech Corpus, version 1.0 (for research use) | ![]() | ![]() | ![]() | ![]() | ![]() | A | FIN-CLARIN | https://www.kielipankki.fi/lahjoita-puhetta/ | |
puhelahjat-ann | Donate Speech: Annotated dataset (for commercial use) | ![]() | ![]() | ![]() | ![]() | A | FIN-CLARIN | https://www.kielipankki.fi/lahjoita-puhetta/ | ||
puhelahjat-complete | Donate Speech: Complete dataset (version 1, for commercial use) | ![]() | ![]() | ![]() | ![]() | A | FIN-CLARIN | https://www.kielipankki.fi/lahjoita-puhetta/ | ||
puhelahjat-dev | Donate Speech Corpus: Development data (10h) | ![]() | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | |
puhelahjat-sample | Donate Speech Corpus: Sample (yrityskäyttöön) | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | ||
puhelahjat-sel | Donate Speech: Selected dataset (for commercial use) | ![]() | ![]() | ![]() | ![]() | A | FIN-CLARIN | https://www.kielipankki.fi/lahjoita-puhetta/ | ||
puhelahjat-test | Donate Speech Corpus: Test data (10h) | ![]() | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | |
puhelahjat-test-mtr | Donate Speech Corpus: Multi-transcriber test data (1h) | ![]() | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | |
puhelahjat-test-mtrs | Donate Speech Corpus: Test data from multi-transcriber speakers (10h) | ![]() | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | |
puhelahjat-train | Donate Speech Corpus: Training data (100h) | ![]() | ![]() | ![]() | ![]() | ![]() | A | Anssi Moisio | https://www.kielipankki.fi/lahjoita-puhetta/ | |
quantlang-uhlcs-korp | Quantifiers and Quantification in Finnish and Languages Spoken in the Central Volga–Kama Region (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
Saamen kielen korpus | Giellagas Corpus of Spoken Saami Languages | ![]() | ![]() | ![]() | ![]() | Marko Jouste | ||||
sfnet-korp | SFNET Corpus, Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
SignWiki | The SignWiki Project of the Sign Languages in Finland | ![]() | ![]() | ![]() | ![]() | Leena Savolainen | www | |||
skk-vrt | Classics of Finnish Literature, VRT | ![]() | ![]() | VRT | Petri Lauerma | |||||
skn-vrt | Samples of Spoken Finnish, VRT Version | ![]() | ![]() | ![]() | ![]() | |||||
stat-fi-en | Statistics Finland Translation Memory Finnish-English | ![]() | ![]() | ![]() | ||||||
stat-fi-sv | Statistics Finland's Finnish to Swedish Translation Memory | ![]() | ![]() | ![]() | ||||||
stt-fi-1992-2018-korp | Finnish News Agency Archive 1992-2018, Kielipankki Korp Version | ![]() | ![]() | ![]() | ![]() | Olli Viitala | ||||
sus-fieldwork | The Finno-Ugrian Society Fieldwork Corpus | ![]() | ![]() | ![]() | ![]() | ![]() | Jack Rueter | |||
Suvi | Suvi Finnish Sign Language Online Dictionary | ![]() | ![]() | ![]() | ![]() | Leena Savolainen | www | |||
TAITO | Written and Oral Data of the TAITO-project | ![]() | ![]() | ![]() | ![]() | Marjo Vesalainen | www | |||
testipiste | Testipiste Corpus | ![]() | ![]() | ![]() | Janne Laitinen | |||||
Turjansaame | The Corpus of Spoken and Written Ter Saami | ![]() | ![]() | ![]() | ![]() | ![]() | Michael Riessler | |||
ume-saami-uhlcs-korp | Ume Saami Corpus (UHLCS), Helsinki Korp Version Corpus | ![]() | ![]() | ![]() | ![]() | |||||
uralic-uhlcs-korp | Uralic, Turkic, Indo-Iranian and Mongol languages; languages of Siberia and Caucasia (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
uzbek-uhlcs-korp | Uzbek-English Dictionary (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
VVKS | Virtual Old Literary Finnish (VVKS) - Kielipankki Korp version | ![]() | ![]() | ![]() | ![]() | ![]() | Mari Siiroinen | |||
wikipedia-fi-2017-korp | Finnish Wikipedia 2017, Korp | ![]() | ![]() | ![]() | ![]() | ![]() | Tatu Huovilainen | |||
wordlists-uhlcs-korp | Lists of Words Corpus (UHLCS), Helsinki Korp Version | ![]() | ![]() | ![]() | ![]() | |||||
Yle-subtitle | The Finnish Broadcasting Company Corpus of Subtitles | ![]() | ![]() | ![]() | ![]() | Jukka Mäkisalo |