News


Researcher of the Month: Olli Kuparinen

12.7.2021

Olli Kuparinen tells us about his research on language variation and change, in which he has used The Longitudinal Corpus of Finnish Spoken in Helsinki (1970s, 1990s and 2010s), the Samples of Spoken Finnish and The Finnish Dialect Syntax Archive.


Now available: Route to A wing Corpus, Downloadable Version

1.7.2021

The good old ”Reittidemo” corpus is now downloadable. The material was originally created for demonstration and testing purposes, and the content can be used freely under the open CC0 (public domain) license. Route to A wing Corpus, Downloadable Version: see the metadata, open the […]


Helsinki Corpus of English Texts, VRT in download

1.7.2021

The Helsinki Corpus of English Texts in VRT format is available in the download service at Kielipankki www.kielipankki.fi/download. Helsinki Corpus of English Texts, VRT: corpus description, corpus in download. More information can be found on the resource group page.


Helsinki Corpus of Scottish Correspondence (1540-1750), VRT in download

1.7.2021

The Helsinki Corpus of Scottish Correspondence (1540-1750) in VRT format is available in the download service at Kielipankki www.kielipankki.fi/download. Helsinki Corpus of Scottish Correspondence (1540-1750), VRT: corpus description, corpus in download. More information can be found on the resource group page.


Yle News Archive Easy-to-read Finnish 2019-2020, source material published in download service

1.7.2021

The corpus, containing articles from the YLE website https://yle.fi/uutiset/osasto/selkouutiset/ from 2019 and 2020, is available in the download service at Kielipankki www.kielipankki.fi/download/. Yle News Archive Easy-to-read Finnish 2019-2020, source: corpus description, corpus in download. All available corpora of the Yle News Archive can […]


Now available for download: Yves Montand in the USSR interviews, source

29.6.2021

The corpus Yves Montand in the USSR interviews, source (MONTINT) is now available for download in Kielipankki – the Language Bank of Finland.


The Downloadable Version of Classics of English and American Literature as translated by Kersti Juva, English-Finnish parallel corpus, scrambled

22.6.2021

The Downloadable Version of Classics of English and American Literature as translated by Kersti Juva, English-Finnish parallel corpus, scrambled, is available in the download service.


Researcher of the Month: Karita Suomalainen

15.6.2021

Karita Suomalainen tells us about her research on interactional linguistics, in which she has used the ArkiSyn Database of Finnish Conversational Discourse, The Finnish Dialect Syntax Archive and The Suomi24 Sentences Corpus 2001-2017.


ANEE lexical portals of Akkadian

11.6.2021

Team 1 of the Centre of Excellence in Ancient Near Eastern Empires (ANEE) has created lexical portals that function as a graphic semantic dictionary. Through these portals, users can explore semantic networks for one or more words of interest. By following the links, one can also […]


The language identifier HeLI-OTS 1.0 is now downloadable from Zenodo

9.6.2021

The general language identifier HeLI-OTS 1.0 is an automatic tool that identifies the language of each line of text in the input file. HeLI-OTS 1.0 selects the best match among 200 languages. The publication of HeLI-OTS 1.0 is one of the results […]


Yle Finnish News Archive 2019-2020, source material published in download service

27.5.2021

The corpus, containing the articles from YLE https://yle.fi from 2019 and 2020, is available in the download service at Kielipankki www.kielipankki.fi/download. Yle Finnish News Archive 2019-2020, source: corpus description, corpus in download. All available corpora of the Yle News Archive can be found from […]


Yle Swedish News Archive 2019-2020, source material published in download service

27.5.2021

The corpus, containing the articles from Svenska YLE https://svenska.yle.fi from 2019 and 2020, is available in the download service at Kielipankki www.kielipankki.fi/download. Yle Swedish News Archive 2019-2020, source: corpus description, corpus in download. All available corpora of the Yle News Archive can be found […]


The Language Bank of Finland, speech technology and Donate Speech campaign presented in Telia podcast

12.5.2021

Mietta Lennes from the Language Bank of Finland discusses speech technology and the Donate Speech campaign with Kia Tolppanen and Harri Moisio in a podcast by Telia Finland on 12.5.2021.


Researcher of the Month: Mila Oiva

10.5.2021

Mila Oiva tells us about her research in Cultural History, including the making of the resource ”Yves Montand in the USSR interviews”.


Corpus of Old Church Slavonic Texts in the download service

23.4.2021

Corpus Cyrillo-Methodianum Helsingiense: Corpus of Old Church Slavonic Texts, source, is available in the download service. The corpus is available as a zip package and as web pages.


Iijoki collection in the download service

16.4.2021

Iijoki, the University of Oulu Päätalo collection, is available in the download service as a source version in text format and as an analyzed version in VRT format.


Researcher of the Month: Gwenaëlle Bauvois

12.4.2021

Gwenaëlle Bauvois tells us about her research based on various media data sources, including the Plenary Sessions of the Parliament of Finland, Downloadable Version 1, available via Kielipankki.


YLE News Archive

1.4.2021

The Yle News Archive is available for download in VRT format. In addition to the source material, the resources are now also available in VRT format from the download service as two variants, with the same sentences but with different availability and features: the variant available for academic users logged in to the download service has sentences in […]


FinEst BERT in the download service

30.3.2021

FinEst BERT, a multilingual cased BERT base model trained on three languages (Finnish, Estonian and English), is available in the download service at Kielipankki korp.csc.fi/download. FinEst BERT: corpus description, corpus in download


Finnish News Agency Archive 1992-2018, CoNLL-U, source in download

23.3.2021

The corpus is available in the download service at Kielipankki korp.csc.fi/download. This is the parsed version of the Finnish News Agency Archive 1992-2018 corpus. The corpus was parsed by Khalid Alnajjar (University of Helsinki) using the Turku neural parser pipeline (http://turkunlp.org/Turku-neural-parser-pipeline/). Finnish News Agency Archive 1992-2018, CoNLL-U, […]

