News


How to cite a corpus that is available in the Language Bank of Finland

19.5.2020

Did you know that the Language Bank of Finland offers reference instructions for every corpus version? It is easy to include an accurate reference in publications!


Researcher of the Month: Juhana Salonen

7.5.2020

Researcher of the Month: Juhana Salonen Photo: Hanna-Kaisa Hämäläinen   Kielipankki – The Language Bank of Finland is a service for researchers using language resources. Juhana Salonen, project researcher at the University of Jyväskylä tells us about publishing the resource Corpus of Finnish Sign Language. Who are you? My name is Juhana Salonen and I […]


Tero Aalto: What a researcher should know about persistent identifiers (blog)

27.4.2020

Even the most useful information has no practical value if nobody can access it. Persistence is especially important for scientific knowledge. By using persistent identifiers, data, publications and people can still be found years from now and can receive the attention and appreciation they deserve.


Blog: Language technology paving the way for digital humanities (Tero Aalto, CSC)

7.4.2020

Computational methods are nothing new in digital humanities and social sciences. In fact, language research has a special tool that puts it ahead of the pack: language technology, a field that combines linguistics and computer science.


Researcher of the Month: Mikhail Mikhailov

6.4.2020

Researcher of the Month: Mikhail Mikhahilov Photo: University of Helsinki   Kielipankki – The Language Bank of Finland is a service for researchers using language resources. Mikhail Mikhail, professor of Translation Studies at the Tampere University tells us about how he makes use of the resources ParFin, Finnish-Russian Parallel Corpus of Literary Texts and ParRus […]


The Suomi24 Corpus 2001-2017, VRT version 1.1 published in Download service

18.3.2020

The Suomi24 Corpus 2001-2017, VRT version 1.1 published in Download service In January, we published a new version of the Suomi24 2017H2 in Korp service with the name ”The Suomi24 Sentences Corpus 2001-2017, Korp version 1.1” and now we published the respective new version in the Download service as ”The Suomi24 Corpus 2001-2017, VRT version […]


Beta-status removed from Wanca 2016, Korp Version and the source and VRT versions published in the Download service

13.3.2020

Beta-status removed from Wanca 2016, Korp Version and the source and VRT versions published in the Download service The Korp version of Wanca 2016  is a collection of web corpora in small Uralic languages. The collection is composed of 29 sentence corpora in different languages. The corpora have been collected from the Internet using the […]


Researcher of the Month: Markus Mattila

7.3.2020

Researcher of the Month: Markus Mattila Photo: Markus Mattila   Kielipankki – The Language Bank of Finland is a service for researchers using language resources. Markus Mattila, MA graduated from Åbo Akademi tells us about how he makes use of the resource The Suomi 24 Sentences Corpus (2017H2) (beta). Who are you? I am Markus […]


Suomi24 2001–2017: Corrected writer nickname information

21.2.2020

Suomi24 2001–2017: Corrected writer nickname information The Suomi24 2001–2017 corpus has been updated to Korp version 1.1, in which writer nickname information has been corrected. The following modifications have been made to the corpus: – All messages have the writer nickname information also in the years 2009–2012 and 2014, in which a large number of […]


New: Search Google Scholar for publications related to individual corpora in the Language Bank of Finland

17.2.2020

Kielipankin aineistojen viittausohjeisiin on nyt lisätty Google Scholar -hakulinkit, joilla voi kätevästi etsiä aiempia samaan aineistoon liittyviä julkaisuja. Viittausohjeiden noudattaminen kannattaa!


New corpus (beta): Classics of English and American Literature as translated by Kersti Juva, English–Finnish parallel corpus

11.2.2020

The Classics of English and American Literature as translated by Kersti Juva, English-Finnish parallel corpus, scrambled, Korp is now available in Korp as a beta test version: metadata, link to Korp. The corpus is offered for academic users logged in to Korp. The paragraphs of each text have been scrambled. The corpus contains the following […]


Researcher of the Month: Anita Nuopponen

7.2.2020

Researcher of the Month: Anita Nuopponen Photo: Harri Huusko   Kielipankki – The Language Bank of Finland is a service for researchers using language resources. Anita Nuopponen, professor in technical communication at the University of Vaasa tells us about how she makes use of the resource The Finnish Sub-corpus of the Newspaper and Periodical Corpus […]


FIN-CLARIN and the Language Bank of Finland presented at CA18209 ”NexusLinguarum” plenary meeting in Prague

28.1.2020

Finland is currently participating in the COST Action 18209 ”NexusLinguarum”. FIN-CLARIN and the Language Bank of Finland were presented by Mietta Lennes. You can see the poster here.


ParlaCLARIN II: LREC2020 workshop on creating, using and linking parliamentary corpora with other types of political discourse

17.1.2020

The 2020 ParlaCLARIN workshop will be held in Marseille (France), as part of the 12th edition of the Language Resources and Evaluation Conference (LREC2020). The Call for Papers is open until 14 February 2020.


Researcher of the Month: Emma Sepänaho

8.1.2020

Researcher of the Month: Emma Sepänaho Photo: Sofia Tikanmäki   Kielipankki – The Language Bank of Finland is a service for researchers using language resources. Emma Sepänaho, graduate student at the University of Helsinki tells us about how she makes use of the resources Corpus of Finnish Magazines and Newspapers from the 1990s and 2000s, […]


Vocabulary of Safety and Health (TSK-35) and The Terminological Vocabulary of Kela – Benefit-related Concepts, 4th Edition (TSK-49) in the download service

20.12.2019

The following vocabularies are now available for download: The Finnish Terminology Centre TSK The Vocabulary of Safety and Health at Work (TSK-35) More information, link to the resource and the lisence are available in the META-SHARE article at http://urn.fi/urn:nbn:fi:lb-2017090407 and The Terminological Vocabulary of Kela – Benefit-related Concepts, 4th Edition (TSK-49) More information, link to […]


Translation memories Hallituskausi 2007-2011 and Hallituskausi 2011-2015 in the download service

10.12.2019

Translation memories Hallituskausi 2007-2011 and Hallituskausi 2011-2015 are available in the download service Two translation memories (Finnish into English) published by the translation and language services of the Prime Minister’s Office (https://vnk.fi/en/translation-and-language-services) as open data are now available in the Kielipankki download service korp.csc.fi/download in the folder hallituskausi. Resource metadata: The ”Hallituskausi 2007–2011” Translation Memory. […]


Finnish OpenSubtitles 2017 and Finnish Wikipedia 2017 in the download service

10.12.2019

Finnish OpenSubtitles 2017 and Finnish Wikipedia 2017 (source materials) in the download service The corpora Finnish OpenSubtitles 2017, source and Finnish Wikipedia 2017, source are available in the download service at Kielipankki in korp.csc.fi/download. Finnish OpenSubtitles 2017 contains Finnish subtitles in movies and TV-series from Opensubtitles.org. Resource metadata: http://urn.fi/urn:nbn:fi:lb-2019110801 Finnish Wikipedia 2017 contains all Finnish […]


Researcher of the Month: Katri Leino

9.12.2019

Researcher of the Month: Katri Leino Photo: Katri Leino   Kielipankki – The Language Bank of Finland is a service for researchers using language resources. Katri Leino, doctoral candidate at the Aalto University tells us about how she makes use of the resources Yle News Archive Easy-to-read Finnish 2011-2018, source and The Suomi 24 Corpus […]


Yle Swedish News Archive 2012–2018

9.12.2019

The corpus Yle Swedish News Archive (Yle svenska webbartiklar) 2012–2018, Korp is now available in Korp for beta testing as two variants with the same sentences but with different availability and features: the variant available for academic users logged in to Korp (metadata, corpus in Korp) has sentences in the original order and supports paragraph […]


< Older news