News from the Language Bank of Finland 3rd February 2014

FIN-CLARIN accepted to the updated Finnish national roadmap

FIN-CLARIN has now more than 3 billion Swedish words

  • You can now search for sentences from a corpus of more than 3 billion words in Swedish in the FIN-CLARIN Language Bank of Finland. This is a corpus based on Finnish newspapers and magazines published from 1770 onwards, scanned and automatically digitized by the Finnish National Library.

The software can be best used with Firefox. The service is hosted by CSC (, while UHEL is responsible for the content (

Signing in to Korp

  • The concordance tool Korp has been updated. You can sign in to the new version by using your Haka or CSC credentials. Corpora that can be accessed only after signing in will be added to Korp later on.

The Finnish n-grams 1820-2000 of the Newspaper and Periodical Corpus of the National Library of Finland available for download

The sentences of the corpus can be studied in Korp

Upcoming events

  • LREC Workshop CCURL 2014: Collaboration and Computing for Under-Resourced Languages in the Linked Open Data Era, Reykjavik, 26 May 2014:

FIN-CLARIN and the Language Bank of Finland wish you a succesful new year!

Imre Bartis
Project Coordinator / FIN-CLARIN