Frequency List of Written Finnish Word Forms

This resource is offered by Kotus, Kotimaisten kielten keskus, the Institute for the Languages of Finland.

The resource contains a ranked frequency list of Finnish word forms as they appear in the Finnish Parole text corpus of 17 million written tokens. The list is available for download in three different sizes: all tokens, tokens that occur more than once, and tokens that occur more than twice, all in ISO-8859-1 (Latin-1) one entry per line. The five thousand most frequent forms are also available for browsing on the web site.

Latest versions/subcorpora:  
Frequency List of Written Finnish Word Forms
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Open the website
Search for all versions in META-SHARE  

Detailed information on the content of each version, user rights and licenses can be found from it’s specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021092005

Search the Language Bank Portal:
Katri Hiovain-Asikainen
Researcher of the Month: Katri Hiovain-Asikainen

 

Upcoming events


Contact

The Language Bank's technical support:
kielipankki (at) csc.fi
tel. +358 9 4572001

Requests related to language resources:
fin-clarin (at) helsinki.fi
tel. +358 29 4129317

More contact information