Oulu Corpus


Latest versions/subcorpora:
Oulu Corpus
Metadata and license
Attribution instructions
Apply for access

This version is available via the Puhti computing environment.

Search for all versions in META-SHARE


The Oulu Corpus is a research corpus of Standard Finnish from the 1960s. The original material was collected by a group led by Professor Pauli Saukkonen at the University of Oulu. The original corpus project aimed to collect a representative sample of the Standard Finnish used in 1960s media in order to create a frequency dictionary of Finnish. The annotated text material was converted into SGML format by the Institute for the Languages of Finland in 1997.

The resource is available via the Puhti computing environment. Access rights for research use can be granted on individual application.

Last updated: 10.5.2023



This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2023040502