Oulu Corpus


Latest versions/subcorpora:
Oulu Corpus
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Apply for access

This version is available via the computing environment Puhti

Search for all versions in META-SHARE


The Oulu Corpus is a research corpus of Standard Finnish in the 1960’s. The original material was collected by a group led by prof. Pauli Saukkonen at the University of Oulu. The original corpus project aimed to collect a representative sample of Standard Finnish language in the 1960’s media in order to create a frequency dictionary of Finnish. The annotated text material was converted into SGML format by the Institute for the Languages of Finland in 1997.

The resource is available via the computing environment. Access rights can be granted for research use by individual application.

Last updated: 10.5.2023



This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2023040502

Search the Language Bank Portal:
Lotta Leiwo
Researcher of the Month: Lotta Leiwo


Upcoming events


The Language Bank's technical support:
kielipankki (at) csc.fi
tel. +358 9 4572001

Requests related to language resources:
fin-clarin (at) helsinki.fi
tel. +358 29 4129317

More contact information