The Magazine Corpus of the Institute for the Languages of Finland

The corpus contains different volumes of four magazines: Suomen Kuvalehti, Historiallinen aikakauskirja, Lakimies and Suomi.

Suomen Kuvalehti’s volumes: 1917, 1925, 1935, 1945, 1955, 1965, 1972 (approximately 5,4 million tokens).

Historiallinen Aikakauskirja’s volumes : 1917, 1920, 1925, 1935, 1945.

Lakimies’ volumes: 1917, 1920, 1925, 1935, 1945, 1955, 1965, 1972.

Suomi’s volumes: 1917, 1920, 1923, 1935, 1938.

The corpus is made up of two parts: one whose OCR (optical character recognition) has been checked and another one whose OCR hasn’t been checked. The former part’s size is 670 000 tokens and contains one 1935 issue from Historiallinen Aikakauskirja, Lakimies and Suomi, as well as 4 issues of Suomen Kuvalehti from each of the years mentioned above (1917, 1925, 1935, 1945, 1955, 1965 and 1972). These issues were chosen so that there would be an equal amount of texts from all year round.

Latest versions/subcorpora:
The Magazine Corpus of the Institute for the Languages of Finland, revised
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Select the corpus in Korp
The Magazine Corpus of the Institute for the Languages of Finland, unrevised
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Select the corpus in Korp
The Downloadable Version of the Magazine Corpus of the Institute for the Languages of Finland, revised
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
The resource will be available soon
The Downloadable Version of the Magazine Corpus of the Institute for the Languages of Finland, unrevised
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
The resource will be available soon
Search for these versions in META-SHARE

Of this language corpus different versions are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. The links to the different versions can be found from the list above.

Detailed information on the content of each version, user rights and licenses can be found from it’s specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-201407301

Search the Language Bank Portal:
Harri Uusitalo
Researcher of the Month: Harri Uusitalo

 

Upcoming events


Contact

The Language Bank's technical support:
kielipankki (at) csc.fi
tel. +358 9 4572001

Requests related to language resources:
fin-clarin (at) helsinki.fi
tel. +358 29 4129317

More contact information