T-Bone Slim Corpus


Current versions of this resource:
T-Bone Slim Corpus, source
Metadata and license
Attribution instructions
The corpus will be available soon
Look for other versions of this resource

Corpus contents

The T-Bone Slim corpus consists of columns, poems and manuscripts by the Finnish-American writer T-Bone Slim (Matti V. Huhta, 1882–1942), published in newspapers, as well as the pamphlet “Starving Amidst Too Much” (1923). The total number of texts is around 1500–2000, with the number of words in individual texts varying between 500 and 2000. The database also includes publication details and other background information on the texts in tabular form.

T-Bone Slim published his texts in the labour movement newspapers of the IWW (Industrial Workers of the World). Original texts in English were published in the following magazines:
– Industrial Worker 1921–1942
– Solidarity 1921
– Truth 1921–1923
– Industrial Solidarity 1921–1932
– Industrial Pioneer 1925–1926; 1929
– One Big Union Monthly 1938 (1920?)

Finnish translations or texts originally written in Finnish were published in the following journals:
– Amerikan Sanomat 1903 (under the name Mathew Houghton)
– Industrialisti 1921
– Tie Vapauteen 1923

The material comes from the following libraries and archives: the University of Michigan, Labadie Collection; Newberry Library, Minnesota Historical Society, Minnesota Digital Newspaper Hub; Walter Reuther Library, Wayne State University; Westmoreland family archives.

The collection is part of the Kone Foundation funded project “T-Bone Slim and the transnational poetics of the migrant left in North America” (2022–2023).
Project homepage: https://blogs.helsinki.fi/tboneslim

Further details about the content and the terms and conditions regarding the different corpus versions are available in the corresponding metadata records.

Last updated: 16.01.2024

This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2024011204

The Newspaper and Periodical Corpus of the National Library of Finland, Kielipankki Version


This corpus contains newspapers and magazines from Finland starting from 1770, compiled by the National Library of Finland.

NB: The Finnish acronym for the Newspaper and Periodical OCR Corpus of the National Library of Finland used to be “Digilib”. The acronym “klk” and the short names klk-fi-1874-dl and klk-fi-1920-dl are now recommended instead.

Latest versions/subcorpora:
The Finnish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland version 2, VRT
Metadata and license
Attribution instructions
The resource will be available for download soon
The Finnish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland version 2, Korp
Metadata and license
Attribution instructions
Example queries in Korp
Select the corpus in Korp
The Finnish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland, Kielipankki Version
Metadata and license
Attribution instructions
Select the corpus in Korp
The Swedish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland, Kielipankki Version
Metadata and license
Attribution instructions
Select the corpus in Korp
The Newspaper and Periodical OCR Corpus of the National Library of Finland (1771–1874)
Metadata and license
Attribution instructions
Download the resource
The Newspaper and Periodical OCR Corpus of the National Library of Finland (1875–1920)
Metadata and license
Attribution instructions
Download the resource
The Newspaper and Periodical Corpus of the National Library of Finland, Swedish sub-corpus, 1771–1879, VRT
Metadata and license
Attribution instructions
Download the resource
The Newspaper and Periodical Corpus of the National Library of Finland, Swedish sub-corpus, 1880–1948, scrambled, VRT
Metadata and license
Attribution instructions
Download the resource
Search for these versions in META-SHARE

Different versions/subcorpora of this corpus are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.


Based on the KLK data, word-level collections of unigrams, bigrams and trigrams have been created and are available for download. They are published as separate data sets:

The N-grams of the Newspaper and Periodical Corpus of the National Library of Finland
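
As an illustration of what word-level uni-, bi- and trigrams are, the following minimal Python sketch counts them from a list of tokens. It is not the procedure used to build the published N-gram data sets; the function name `word_ngrams` and the sample sentence are made up for this example, and tokenisation is reduced to splitting on whitespace.

```python
from collections import Counter


def word_ngrams(tokens, n):
    """Count word-level n-grams (as tuples of n consecutive tokens)."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


# Hypothetical sample text, split on whitespace for simplicity.
tokens = "sosialismi on aate ja sosialismi on liike".split()

unigrams = word_ngrams(tokens, 1)
bigrams = word_ngrams(tokens, 2)
trigrams = word_ngrams(tokens, 3)

# Show the two most frequent bigrams with their counts.
for ngram, count in bigrams.most_common(2):
    print(" ".join(ngram), count)
```

A real data set would also need proper tokenisation and a decision on whether n-grams may cross sentence boundaries; this sketch ignores both.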


Example queries from Korp


Concordance view of any form of the word ’sosialismi’ in the Finnish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland version 2, Korp


Word picture of the word ’sosialismi’ in the Finnish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland version 2, Korp


Trend diagram of all forms of the word ’sosialismi’ occurring in the Finnish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland version 2, Korp



This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021092404

Last updated: 19.12.2023



Latest versions of the resource:
Fenno-Ugrica, Kielipankki Version
Metadata and license
How to cite this version
Open the resource in Korp
Fenno-Ugrica, Kielipankki Downloadable Version
Metadata and license
How to cite this version
Download the resource
Search for other available versions

Contents of the resource

Fenno-Ugrica is the National Library of Finland's digital collection of Finno-Ugric publications, which is also available in various versions via Kielipankki. The Fenno-Ugrica collection contains monographs in Ingrian, Veps, Mari (Hill Mari and Meadow Mari) and Mordvinic (Erzya and Moksha) languages, as well as newspapers from the 1920s and 1930s in Mari and Mordvinic languages. In total, the collection comprises more than 120 monographs and nearly 20 000 newspaper pages.

The Fenno-Ugrica material was produced by the National Library of Finland in the Digitisation Project of Kindred Languages, which was part of the Language Programme of the Kone Foundation.

More information on the National Library's collection: http://fennougrica.kansalliskirjasto.fi/

The languages occurring in the resource and their three-letter ISO 639-3 codes are the following:

  • Meadow Mari (Eastern Mari): mhr
  • Erzya: myv
  • Ingrian: izh
  • Khanty: kca
  • Mansi: mns
  • Moksha: mdf
  • Tundra Nenets: yrk
  • Selkup: sel
  • Veps: vep
  • Hill Mari (Western Mari): mrj

Last updated: 31.5.2023

This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2023053121

Elias Lönnrot Letters Online

The corpus consists of the correspondence of Elias Lönnrot with private individuals as well as institutions from 1823 until Lönnrot’s death. Elias Lönnrot (1802–1884) was the creator of the Kalevala, a medical doctor and a professor of language. The letters and drafts of letters belong to the Archive of the Finnish Literature Society and have been transliterated for the project Elias Lönnrot’s Letters Online, http://lonnrot.finlit.fi/omeka/.


Latest versions/subcorpora:
Elias Lönnrot Letters Online, source
Metadata and license
Attribution instructions
Download the resource
The Finnish sub-corpus of Elias Lönnrot Letters Online – Kielipankki version
Metadata and license
Attribution instructions
Resource will be available soon
The Swedish sub-corpus of Elias Lönnrot Letters Online – Kielipankki version
Metadata and license
Attribution instructions
Resource will be available soon
Search for all versions of this resource in META-SHARE

Different versions/subcorpora of this corpus are (or will be) published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2022051701

Erzya and Moksha Extended Corpora (ERME)

ERME contains predominantly original Erzya and Moksha literature. It consists of several media publications from the 19th to the 20th century. ERME was mapped in Saransk in 1997-2004, while in Helsinki it has been mapped since 2004. The most basic format used is XML, with a granularity extending to chapter level. The goal is to create corpora with a granularity extending to word level with bibliographic reference to the sentence level.

The new version contains the literature found in the older instance and has grown markedly. While the old version was merely text divided to sentence level, the new version has lemmatization and dependencies. At sentence level contextual translation may be present (English or Finnish translation), while at word level there is morphological encoding, corresponding to each context. Preliminary morpho-syntactic analysis is carried out using HFST-based transducers and Constraint Grammar disambiguation, function and dependency tagging, which have been developed in the Giellatekno infrastructure of the University of Tromsø.

The grammatical analysis and labeling comply with the practices developed in the Giellatekno infrastructure of the University of Tromsø. These practices are applied in the documentation of several Uralic languages.

The amount of the processed material is to be increased subsequently.

Latest versions/subcorpora:
Erzya and Moksha Extended Corpora (ERME) version 2, Korp
Metadata and license
Attribution instructions
Select the corpus in Korp
Search for all versions of this resource in META-SHARE

Different versions/subcorpora of this corpus are (or will be) published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2022052001

Italian Letters from the Sixteenth Century

This corpus offers a collection of Italian letters from the sixteenth century, drawn from different sources.

1) Giovanni FERRO, Lettere varie di complimenti amorose, e giocose [&], In Venetia, Ad istanza di Stefano Curti, 1679 (Universitäts- und Forschungsbibliothek Erfurt/Gotha – Phil 8° 01183/05)

2) Ferrante PALLAVICINO Luca ASSARINO, Lettere amorose, s.n.t. [XVII sec.] (Vicenza, Biblioteca Bertoliana, B.4.3.26) (c) Vicenza, Biblioteca Bertoliana

3) Concetti amorosi, cioè lettere giovenili, et amorose [&], In Modena, ad instantia di Mafeo Tagietti, detto il Verginio, et Ieronimo da Vinetia Compagni, 1553 (Milano, Biblioteca Trivulziana, Triv M 235/1) (c) Historical Archive and Library Trivulziana. City of Milan.

The resource has been made available for download in jpeg format.

Latest versions/subcorpora:
Lettere varie di complimenti amorose, e giocose
Metadata and license
Attribution instructions
Download the resource
Concetti amorosi, cioè lettere giovenili, et amorose
Metadata and license
Attribution instructions
Download the resource
Lettere amorose
Metadata and license
Attribution instructions
Download the resource
Search for all versions in META-SHARE

Different versions of this corpus are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2016031701

Corpus of Historical American English

The Corpus of Historical American English (COHA) contains about 385 million words and 115 000 texts from the years 1810-2009. Each decade has roughly the same balance of fiction, popular magazine, newspaper, and non-fiction books.

For general terms and conditions for this and other corpora from BYU please see https://www.corpusdata.org/restrictions.asp

More information on the BYU corpora at Kielipankki

Latest versions/subcorpora:
Corpus of Historical American English – Kielipankki Korp version 2017H1
Metadata and license
Attribution instructions
Select the corpus in Korp
Corpus of Historical American English – Kielipankki download version 2017H1
Metadata and license
Attribution instructions
Download the resource
Search for all versions in META-SHARE

Different versions/subcorpora of this corpus are (or might be in the future) published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2017061924

The Morpho-Syntactic Database of Mikael Agricola’s Works

The Morpho-Syntactic Database of Mikael Agricola’s Works contains the Finnish parts of Mikael Agricola’s works (Abckiria, Rukouskiria, Se Wsi testamenti, Käsikiria, Messu, Piina, Psaltari, Veisut, Profeetat). The database was created from 2004 to 2008, when the texts offered by the Institute for the Languages of Finland were coded and annotated by the Finnish Language Department of the University of Turku in the project ’The Scientific Edition and the Morpho-Syntactic Database of Mikael Agricola’s Works’ by broadening the model used in the Finnish Dialect Syntax Archive. The project was funded by the Academy of Finland and the Alfred Kordelin Foundation. The words of the corpus have been annotated by keyword, part of speech, morphological components and syntactical function. All the grammatical units have been coded according to their places in the works and in the books of the Bible.

Latest versions/subcorpora:
The Morpho-Syntactic Database of Mikael Agricola’s Works version 1.1, Korp
Metadata and license
Attribution instructions
Select the corpus in Korp
Search for all versions in META-SHARE

Different versions/subcorpora of this corpus are, or will be, published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021051204

The Finnish Gutenberg Corpus

The corpus contains Finnish books whose copyright has already expired and which were available on the web page of the Gutenberg project (https://www.gutenberg.org/) in 2014. The texts have not been linguistically annotated.

A list of the works the Finnish Gutenberg Corpus contains can be found here: Gutenberg.pdf

Latest versions/subcorpora:
The Finnish Gutenberg Corpus
Metadata and license
Attribution instructions
Select the corpus in Korp
Search for all versions in META-SHARE

Different versions/subcorpora of this corpus are (or might be in the future) published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021051203

Finnish Folk Poetry

A 34-volume collection of Finnic oral poetry, lyric, short rhymes, incantations etc., collected and recorded from the 16th century to the 1930s and published mostly between 1908 and 1948, with a supplement volume published in 1997. The corpus is multilingual, with texts in Finnish, Karelian, Olonets, Ludian, Votic, Izhorian, Latin and Swedish.

More information on the corpus: https://skvr.fi/skvr-teos

Latest versions/subcorpora:
Finnish Folk Poetry
Metadata and license
Attribution instructions
Select the corpus in Korp
Search for these versions in META-SHARE

Different versions/subcorpora of this corpus are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021051201



Current versions of this resource:
Fenno-Ugrica, Kielipankki Version
Metadata and license
Attribution instructions
Select the corpus in Korp
Fenno-Ugrica, Kielipankki Downloadable Version
Metadata and license
Attribution instructions
Download the resource
Look for other versions of this resource

Corpus contents

Fenno-Ugrica is the digital collection of Finno-Ugric publications at the National Library of Finland. Various versions of Fenno-Ugrica are also available via Kielipankki – the Language Bank of Finland, as described on this page.

The Fenno-Ugrica collection includes monograph publications in Ingrian, Veps, Mari (Hill Mari and Meadow Mari) and Mordvinic (Erzya and Moksha) languages and newspapers in Mari and Mordvinic languages from the 1920s and the 1930s. All in all, the collection consists of more than 120 monographs and nearly 20 000 pages of newspapers.

The material of Fenno-Ugrica was produced by the National Library of Finland in the Digitisation Project of Kindred Languages as part of the Language Programme of the Kone Foundation.

More information: http://fennougrica.kansalliskirjasto.fi/

The languages in the corpus and their three-letter ISO 639-3 codes are the following:

  • Eastern Mari: mhr
  • Erzya: myv
  • Ingrian: izh
  • Khanty: kca
  • Mansi: mns
  • Moksha: mdf
  • Nenets: yrk
  • Selkup: sel
  • Veps: vep
  • Western Mari: mrj
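
When processing the material programmatically, the code list above can be held as a small lookup table. The following Python sketch simply restates the ten codes listed on this page; the names `FENNO_UGRICA_LANGUAGES` and `language_name` are hypothetical and are not part of any Kielipankki tool.

```python
# ISO 639-3 codes exactly as listed on this page (names are illustrative).
FENNO_UGRICA_LANGUAGES = {
    "mhr": "Eastern Mari",
    "myv": "Erzya",
    "izh": "Ingrian",
    "kca": "Khanty",
    "mns": "Mansi",
    "mdf": "Moksha",
    "yrk": "Nenets",
    "sel": "Selkup",
    "vep": "Veps",
    "mrj": "Western Mari",
}


def language_name(code):
    """Return the language name for an ISO 639-3 code, or None if not listed."""
    return FENNO_UGRICA_LANGUAGES.get(code.lower())


print(language_name("myv"))
```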

Last updated: 31.5.2023

This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021050706

Corpus of Old Literary Finnish

Written Finnish texts from the years between 1543 and 1810, browsable and searchable on the web. The collection contains bible translations and religious texts (e.g. all of Mikael Agricola’s Finnish works), legal texts, poems, and texts concerning agriculture, nature, health etc., among others. It was compiled for lexicographic use.

More information on the corpus: http://kaino.kotus.fi/korpus/vks/meta/vks_coll_rdf.xml

Latest versions/subcorpora:
Corpus of Old Literary Finnish
Metadata and license
Attribution instructions
Select the corpus in Korp
Virtual Old Literary Finnish (VVKS) – Kielipankki Korp version
Metadata and license
Attribution instructions
Resource will be available soon
Search for all versions in META-SHARE

Different versions/subcorpora of this corpus are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021050705

Corpus of Early Modern Finnish

Written Finnish from the 19th century (mostly from the years between 1810 and 1880), browsable and searchable on the web. The collection contains published literature, periodicals, newspapers, and dictionaries, among others, with a focus on the earliest and most important publications and a wide thematic coverage. Texts written originally in Finnish were preferred to translations.

More information on the corpus: http://kaino.kotus.fi/korpus/1800/meta/1800_coll_rdf.xml

Latest versions/subcorpora:
Corpus of Early Modern Finnish, Kielipankki Version
Metadata and license
Attribution instructions
Select the corpus in Korp
Search for these versions in META-SHARE

Different versions/subcorpora of this corpus are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021050704

Classics of Finnish Literature, Kielipankki Version

This corpus contains works of established Finnish authors published from the 1880s to the 1940s. It includes prose fiction, plays, poetry and aphorisms, some written originally in Swedish.

More information on the corpus: https://kaino.kotus.fi/korpus/klassikot/meta/klassikot_coll_rdf.xml

Latest versions/subcorpora:
Classics of Finnish Literature, Kielipankki Version
Metadata and license
Attribution instructions
Select the corpus in Korp
Classics of Finnish Literature, VRT
Metadata and license
Attribution instructions
Resource will be available soon
Search for these versions in META-SHARE

Different versions/subcorpora of this corpus are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021050703

Helsinki Corpus of English Texts

The Helsinki Corpus of English Texts is a structured multi-genre diachronic corpus, which includes periodically organized text samples from Old, Middle and Early Modern English. Each sample is preceded by a list of parameter codes giving information on the text and its author. The corpus is particularly useful for studying how linguistic features change over long time spans. It can be used as a diagnostic corpus giving general information on the occurrence of forms, structures and lexemes in different periods of English. This information can be supplemented by evidence yielded by more specialized and focused historical corpora.

More information on the corpus: https://varieng.helsinki.fi/CoRD/corpora/HelsinkiCorpus/

Latest versions/subcorpora:
Helsinki Corpus TEI-XML Edition (2011), Korp
Metadata and license
Attribution instructions
Select the corpus in Korp
Helsinki Corpus of English Texts, VRT
Metadata and license
Attribution instructions
Download the resource
Helsinki Corpus of English Texts, Early Modern English section, source
Metadata and license
Attribution instructions
Download the resource
Search for all versions in META-SHARE

Different versions/subcorpora of this corpus are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021050302

Aleksis Kivi Corpus (SKS)

This corpus contains all the known letters, manuscripts and published works by the Finnish author Aleksis Kivi (1834–1872), collected by the Finnish Literature Society (Suomalaisen Kirjallisuuden Seura). Most of the texts were written in Finnish while some of the letters and manuscripts are in Swedish.

More information: https://www.finlit.fi/tutkimus/suomalaisen-kirjallisuuden-kriittiset-editiot-edith/aleksis-kivi-korpus/

Latest versions/subcorpora:
Aleksis Kivi Corpus (SKS)
Metadata and license
Attribution instructions
Select the corpus in Korp
Search for these versions in META-SHARE

Different versions/subcorpora of this corpus are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021050301

Classics Library of the National Library of Finland – Kielipankki version

This corpus comprises works written in Finnish and Swedish, which are part of the Classics Library of the National Library of Finland and have been published under the license Public Domain.
The data set in Finnish includes 686 works and the data set in Swedish includes 282 works out of the whole data set of 968 works in Finnish and Swedish, gathered from Doria and processed by Niklas Alén in April 2017.

The data set in Doria is an accumulating resource and it comprises works of established Finnish authors published from 1549 onwards. The time coverage for the Kielipankki version is 1549-1944 with the exception of Maria Jotuni’s ’Huojuva talo’ published in 1963 in the Finnish sub-corpus.
The corpus includes classical literature, e.g. prose, plays and poetry.

A list of all works in Finnish in the Kielipankki version sorted by the author
A list of all works in Swedish in the Kielipankki version sorted by the author

Latest versions/subcorpora:
The Finnish sub-corpus of the Classics Library of the National Library of Finland – Kielipankki version
Metadata and license
Attribution instructions
Select the corpus in Korp
The Finnish sub-corpus of the Classics Library of the National Library of Finland – Kielipankki version, VRT
Metadata and license
Attribution instructions
Download the resource
A copy of this version is available in the computing environment.
The Swedish sub-corpus of the Classics Library of the National Library of Finland – Kielipankki version
Metadata and license
Attribution instructions
Select the corpus in Korp
The Swedish sub-corpus of the Classics Library of the National Library of Finland – Kielipankki version, VRT
Metadata and license
Attribution instructions
Download the resource
A copy of this version is available in the computing environment.
Search for all versions of this resource in META-SHARE

Different versions/subcorpora of this corpus are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021092407

The Finland-Swedish Text Corpus

The corpus contains modern written Swedish texts published in Finland (1990s). The kernel corpus is accompanied by a minor section on spoken language. The Finland Swedish Text Corpus is a part of the UHLCS corpus collection.

More information on the corpus

Latest versions/subcorpora:
The Helsinki Korp Version of the Finland-Swedish Text Corpus (UHLCS)
Metadata and license
Attribution instructions
Select the corpus in Korp
Search for all versions in META-SHARE

Different versions/subcorpora of this corpus are (or might be in the future) published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2016050212


Corpus Cyrillo-Methodianum Helsingiense: Corpus of Old Church Slavonic Texts

Latest versions:
Corpus Cyrillo-Methodianum Helsingiense: Corpus of Old Church Slavonic Texts, source
Metadata and license
How to cite this version
Browse the content as a web site
Search for other versions of this resource

Content of this resource

The Corpus Cyrillo-Methodianum Helsingiense (CCMH) is a corpus of Old Church Slavonic (OCS) texts. It was collected at the University of Helsinki from 1986 to 2017 in various research projects.

Until 2020, the resource was publicly available as a website maintained by the University of Helsinki, and it is now located in Kielipankki – The Language Bank of Finland.

Further details on the content and license are provided in the metadata record of each corpus version.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021041522

The Letters of Paul Sinebrychoff, Kielipankki Version

Paul and Fanny Sinebrychoff created an art collection unique for Finland. This corpus contains Paul Sinebrychoff’s and various experts’ and collectors’ correspondence from 1895 to 1909 related to the acquisitions of the collection.

The letters were translated into Finnish and the original hand-written letters in Swedish were transcribed with support from Sinebrychoff Oy Ab. The translations were done by students from the Department of Scandinavian Languages and Literature (Nordica) of the University of Helsinki.

More information: http://kirjearkisto.siff.fi/default.aspx

Latest versions/subcorpora:
The Finnish Sub-corpus of the Letters of Paul Sinebrychoff, Kielipankki Version
Metadata and license
Attribution instructions
Select the corpus in Korp
The Swedish Sub-corpus of the Letters of Paul Sinebrychoff, Kielipankki Version
Metadata and license
Attribution instructions
Select the corpus in Korp
Search for these versions in META-SHARE

Different versions/subcorpora of this corpus are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. Links to the different versions can be found in the list above.

Detailed information on the content of each version, user rights and licenses can be found in its specific metadata record in META-SHARE.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021092408

The Language Bank's technical support:
kielipankki (at) csc.fi
tel. +358 9 4572001

Requests related to language resources:
fin-clarin (at) helsinki.fi
tel. +358 29 4129317
