Suomalaisen kirjallisuuden klassikoita (skk)

In English


Saatavilla olevat versiot

LyhenneNimi ja kuvailutiedotLisenssiSijaintiViiteAineistoryhmä ja ohjeHae käyttöoikeuttaJulkaisuvuosiTukitaso
LyhenneNimi ja kuvailutiedotLisenssiSijaintiViiteAineistoryhmä ja ohjeHae käyttöoikeuttaJulkaisuvuosiTukitaso

Tulossa olevat versiot

Nämä aineistoversiot eivät vielä ole saatavilla Kielipankin kautta.

LyhenneNimi ja kuvailutiedotLisenssiMuotoTukitasoYhteyshenkilöSijaintiAineistoryhmä ja ohjeMuu tieto
LyhenneNimi ja kuvailutiedotLisenssiMuotoTukitasoYhteyshenkilöSijaintiAineistoryhmä ja ohjeMuu tieto

Tietoa aineistosta

Suomalaisen kirjallisuuden klassikoita -korpuksessa on asemansa vakiinnuttaneiden suomalaisten kaunokirjailijoiden teoksia 1880-luvulta 1940-luvulle. Mukana on erityyppistä proosaa ja näytelmiä sekä lyriikkaa ja aforismeja. Alun perin suomeksi kirjoitettujen teosten lisäksi korpukseen sisältyy joitain etenkin ruotsista tehtyjä runokäännöksiä.

Kunkin aineistoversion tarkemmat tiedot päivitetään kuvailutietueeseen, joka löytyy pysyvällä tunnisteella (ks. linkki aineiston otsikon kohdalla).

Lue lisää tämän korpuksen alkuperäisestä versiosta Kotimaisten kielten keskuksessa: https://kaino.kotus.fi/korpus/klassikot/meta/klassikot_coll_rdf.xml

 

Lisenssi ja pääsy aineistoon

  • Kaikki tämän aineiston versiot ovat saatavilla julkisesti (PUB).
  • Lisenssikuvaketta napauttamalla näet tarkan aineistokohtaisen lisenssin.

 


Tämän sivun pysyvä tunniste: http://urn.fi/urn:nbn:fi:lb-2025071822

Varhaisnykysuomen korpus (vnsk)

In English


Saatavilla olevat versiot

LyhenneNimi ja kuvailutiedotLisenssiSijaintiViiteAineistoryhmä ja ohjeHae käyttöoikeuttaJulkaisuvuosiTukitaso
LyhenneNimi ja kuvailutiedotLisenssiSijaintiViiteAineistoryhmä ja ohjeHae käyttöoikeuttaJulkaisuvuosiTukitaso

Tulossa olevat versiot

Nämä aineistoversiot eivät vielä ole saatavilla Kielipankin kautta.

LyhenneNimi ja kuvailutiedotLisenssiMuotoTukitasoYhteyshenkilöSijaintiAineistoryhmä ja ohjeMuu tieto
LyhenneNimi ja kuvailutiedotLisenssiMuotoTukitasoYhteyshenkilöSijaintiAineistoryhmä ja ohjeMuu tieto

Tietoa aineistosta

Tämä aineisto sisältää kirjoitettua suomea 1800-luvulta (enimmäkseen vuosilta 1810-1880) haettavassa muodossa. Korpus sisältää mm. julkaistua kirjallisuutta, aikakauslehtiä, sanomalehtiä ja sanakirjoja. Aineiston koostamisessa on keskitytty vanhimpiin ja tärkeimpiin julkaisuihin, pyritty kattamaan laajasti eri aiheita sekä suosittu alun perin suomeksi kirjoitettuja tekstejä käännösten sijaan.

Kunkin aineistoversion tarkemmat tiedot päivitetään kuvailutietueeseen, joka löytyy pysyvällä tunnisteella (ks. linkki aineiston otsikon kohdalla).

Lue lisää tämän korpuksen alkuperäisestä versiosta Kotimaisten kielten keskuksessa: http://kaino.kotus.fi/korpus/1800/meta/1800_coll_rdf.xml

 

Lisenssi ja pääsy aineistoon

  • Kaikki tämän aineiston versiot ovat saatavilla julkisesti (PUB).
  • Lisenssikuvaketta napauttamalla näet tarkan aineistokohtaisen lisenssin.

 


Tämän sivun pysyvä tunniste: http://urn.fi/urn:nbn:fi:lb-2025071821

ORACC – Open Richly Annotated Cuneiform Corpus

Suomeksi

Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

Open Richly Annotated Cuneiform Corpus (Oracc) brings together the work of several Assyriological projects to publish online editions of cuneiform texts. The Korp version of Oracc allows extensive searches on the texts and presents the results as a KWIC concordance list. Korp also offers statistical information and comparison of the search results. Downloading the query results is possible as well.

Lists of texts

The second column in the list indicates if the text has been lemmatized in Oracc.

License and access

  • All versions of this resource are available publicly (PUB). Click on the license image to see the resource-specific license text.

Additional documentation

For how to use Oracc in Korp, please see the Oracc in Korp user guide.

 


This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2019111601

 

The Newspaper and Periodical Corpus of the National Library of Finland, Kielipankki Version

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

This corpus contains newspapers and magazines from Finland starting from 1770, compiled by the National Library of Finland. Further details of each version of the resource are maintained in the metadata record, findable via the persistent identifier (see the link at the resource title).

Contents

N-grams (separate resource)

Important notes

  • Previously, the Finnish acronym for the corpora The Newspaper and Periodical OCR Corpus of the National Library of Finland used to be ”Digilib”. Currently, the acronym ”klk” and the short names klk-fi-1874-dl and klk-fi-1920-dl are recommended instead.

License and access

  • Some versions of this resource are available publicly (PUB), whereas others may require you to log in as an academic user (ACA) or to apply for individual access rights (RES). Click on the license image to see the resource-specific license text.

Examples of use (Korp versions)

 

Concordance view of any form of the word 'sosialismi' in the Finnish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland version 2, Korp
Concordance view of any form of the word ’sosialismi’ in the Finnish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland version 2, Korp

 

Word picture of the word 'sosialismi' in klk-fi-v2-korp
Word picture of the word ’sosialismi’ in the Finnish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland version 2, Korp

 

Trend diagram of all forms of the word 'sosialismi' occurring in klk-fi-v2-korp
Trend diagram of all forms of the word ’sosialismi’ occurring in the Finnish Sub-corpus of the Newspaper and Periodical Corpus of the National Library of Finland version 2, Korp

OCR quality

The corpora consist mainly of digitized versions of texts originally printed on paper. These physical papers have been scanned, and optical character recognition (OCR) was performed on the resulting images. The digitized material spans a long period and contains different kinds of texts, writing styles and fonts. Scanning some parts of the material is more complex than scanning other parts, and the physical condition of the original texts also varies. The OCR techniques used have also varied, and there is the possibility that some of the texts have gone through manual post-correction. This results in some parts of the corpora being of terrible quality while others are of good quality. We have collected a list of publications related to OCR quality and collection processing:

 


This page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021092404

 

T-Bone Slim Corpus (tboneslim)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

The T-Bone Slim corpus consists of columns, song lyrics, poems and manuscripts by the Finnish-American writer T-Bone Slim (Matti V. Huhta, 1882-1942), published in newspapers and other leftist publications. Most of the material is included in the openly available version  (T-Bone Slim Corpus, source), but part of the manuscripts and photographs will be made available under a restricted license (T-Bone Slim Corpus, Westmoreland materials).

T-Bone Slim published his texts in the labour movement’s newspapers of the IWW (Industrial Workers of the World). Original texts in English were published in the following magazines:

  • General Construction Workers Bulletin 1922; 1923
  • Industrial Solidarity 1921–1931
  • Industrial Pioneer 1921; 1923; 1925
  • Industrial Worker 1921–1942
  • Junior Recruit 1934
  • Little Red Songbook 1921/1922
  • Lumber Workers Bulletin Port Arthur 1935
  • Lumber Workers Industrial Union 1923
  • One Big Union Monthly 1938 (1920?)
  • Truth 1921–1923

In addition, individual texts and advertisements were published in the following publications:

  • Aberdeen American 1919 (under the name Matt Arnold)
  • Erie Times News 1904; 1925; 1926 (under the name Mathew Huhta)
  • Evening World-Herald Omaha 1932
  • New Yorker Volkszeitung 1921
  • Producers News 1931

Finnish translations or texts originally written in Finnish were published in the following journals:

  • Amerikan Sanomat 1903 (under the name Mathew Houghton)
  • Industrialisti 1922–1923; 1926; 1930; 1941–1942
  • Tie Vapauteen 1923

The material comes from the following libraries and archives: Columbia University, Rare Book & Manuscript Library; Erie County Public Library; Genealogy Bank, Newspaper Archives; Janet Guinnane’s family photo collection; Library of Congress, Chronicling America; National Library of Finland; Lakehead University Archives; Minnesota Historical Society, Minnesota Digital Newspaper Hub; Newberry Library; State Library of New South Wales; University of Michigan, Labadie Collection; Walter Reuther Library, Wayne State University; Westmoreland family archives.

The collection is part of the Kone Foundation funded project ”T-Bone Slim and the transnational poetics of the migrant left in North America” (2022-2023).

Project homepage: https://blogs.helsinki.fi/tboneslim

License and access

  • Some versions of this resource are available publicly (PUB), whereas others require you to log in as an academic user (ACA) or to apply for individual access rights (RES).
  • Click on the license image to see the resource-specific license text.
  • Some versions of this resource contain personal data (license condition +PRIV). The license then includes additional data protection terms and conditions that you must follow. If processing personal data, maintain a public Privacy Notice regarding your project and provide the link to the Language Bank of Finland, see instructions.)

 

 


This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2024011204

Fenno-Ugrica (fenno-ugrica)

In English


Saatavilla olevat versiot

LyhenneNimi ja kuvailutiedotLisenssiSijaintiViiteAineistoryhmä ja ohjeHae käyttöoikeuttaJulkaisuvuosiTukitaso
LyhenneNimi ja kuvailutiedotLisenssiSijaintiViiteAineistoryhmä ja ohjeHae käyttöoikeuttaJulkaisuvuosiTukitaso

Tulossa olevat versiot

Nämä aineistoversiot eivät vielä ole saatavilla Kielipankin kautta.

LyhenneNimi ja kuvailutiedotLisenssiMuotoTukitasoYhteyshenkilöSijaintiAineistoryhmä ja ohjeMuu tieto
LyhenneNimi ja kuvailutiedotLisenssiMuotoTukitasoYhteyshenkilöSijaintiAineistoryhmä ja ohjeMuu tieto

Tietoa aineistosta

Fenno-Ugrica on Kansalliskirjaston suomalais-ugrilaisten julkaisujen digitaalinen kokoelma, joka on saatavilla eri versioina myös Kielipankin kautta. Fenno-Ugrica-kokoelma sisältää monografioita inkeroisen, vepsän, marin (vuorimari ja niittymari) ja mordvan (ersä ja mokša) kielillä sekä 1920- ja 1930-lukujen sanomalehtiä marin ja mordvan kielillä. Kokoelma käsittää kaikkiaan yli 120 monografiaa ja lähes 20000 sanomalehtisivua.

Fenno-Ugrican aineiston on tuottanut Kansalliskirjasto suomen sukukielten digitointihankkeessa, joka oli osa Koneen Säätiön kieliohjelmaa.

Lisätietoa Kansalliskirjaston kokoelmasta: http://fennougrica.kansalliskirjasto.fi/

Aineistossa esiintyvät kielet ja niiden kolmikirjaimiset ISO 639-3 -koodit ovat seuraavat:

  • niittymari (itämari): mhr
  • ersä: myv
  • inkeroinen: izh
  • hanti: kca
  • mansi: mns
  • mokša: mdf
  • tundranenetsi: yrk
  • selkuppi: sel
  • vepsä: vep
  • vuorimari (länsimari): mrj

Lisenssi ja pääsy aineistoon

  • Jotkin tämän aineiston versiot ovat saatavilla julkisesti (PUB), kun taas toisiin täytyy kirjautua akateemisena käyttäjänä (ACA) tai hakea erikseen henkilökohtaista käyttöoikeutta (RES).
  • Lisenssikuvaketta napauttamalla näet tarkan aineistokohtaisen lisenssin.

Tämän sivun pysyvä tunniste: http://urn.fi/urn:nbn:fi:lb-2023053121

Elias Lönnrot Letters Online (lonnrot)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

The corpus consists of the correspondence of Elias Lönnrot with private individuals as well as institutions from 1823 until Lönnrot’s death. Elias Lönnrot was the creator of the Kalevala, medical doctor and professor of language (1802 – 1884). The letters and drafts of letters belong to the Archive of the Finnish Literature Society and have been transliterated for the project Elias Lönnrot’s Letters Online, http://lonnrot.finlit.fi/omeka/.

License and access

  • The versions of this resource are available publicly (PUB).
  • Click on the license image to see the resource-specific license text.

 


This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2022051701

Erzya and Moksha Extended Corpora (ERME)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

ERME contains predominantly original Erzya and Moksha literature. It consists of several media publications from the 19th to the 20th century. ERME was mapped in Saransk in 1997-2004, while in Helsinki it has been mapped since 2004. The most basic format used is XML, with a granularity extending to chapter level. The goal is to create corpora with a granularity extending to word level with bibliographic reference to the sentence level.

The new version contains the literature found in the older instance and has grown markedly. While the old version was merely text divided to sentence level, the new version has lemmatization and dependencies. At sentence level contextual translation may be present (English or Finnish translation), while at word level there is morphological encoding, corresponding to each context. Preliminary morpho-syntactic analysis is carried out using HFST-based transducers and Constraint Grammar disambiguation, function and dependency tagging, which have been developed in the Giellatekno infrastructure of the University of Tromsø.

The grammatical analysis and labeling comply with the practices developed in the Giellatekno infrastructure of the University of Tromsø. These practices are applied in the documentation of several Uralic languages.

The amount of the processed material is to be increased subsequently.

License and access

  • All versions of this resource are available publicly (PUB).
  • Click on the license image to see the resource-specific license text.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2022052001

Italian Letters from the Sixteenth Century

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

This corpus offers a collection of italian letters from the sixteenth century, from different sources.

1) Giovanni FERRO, Lettere varie di complimenti amorose, e giocose [&], In Venetia, Ad istanza di Stefano Curti, 1679 (Universitäts- und Forschungsbibliothek Erfurt/Gotha – Phil 8° 01183/05)

2) Ferrante PALLAVICINO Luca ASSARINO, Lettere amorose, s.n.t. [XVII sec.] (Vicenza, Biblioteca Bertoliana, B.4.3.26) (c) Vicenza, Biblioteca Bertoliana

3) Concetti amorosi, cioè lettere giovenili, et amorose [&], In Modena, ad instantia di Mafeo Tagietti, detto il Verginio, et Ieronimo da Vinetia Compagni, 1553 (Milano, Biblioteca Trivulziana, Triv M 235/1) (c) Historical Archive and Library Trivulziana. City of Milan.

The resource has been made available for download in jpeg format.

License and access

  • The versions of this resource are available publicly (PUB).
  • Click on the license image to see the resource-specific license text.

 


This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2016031701

Corpus of Historical American English (COHA)

Suomeksi


Currently available versions of this resource group

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource group

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

The Corpus of Historical American English (COHA) is the largest structured corpus of historical English. The corpus is balanced by genre across the decades. The original version of COHA is provided by Mark Davies via the corpus interface at english-corpora.org. The Language Bank of Finland offers several ”snapshot” versions of COHA under a restricted academic license that is available for users affiliated with a university in Finland.

For the description of an individual corpus version, please see the metadata record (click on the link at the corpus title).

More information about all corpora from english-corpora.org that are available via the Language Bank

License and access

For the license text of an individual corpus, click on the license image in the corpus list, or see the metadata record (click on the link at the corpus title). Note that there are specific additional terms and conditions that apply on this and other corpora from BYU, see https://www.corpusdata.org/restrictions.asp. The link is included in the official license.

Korp versions

  • Some of the corpus versions are available for searching via the Korp concordancer tool (click on the link under ’Location’).
  • Access to the Korp versions requires academic login via a university in Finland.

Downloadable versions

  • Access to the downloadable corpora mentioned above is restricted to researchers affiliated to member universities of the FIN-CLARIN consortium in Finland. Download access can usually be provided to graduate or postgraduate students in case the applicant needs the corpora for an MA thesis or for a PhD dissertation.
  • To obtain access to restricted corpora, please submit an application via the Language Bank Rights (after logging in to the LBR service, search the catalogue for ’Mark Davies’ downloadable corpora at Kielipankki.’).
  • To access the download service, click on the link under ’Location’, or see the metadata record for the link.

This page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2017061924

 

The Morpho-Syntactic Database of Mikael Agricola’s Works (agricola)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

The Morpho-Syntactic Database of Mikael Agricola’s Works contains the Finnish parts of Mikael Agricola’s works (Abckiria, Rukouskiria, Se Wsi testamenti, Käsikiria, Messu, Piina, Psaltari, Veisut, Profeetat). The database was created from 2004 to 2008, when the texts offered by the Institute for the Languages of Finland were coded and annotated by the Finnish Language Department of the University of Turku in the project ’The Scientific Edition and the Morpho-Syntactic Database of Mikael Agricola’s Works’ by broadening the model used in the Finnish Dialect Syntax Archive. The project was funded by the Academy of Finland and the Alfred Kordelin Foundation. The words of the corpus have been annotated by keyword, part of speech, morphological components and syntactical function. All the grammatical units have been coded according to their places in the works and in the books of the Bible.

License and access

  • All versions of this resource are available publicly (PUB).
  • Click on the license image to see the resource-specific license text.

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021051204

The Finnish Gutenberg Corpus (Gutenberg)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

The corpus contains Finnish books, for which the copyright already has expired, available on the web page of the Gutenberg project (https://www.gutenberg.org/) in 2014. The texts have not been linguistically annotated.

A list of the works the Finnish Gutenberg Corpus contains can be found here: Gutenberg.pdf

License and access

  • The versions of this resource are available publicly (PUB).
  • Click on the license image to see the resource-specific license text.

 


This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021051203

Finnish Folk Poetry (SKVR)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

A 34-volume collection of Finnic oral poetry, lyric, short rhymes, incantations etc., collected and recorded from the 16th century to the 1930s and published mostly between 1908 and 1948, with a supplement volume published in 1997. The corpus is multilingual, with texts in Finnish, Karelian, Olonets, Ludian, Votic, Izhorian, Latin and Swedish.

More information on the corpus: https://skvr.fi/skvr-teos

License and access

  • This resource is available publicly (PUB).
  • Click on the license image to see the resource-specific license text.

 


This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021051201

Fenno-Ugrica (fenno-ugrica)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

Fenno-Ugrica is the digital collection of Finno-Ugric publications at the National Library of Finland. Various versions of Fenno-Ugrica are also available via Kielipankki – the Language Bank of Finland, as described on this page.

The Fenno-Ugrica collection includes monograph publications in Ingrian, Veps, Mari (Hill Mari and Meadow Mari) and Mordvinic (Erzya and Moksha) languages and newspapers in Mari and Mordvinic languages from the 1920s and the 1930s. All in all, the collection consists of more than 120 monographs and nearly 20 000 pages of newspapers.

The material of Fenno-Ugrica was produced by the National Library of Finland in the Digitisation Project of Kindred Languages as part of Language Programme of Kone Foundation.

More information: http://fennougrica.kansalliskirjasto.fi/

The languages in the corpus and their three-letter ISO 639-3 codes are the following:

  • Eastern Mari: mhr
  • Erzya: myv
  • Ingrian: izh
  • Khanty: kca
  • Mansi: mns
  • Moksha: mdf
  • Nenets: yrk
  • Selkup: sel
  • Veps: vep
  • Western Mari: mrj

License and access

  • Some versions of this resource are available publicly (PUB), whereas others require you to log in as an academic user (ACA) or to apply for individual access rights (RES).
  • Click on the license image to see the resource-specific license text.

This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021050706

Corpus of Old Literary Finnish (vks)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

Written Finnish texts from the years between 1543 and 1810, browsable and searchable on the web. The collection contains bible translations and religious texts (e.g. all of Mikael Agricola’s Finnish works), legal texts, poems, and texts concerning agriculture, nature, health etc., among others. It was compiled for lexicographic use.

More information on the corpus: http://kaino.kotus.fi/korpus/vks/meta/vks_coll_rdf.xml

License and access

  • This resource is available publicly (PUB).
  • Click on the license image to see the resource-specific license text.

 


This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021050705

Corpus of Early Modern Finnish (vnsk)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

This resource contains written Finnish from the 19th century (mostly from the years between 1810 and 1880), browsable and searchable on the web. The collection contains published literature, periodicals, newspapers, and dictionaries, among others, with a focus on the earliest and most important publications and a wide thematic coverage. Texts written originally in Finnish were preferred to translations.

Further details of each version of the resource are maintained in the metadata record, findable via the persistent identifier (see the link at the resource title).

Read more about the original version of the corpus provided by the Institute for the Languages of Finland: http://kaino.kotus.fi/korpus/1800/meta/1800_coll_rdf.xml

License and access

  • All versions of this resource are available publicly (PUB).
  • Click on the license image to see the resource-specific license text.

 


This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021050704

 

Classics of Finnish Literature, Kielipankki Version (skk)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

This corpus contains works of established Finnish authors published from 1880s to 1940s. It includes prose fiction, plays, poetry and aphorisms, some written originally in Swedish.

Further details of each version of the resource are maintained in the metadata record, findable via the persistent identifier (see the link at the resource title).

Read more about the original version of the corpus provided by the Institute for the Languages of Finland: https://kaino.kotus.fi/korpus/klassikot/meta/klassikot_coll_rdf.xml

License and access

  • All versions of this resource are available publicly (PUB).
  • Click on the license image to see the resource-specific license text.

 


This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021050703

Helsinki Corpus of English Texts (HC)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

The Helsinki Corpus of English Texts is a structured multi-genre diachronic corpus, which includes periodically organized text samples from Old, Middle and Early Modern English. Each sample is preceded by a list of parameter codes giving information on the text and its author. The corpus is useful particularly in the study of the change of linguistic features in long diachrony. It can be used as a diagnostic corpus giving general information of the occurrence of forms, structures and lexemes in different periods of English. This information can be supplemented by evidence yielded by more special and focused historical corpora.

More information on the corpus: https://varieng.helsinki.fi/CoRD/corpora/HelsinkiCorpus/

License and access

  • The versions of this resource require you to log in as an academic user (ACA).
  • Click on the license image to see the resource-specific license text.

 


This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021050302

Aleksis Kivi Corpus (SKS)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

This corpus contains all the known letters, manuscripts and published works by the Finnish author Aleksis Kivi (1834–1872), collected by the Finnish Literature Society (Suomalaisen Kirjallisuuden Seura). Most of the texts were written in Finnish while some of the letters and manuscripts are in Swedish.

More information: https://www.finlit.fi/tutkimus/suomalaisen-kirjallisuuden-kriittiset-editiot-edith/aleksis-kivi-korpus/

License and access

  • This resource is available publicly (PUB).
  • Click on the license image to see the resource-specific license text.

 


This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021050301

Classics Library of the National Library of Finland – Kielipankki version (nlfcl)

Suomeksi


Currently available versions of this resource

ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level
ShortnameName and metadataLicenseLocationCiteResource group and helpApplyPublication yearSupport level

Upcoming versions of this resource

These resource versions are not yet available in the Language Bank of Finland.

ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information
ShortnameName and metadataLicenseFormatsSupport levelContact PersonResource group and helpLocationOther information

Resource information

This corpus comprises works written in Finnish and Swedish, which are part of the Classics Library of the National Library of Finland and have been published under the license Public Domain.
The data set in Finnish includes 686 works and the data set in Swedish includes 282 works out of the whole data set of 968 works in Finnish and Swedish, gathered from Doria and processed by Niklas Alén in April 2017.

The data set in Doria is an accumulating resource and it comprises works of established Finnish authors published from 1549 onwards. The time coverage for the Kielipankki version is 1549-1944 with the exception of Maria Jotuni’s ’Huojuva talo’ published in 1963 in the Finnish sub-corpus.
The corpus includes classical literature, e.g. prose, plays and poetry.

A list of all works in Finnish in the Kielipankki version sorted by the author
A list of all works in Swedish in the Kielipankki version sorted by the author

License and access

  • All versions of this resource are available publicly (PUB).
  • Click on the license image to see the resource-specific license text.
  • Some versions of this resource are available in the computing environment (see column ’Location’). icon-question-circle

 


This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021092407

Viimeksi muokattu 2025-10-10

Hae Kielipankki-portaalista:
Krista Ojutkangas
Kuukauden tutkija: Krista Ojutkangas

 

Tulevat tapahtumat


Yhteystiedot

Kielipankin tekninen ylläpito:
kielipankki (ät) csc.fi
p. 09 4572001

Aineistoihin ja muuhun sisältöön liittyvät asiat:
fin-clarin (ät) helsinki.fi
p. 029 4129317

Tarkemmat yhteystiedot