
| Lyhenne | Nimi ja kuvailutiedot | Lisenssi | Sijainti | Viite | Aineistoryhmä ja ohje | Hae käyttöoikeutta | Julkaisuvuosi | Tukitaso |
|---|---|---|---|---|---|---|---|---|
| Lyhenne | Nimi ja kuvailutiedot | Lisenssi | Sijainti | Viite | Aineistoryhmä ja ohje | Hae käyttöoikeutta | Julkaisuvuosi | Tukitaso |
Nämä aineistoversiot eivät vielä ole saatavilla Kielipankin kautta.
| Lyhenne | Nimi ja kuvailutiedot | Lisenssi | Muoto | Tukitaso | Yhteyshenkilö | Sijainti | Aineistoryhmä ja ohje | Muu tieto |
|---|---|---|---|---|---|---|---|---|
| Lyhenne | Nimi ja kuvailutiedot | Lisenssi | Muoto | Tukitaso | Yhteyshenkilö | Sijainti | Aineistoryhmä ja ohje | Muu tieto |
Suomalaisen kirjallisuuden klassikoita -korpuksessa on asemansa vakiinnuttaneiden suomalaisten kaunokirjailijoiden teoksia 1880-luvulta 1940-luvulle. Mukana on erityyppistä proosaa ja näytelmiä sekä lyriikkaa ja aforismeja. Alun perin suomeksi kirjoitettujen teosten lisäksi korpukseen sisältyy joitain etenkin ruotsista tehtyjä runokäännöksiä.
Kunkin aineistoversion tarkemmat tiedot päivitetään kuvailutietueeseen, joka löytyy pysyvällä tunnisteella (ks. linkki aineiston otsikon kohdalla).
Lue lisää tämän korpuksen alkuperäisestä versiosta Kotimaisten kielten keskuksessa: https://kaino.kotus.fi/korpus/klassikot/meta/klassikot_coll_rdf.xml
Tämän sivun pysyvä tunniste: http://urn.fi/urn:nbn:fi:lb-2025071822
| Lyhenne | Nimi ja kuvailutiedot | Lisenssi | Sijainti | Viite | Aineistoryhmä ja ohje | Hae käyttöoikeutta | Julkaisuvuosi | Tukitaso |
|---|---|---|---|---|---|---|---|---|
| Lyhenne | Nimi ja kuvailutiedot | Lisenssi | Sijainti | Viite | Aineistoryhmä ja ohje | Hae käyttöoikeutta | Julkaisuvuosi | Tukitaso |
Nämä aineistoversiot eivät vielä ole saatavilla Kielipankin kautta.
| Lyhenne | Nimi ja kuvailutiedot | Lisenssi | Muoto | Tukitaso | Yhteyshenkilö | Sijainti | Aineistoryhmä ja ohje | Muu tieto |
|---|---|---|---|---|---|---|---|---|
| Lyhenne | Nimi ja kuvailutiedot | Lisenssi | Muoto | Tukitaso | Yhteyshenkilö | Sijainti | Aineistoryhmä ja ohje | Muu tieto |
Tämä aineisto sisältää kirjoitettua suomea 1800-luvulta (enimmäkseen vuosilta 1810-1880) haettavassa muodossa. Korpus sisältää mm. julkaistua kirjallisuutta, aikakauslehtiä, sanomalehtiä ja sanakirjoja. Aineiston koostamisessa on keskitytty vanhimpiin ja tärkeimpiin julkaisuihin, pyritty kattamaan laajasti eri aiheita sekä suosittu alun perin suomeksi kirjoitettuja tekstejä käännösten sijaan.
Kunkin aineistoversion tarkemmat tiedot päivitetään kuvailutietueeseen, joka löytyy pysyvällä tunnisteella (ks. linkki aineiston otsikon kohdalla).
Lue lisää tämän korpuksen alkuperäisestä versiosta Kotimaisten kielten keskuksessa: http://kaino.kotus.fi/korpus/1800/meta/1800_coll_rdf.xml
Tämän sivun pysyvä tunniste: http://urn.fi/urn:nbn:fi:lb-2025071821
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
The University of Oulu Päätalo collection contains the literary output of the author Kalle Päätalo published so far. The works are to be made available via the Language Bank of Finland as several text corpora, the first of which was the Iijoki corpus.
Further details of each version of the resource are maintained in the metadata record, findable via the persistent identifier (see the link at the resource title).
The available resources can be accessed by logging in as an academic user (”ACA”). Click on the license image to see the resource-specific license text.
The Päätalo collection of the University of Oulu includes works by the author Kalle Päätalo (November 11, 1919 – November 20, 2000). The Iijoki series, composed of 26 works, is Päätalo’s autobiographical main work, depicting his life from the 1910s to the 1990s.
At the initiative of University Lecturer Maija Saviniemi of the University of Oulu, Kalle Päätalo’s relatives Riitta Päätalo, Aliisa Oksanen and Emmi Oksanen as well as Gummerus Kustannus have made it possible to publish the material in the Language Bank. The material is available through the Language Bank of Finland for research purposes.
In the FIN-CLARIN project, the first Korp version of the Iijoki dataset was structured by Erik Axelson with the Turku Neural Parser Pipeline (TNPP) parser of the Turku NLP group. The data has also been structured in Kielipankki with the TDPP parser, which is based on the TDT parser developed by the Turku BioNLP group and further developed in Kielipankki. Based on the TDPP parsing, a list of elements was created that the parser could not reliably determine in their basic form. Instead, the annotation is marked as OTHER_UNK. A large number of these words are dialect words in different forms, so it is useful to look for them in the data using their basic forms.
Aakkostettu lista OTHER_UNK (txt; 1,5 Mt)
Iijoki-sarjan 200 yleisintä murresanaa (pdf; 31 kt)
A wide range of searches and statistics on the material can be made in the Korp service of the Language Bank of Finland. The Korp Extended Search tab can be used to narrow searches, for example, by selecting the title or date of a work as a search criterion and entering the title or year of publication in the selection field.
The Iijoki series consists of 26 volumes, containing around 17 000 pages of fictional text based on the author’s own life:
Huonemiehen poika (1971)
Tammettu virta (1972)
Kunnan jauhot (1973)
Täysi tuntiraha (1974)
Nuoruuden savotat (1975)
Loimujen aikaan (1976)
Ahdistettu maa (1977)
Miinoitettu rauha (1978)
Ukkosen ääni (1979)
Liekkejä laulumailla (1980)
Tuulessa ja tuiskussa (1981)
Tammerkosken sillalla (1982)
Pohjalta ponnistaen (1983)
Nuorikkoa näyttämässä (1984)
Nouseva maa (1985)
Ratkaisujen aika (1986)
Pyynikin rinteessä (1987)
Reissutyössä (1988)
Oman katon alle (1989)
Iijoen kutsu (1990)
Muuttunut selkonen (1991)
Epätietoisuuden talvi (1992)
Iijoelta etelään (1993)
Pato murtuu (1994)
Hyvästi, Iijoki (1995)
Pölhökanto Iijoen törmässä (1998)
This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2023110921
| Lyhenne | Nimi ja kuvailutiedot | Lisenssi | Sijainti | Viite | Aineistoryhmä ja ohje | Hae käyttöoikeutta | Julkaisuvuosi | Tukitaso |
|---|---|---|---|---|---|---|---|---|
| Lyhenne | Nimi ja kuvailutiedot | Lisenssi | Sijainti | Viite | Aineistoryhmä ja ohje | Hae käyttöoikeutta | Julkaisuvuosi | Tukitaso |
Nämä aineistoversiot eivät vielä ole saatavilla Kielipankin kautta.
| Lyhenne | Nimi ja kuvailutiedot | Lisenssi | Muoto | Tukitaso | Yhteyshenkilö | Sijainti | Aineistoryhmä ja ohje | Muu tieto |
|---|---|---|---|---|---|---|---|---|
| Lyhenne | Nimi ja kuvailutiedot | Lisenssi | Muoto | Tukitaso | Yhteyshenkilö | Sijainti | Aineistoryhmä ja ohje | Muu tieto |
Oulun yliopiston Päätalo-kokoelma sisältää kirjailija Kalle Päätalon tähän asti julkaistun kirjallisen tuotannon. Teoksia tuodaan saataville Kielipankin kautta useina kokonaisuuksina, joista ensimmäinen oli Iijoki-korpus.
Toisessa vaiheessa on tarkoitus julkaista korpusmuodossa seuraavat teokset:
Kunkin aineistoversion tarkemmat tiedot päivitetään kuvailutietueeseen, joka löytyy pysyvällä tunnisteella (ks. linkki aineiston otsikon kohdalla).
Tämän aineiston versioihin täytyy kirjautua akateemisena käyttäjänä (ACA). Lisenssikuvaketta napauttamalla näet tarkan aineistokohtaisen lisenssin.
Oulun yliopiston Päätalo-kokoelma sisältää kirjailija Kalle Päätalon (11.11.1919-20.11.2000) teoksia. Iijoki-sarja on 26 teoksesta koostuva Päätalon omaelämäkerrallinen pääteos, jossa kirjailija kuvaa elämäänsä 1910-luvulta aina 1990-luvulle asti.
Aineiston julkaisemisen Kielipankissa ovat tehneet mahdolliseksi Oulun yliopiston yliopistonlehtori Maija Saviniemen aloitteesta Kalle Päätalon omaiset Riitta Päätalo, Aliisa Oksanen ja Emmi Oksanen sekä Gummerus Kustannus. Aineisto on Kielipankin kautta saatavilla tutkimuskäyttöön.
Iijoki-aineiston ensimmäisen Korp-version on FIN-CLARIN-hankkeessa jäsentänyt Erik Axelson Turku NLP -ryhmän Turku Neural Parser Pipeline (TNPP) -jäsentimellä. Aineisto on myös jäsennetty Kielipankissa TDPP-jäsentimellä, joka on Turku BioNLP -ryhmän kehittämän TDT-jäsentimen pohjalta Kielipankissa edelleen kehitetty jäsennin. TDPP-jäsennyksen pohjalta on luotu lista aineiston sisältämistä saneista, joita jäsennin ei ole kyennyt luotettavasti perusmuotoistamaan. Sen sijaan annotaatiossa on merkintä OTHER_UNK. Suuri osa näistä saneista on murresanoja eri muodoissaan joten murresanoja tutkivan kannattaa etsiä niitä aineistosta pintamuotojen avulla.
Aakkostettu lista OTHER_UNK (txt; 1,5 Mt)
Iijoki-sarjan 200 yleisintä murresanaa (pdf; 31 kt)
Aineistosta voi tehdä monenlaisia hakuja ja tilastoida tuloksia Kielipankin Korp-palvelussa. Korpin laajennettu haku -välilehdellä voi rajata hakuja esimerkiksi valitsemalla hakukriteeriksi teoksen nimen tai ajankohdan ja kirjoittamalla valintakenttään vastaavasti teoksen nimen tai julkaisuvuoden.
Iijoki-sarjassa on 26 osaa, jotka sisältävät yhteensä noin 17000 sivua kaunokirjallista, kirjailijan omaan elämään pohjautuvaa tekstiä:
Huonemiehen poika (1971)
Tammettu virta (1972)
Kunnan jauhot (1973)
Täysi tuntiraha (1974)
Nuoruuden savotat (1975)
Loimujen aikaan (1976)
Ahdistettu maa (1977)
Miinoitettu rauha (1978)
Ukkosen ääni (1979)
Liekkejä laulumailla (1980)
Tuulessa ja tuiskussa (1981)
Tammerkosken sillalla (1982)
Pohjalta ponnistaen (1983)
Nuorikkoa näyttämässä (1984)
Nouseva maa (1985)
Ratkaisujen aika (1986)
Pyynikin rinteessä (1987)
Reissutyössä (1988)
Oman katon alle (1989)
Iijoen kutsu (1990)
Muuttunut selkonen (1991)
Epätietoisuuden talvi (1992)
Iijoelta etelään (1993)
Pato murtuu (1994)
Hyvästi, Iijoki (1995)
Pölhökanto Iijoen törmässä (1998)
Tämän sivun pysyvä tunniste: http://urn.fi/urn:nbn:fi:lb-2023110922
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
The corpus consists of the correspondence of Elias Lönnrot with private individuals as well as institutions from 1823 until Lönnrot’s death. Elias Lönnrot was the creator of the Kalevala, medical doctor and professor of language (1802 – 1884). The letters and drafts of letters belong to the Archive of the Finnish Literature Society and have been transliterated for the project Elias Lönnrot’s Letters Online, http://lonnrot.finlit.fi/omeka/.
This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2022051701
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
ERME contains predominantly original Erzya and Moksha literature. It consists of several media publications from the 19th to the 20th century. ERME was mapped in Saransk in 1997-2004, while in Helsinki it has been mapped since 2004. The most basic format used is XML, with a granularity extending to chapter level. The goal is to create corpora with a granularity extending to word level with bibliographic reference to the sentence level.
The new version contains the literature found in the older instance and has grown markedly. While the old version was merely text divided to sentence level, the new version has lemmatization and dependencies. At sentence level contextual translation may be present (English or Finnish translation), while at word level there is morphological encoding, corresponding to each context. Preliminary morpho-syntactic analysis is carried out using HFST-based transducers and Constraint Grammar disambiguation, function and dependency tagging, which have been developed in the Giellatekno infrastructure of the University of Tromsø.
The grammatical analysis and labeling comply with the practices developed in the Giellatekno infrastructure of the University of Tromsø. These practices are applied in the documentation of several Uralic languages.
The amount of the processed material is to be increased subsequently.
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2022052001
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
The Corpus of Translated Finnish has been compiled in 1999 in the University of Eastern Finland (University of Joensuu at the time and it’s School of Translation Studies) in the project ’Translation Universals’, led by professor Anna Mauranen.
The corpus comprises two parts: texts originally written in Finnish and texts translated into Finnish from different languages. The following text types are represented in the corpus: academic texts, literature, childrens’ literature, biography, popular literature and fiction, detective fiction and popular science.
This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021081102
The North Saami Corpus contains Kerttu Vuolab’s novel Cheppari cháráhus written in Northern Sami. The corpus is a part of the UHLCS corpus collection.
UHLCS has many different IPR holders. Should you have any questions regarding the collection, please contact Pirkko Suihkonen (suihkonen.pirkko@gmail.com).
| Latest versions/subcorpora: | |
| North Saami Corpus (Literature) (UHLCS) Metadata and license Attribution instructions |
The data is available upon request via CSC’s computing environment |
| North Saami Corpus (Literature) (UHLCS), Helsinki Korp Version Metadata and license Attribution instructions |
The resource will be available soon |
| Search for all versions in META-SHARE |
Of this language corpus different versions/subcorpora are (or might be in the future) published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. The links to the different versions can be found from the list above.
Detailed information on the content of each version, user rights and licenses can be found from it’s specific metadata record in META-SHARE.
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021061605
The Finnish Corpus is a part of the UHLCS corpus collection.
UHLCS has many different IPR holders. Should you have any questions regarding the collection, please contact Pirkko Suihkonen (suihkonen.pirkko@gmail.com).
| Latest versions/subcorpora: | |
| Finnish Corpus (Literature) (UHLCS) Metadata and license Attribution instructions |
The data is available upon request via CSC’s computing environment |
| Finnish Corpus (Literature) (UHLCS), Helsinki Korp Version Metadata and license Attribution instructions |
The resource will be available soon |
| Search for all versions in META-SHARE |
Of this language corpus different versions/subcorpora are (or might be in the future) published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. The links to the different versions can be found from the list above.
Detailed information on the content of each version, user rights and licenses can be found from it’s specific metadata record in META-SHARE.
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021061604
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
The Morpho-Syntactic Database of Mikael Agricola’s Works contains the Finnish parts of Mikael Agricola’s works (Abckiria, Rukouskiria, Se Wsi testamenti, Käsikiria, Messu, Piina, Psaltari, Veisut, Profeetat). The database was created from 2004 to 2008, when the texts offered by the Institute for the Languages of Finland were coded and annotated by the Finnish Language Department of the University of Turku in the project ’The Scientific Edition and the Morpho-Syntactic Database of Mikael Agricola’s Works’ by broadening the model used in the Finnish Dialect Syntax Archive. The project was funded by the Academy of Finland and the Alfred Kordelin Foundation. The words of the corpus have been annotated by keyword, part of speech, morphological components and syntactical function. All the grammatical units have been coded according to their places in the works and in the books of the Bible.
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021051204
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
The corpus contains Finnish books, for which the copyright already has expired, available on the web page of the Gutenberg project (https://www.gutenberg.org/) in 2014. The texts have not been linguistically annotated.
A list of the works the Finnish Gutenberg Corpus contains can be found here: Gutenberg.pdf
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021051203
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
A 34-volume collection of Finnic oral poetry, lyric, short rhymes, incantations etc., collected and recorded from the 16th century to the 1930s and published mostly between 1908 and 1948, with a supplement volume published in 1997. The corpus is multilingual, with texts in Finnish, Karelian, Olonets, Ludian, Votic, Izhorian, Latin and Swedish.
More information on the corpus: https://skvr.fi/skvr-teos
This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021051201
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
This resource contains written Finnish from the 19th century (mostly from the years between 1810 and 1880), browsable and searchable on the web. The collection contains published literature, periodicals, newspapers, and dictionaries, among others, with a focus on the earliest and most important publications and a wide thematic coverage. Texts written originally in Finnish were preferred to translations.
Further details of each version of the resource are maintained in the metadata record, findable via the persistent identifier (see the link at the resource title).
Read more about the original version of the corpus provided by the Institute for the Languages of Finland: http://kaino.kotus.fi/korpus/1800/meta/1800_coll_rdf.xml
This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021050704
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
This corpus contains works of established Finnish authors published from 1880s to 1940s. It includes prose fiction, plays, poetry and aphorisms, some written originally in Swedish.
Further details of each version of the resource are maintained in the metadata record, findable via the persistent identifier (see the link at the resource title).
Read more about the original version of the corpus provided by the Institute for the Languages of Finland: https://kaino.kotus.fi/korpus/klassikot/meta/klassikot_coll_rdf.xml
This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021050703
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
This corpus contains all the known letters, manuscripts and published works by the Finnish author Aleksis Kivi (1834–1872), collected by the Finnish Literature Society (Suomalaisen Kirjallisuuden Seura). Most of the texts were written in Finnish while some of the letters and manuscripts are in Swedish.
More information: https://www.finlit.fi/tutkimus/suomalaisen-kirjallisuuden-kriittiset-editiot-edith/aleksis-kivi-korpus/
This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021050301
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
This corpus comprises works written in Finnish and Swedish, which are part of the Classics Library of the National Library of Finland and have been published under the license Public Domain.
The data set in Finnish includes 686 works and the data set in Swedish includes 282 works out of the whole data set of 968 works in Finnish and Swedish, gathered from Doria and processed by Niklas Alén in April 2017.
The data set in Doria is an accumulating resource and it comprises works of established Finnish authors published from 1549 onwards. The time coverage for the Kielipankki version is 1549-1944 with the exception of Maria Jotuni’s ’Huojuva talo’ published in 1963 in the Finnish sub-corpus.
The corpus includes classical literature, e.g. prose, plays and poetry.
A list of all works in Finnish in the Kielipankki version sorted by the author
A list of all works in Swedish in the Kielipankki version sorted by the author
This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021092407
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
The corpus contains modern written Swedish texts published in Finland (1990s). The kernel corpus is accompanied by a minor section on spoken language. The Finland Swedish Text Corpus is a part of the UHLCS corpus collection.
More information on the corpus (in Finnish)
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2016050212
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
The corpus contains the sub-corpora ParFin 2016, Finnish-Russian Parallel Corpus of Literary Texts and ParRus 2016, Russian-Finnish Parallel Corpus of Literary Texts.
The sub-corpus ParRus2016 contains Russian literary texts (classical literature & 20th century) and their translations into Finnish aligned at paragraph level.
The sub-corpus ParFin2016 contains Finnish literary texts from 1990-2010 and their translations into Russian aligned at sentence level.
This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2021092405
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
The Finnish Language Text Collection (Suomen kielen tekstikokoelma) is a selection of electronic Finnish texts from the 1990s. The collection contains texts from newspapers, journals as well as books. See the content details in Finnish.
All of the material is available for academic research use. A large part of the texts is also available for commercial use.
The collection was compiled by the Institute for the Languages of Finland, the Department of General Linguistics of the University of Helsinki and the Foreign Languages Department of the University of Joensuu.
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-201403268
Viimeksi muokattu 2025-10-01
