The University of Helsinki Language Corpus Server (UHLCS) is a multilingual data bank and data server that was originally located at the Department of General Linguistics, University of Helsinki. In September 2007, the UHLCS was moved to CSC (the Finnish IT Center for Science). The UHLCS, which is maintained by the University of Helsinki, was founded late in 1980. At present, the UHLCS contains computer corpora from more than 50 languages, including samples of minority languages and extensive corpora representing different text types. In 2000, the corpora from the Uralic, Turkic, Tungusic, Mongolic, Chukotko-Kamchatkan, Iranian and North-East Caucasian languages were edited for public use with the financial support of the Max Planck Institute for Evolutionary Anthropology, Leipzig. In summer 2003, the basis for the metadata descriptions of the corpora was prepared with the financial support of the ECHO project (ECHO = European Cultural Heritage Online). The UHLCS also provides tools that can be used to analyze the corpora. The use of most of the corpora is restricted to research and teaching.
The following corpora are available in Kielipankki – the Language Bank of Finland (puhti.csc.fi, access rights instructions).
Latest versions/subcorpora:
Chuvash Corpus (UHLCS) | Access the corpus in Puhti
English Corpus (UHLCS) | Access the corpus in Puhti
Corpus of Erzya and Moksha Mordvin Literature and Journals and Komi Zyrian Literature (UHLCS) | Access the corpus in Puhti
Erzya and Moksha Mordvin Word List Corpus (UHLCS) | Access the corpus in Puhti
Estonian Corpus 1 (UHLCS) | Access the corpus in Puhti
Estonian Corpus 2 (UHLCS) | Access the corpus in Puhti
Finnish Corpus (Bibles) (UHLCS) | Access the corpus in Puhti
Finnish Corpus (Literature) (UHLCS) | Access the corpus in Puhti
The Helsinki Korp Version of the Finland-Swedish Text Corpus (UHLCS) | Access the corpus in Korp
The Taito Version of the Finland-Swedish Text Corpus (UHLCS) | Access the corpus in Puhti
Ingrian Corpus (UHLCS) | Access the corpus in Puhti
Khanty Corpus (North Khanty, Corpora and Translations) (UHLCS) | Access the corpus in Puhti
Komi Zyrian Corpus (UHLCS) | Access the corpus in Puhti
Latin Corpus (UHLCS) | Access the corpus in Puhti
Lude (Ludian) Corpus (UHLCS) | Access the corpus in Puhti
Nenets Corpus (Tundra Nenets) (UHLCS) | Access the corpus in Puhti
North Saami Corpus (Literature) (UHLCS) | Access the corpus in Puhti
North Saami Corpus (Sámikultuvradoaibmagotti smiehttamush) (UHLCS) | Access the corpus in Puhti
Quantifiers and Quantification in Finnish and Languages Spoken in the Central Volga–Kama Region (UHLCS) | Access the corpus in Puhti
The Susanne Corpus (UHLCS) | Access the corpus in Puhti
Ume Saami Corpus (UHLCS) | Access the corpus in Puhti
Uralic, Turkic, Indo-Iranian and Mongol languages; languages of Siberia and Caucasia (UHLCS) | Access the corpus in Puhti
Uzbek-English Dictionary (UHLCS) | Access the corpus in Puhti
Lists of Words Corpus (UHLCS) | Access the corpus in Puhti
Search for all versions in META-SHARE
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2023030901