Follow-up Study of Dialects of Finnish

Suomeksi

Current versions of this resource:
Follow-up Study of Dialects of Finnish, downloadable version
icon-info-circle Metadata and license
icon-exclamation-triangle PRIV: This resource contains personal data, see license.
icon-quote-right Attribution instructions
Apply for access rights

This version will soon be available for download.
Follow-up Study of Dialects of Finnish
icon-info-circle Metadata and license
icon-exclamation-triangle PRIV: This resource contains personal data, see license.
icon-quote-right Attribution instructions


Since the LAT platform was taken out of use in late 2020, this version is no longer available. However, the same content will be available for download (see above).
Look for other versions of this resource

Resource contents

The follow-up study of Finnish dialects was started in 1989. It is a sociolinguistic and dialectological longitudinal study which is carried out in cooperation with the universities. The goal of the project has been to study the development of regional dialects in real time in 10 rural municipalities at the interval of 10 years. The municipalities chosen represent the traditional main dialect groups of Finnish. In each town, altogether 15 speakers have been used as informants. The external variables used in the study include age and sex: in each town, speakers of three generations have been studied (O = old, M = middle-aged, Y = young), both men and women. The data have been collected by using the traditional dialect interview method, and the study has focused on phonological and morphological features. Reports of ten sub-studies and a final report of the first round have been completed.

The second round of the project was started in 1999 and completed in 2007. Eight of the sub-studies have been completed, one of them as a form of the dissertation (Kurki 2005). During the second round, other types of data collected in addition to interviews. The part of the sub-studies focus on other than traditional sociolinguistic variables.

Further details about the content and the terms and conditions regarding the different corpus versions are available in the corresponding metadata records.



Last updated: 30.8.2023

This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2023082101

Uralic UD

Latest versions/subcorpora:  

Uralic UD v2.10, Kielipankki Korp version
icon-info-circle Metadata and license
icon-quote-right Attribution instructions

Select the corpus in Korp
Search for all versions in META-SHARE  

The latest version of this corpus has been annotated according to the Universal Dependencies version 2.10 for the following Uralic languages: Erzya, Estonian, Finnish, Hungarian, Karelian, Komi-Permyak, Komi-Zyrian, Livvi, Moksha, North Sami and Skolt Sami.

Treebanks and their licenses:

Erzya (JR); CC BY-SA 4.0
Estonian (EDT, EWT); CC BY-NC-SA 4.0
Finnish (FTB, OOD, PUD, TDT); FTB: CC BY 4.0, other: CC BY-SA 4.0
Hungarian (Szeged); CC BY-NC-SA 3.0
Karelian (KKPP); CC BY-SA 4.0
Komi-Permyak (UH); CC BY-SA 4.0
Komi-Zyrian (IKDP, Lattice); CC BY-SA 4.0
Livvi (KKPP); CC BY-SA 4.0
Moksha (JR); CC BY-SA 4.0
North Sami (Giella); CC BY-SA 4.0
Skolt Sami (Giellagas); CC BY-SA 4.0

Universal Dependencies v2.10 License Agreement

 

This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2022061003

Finnish TreeBank (FTB)

These treebanks and parsebanks for Finnish were created by the FinnTreeBank project. The data in FinnTreeBank 1 is based on model sentences in Iso suomen kielioppi (The Large Grammar of Finnish), manually annotated with dependency-syntactic descriptions (see the tagset and the annotation manual). FinnTreeBank 1 was built as a Grammar Definition Corpus and intended as a model for further automatic analysis of Finnish. FinnTreeBank 2 is a small extension to FinnTreeBank 1, and it was manually annotated in the same fashion as the first treebank. FinnTreeBank 3 is a large treebank that was only automatically annotated, using an experimental method. As a result, the annotations in the third treebank are of much lower quality in comparison to the manually annotated treebanks.

The UD version of FinnTreeBank 1 was derived from FinnTreeBank 1 2014 by a scripted mapping of labels and some restructuring in an attempt to conform approximately to the UD Finnish model.

More information on UD Finnish FTB

UD versions:  
UD Finnish-FTB: The UD version of FinnTreeBank 1
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Download the resource
Search for these versions in META-SHARE  
Latest versions/subcorpora:  
The Downloadable Version of the Finnish TreeBank 1
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Download the resource
The Helsinki Korp Version of the Finnish TreeBank 1
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Select the corpus in Korp (as part of FTB2)
The Downloadable Version of the Finnish TreeBank 2
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Download the resource
The Helsinki Korp Version of the Finnish TreeBank 2
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Select the corpus in Korp
The Downloadable Version of the Finnish TreeBank 3
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Download the resource
The Helsinki Korp Version of the Finnish TreeBank 3
icon-info-circle Metadata and license
icon-quote-right Attribution instructions
Select the corpus in Korp
Search for these versions in META-SHARE  

 

Several different versions of these resources are published in the Language Bank of Finland. The versions are available through the Language Bank Download Service and/or through the Korp concordance tool. The links to the different versions can be found on the list above. Details on the content and license of each version are available via the metadata records.

Annotation details

Publications related to FinnTreeBank

 


This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021031604

 

Search the Language Bank Portal:
Tanja Säily
Researcher of the Month: Tanja Säily

 

Upcoming events


Contact

The Language Bank's technical support:
kielipankki (at) csc.fi
tel. +358 9 4572001

Requests related to language resources:
fin-clarin (at) helsinki.fi
tel. +358 29 4129317

More contact information