Resource-specific data protection terms and conditions (sapu)


Title of the Resource: The Corpus of Sociolinguistic Variation in the Province of Satakunta (sapu)

Metadata: http://urn.fi/urn:nbn:fi:lb-2022092121
License: http://urn.fi/urn:nbn:fi:lb-2022092122

This page describes the specific conditions regarding the processing of the personal data in this Resource. In addition to these conditions, see the guidelines for processing personal data in the Language Bank of Finland.

Controller of the data stored in the Language Bank of Finland

University of Turku
20014 University of Turku

Data Protection Officer of the Controller

Data Protection Officer of the University of Turku
Email: dpo@utu.fi

The Language Bank of Finland is a Data Processor on behalf of the University of Turku. For further details on the data protection of the resources in the Language Bank of Finland, please contact the FIN-CLARIN helpdesk.

Description of the personal data

Types of personal data in the Resource

For the speakers recorded in the corpus, age is given at the level of decade of birth and place of residence at the level of municipality. Because the speakers' voices are audible on the audio recordings, their gender is also revealed indirectly.

The names of the speakers and of the persons discussed have been pseudonymized in the transcripts and annotation files. Likewise, the names of houses and farms have been modified in the transcripts and annotation files. Only the names of public figures appear in their original form in these files.

On the audio recordings, the names appear in their original form. The recordings may not be used to identify individual people or to harm them in any way.

No recordings were selected for the corpus that would in principle include talk about sensitive topics. However, in a large proportion of the conversations the speakers discuss or mention their place of residence or place of birth, events in their own lives, or other people from the same locality. In this respect, too, the recordings have been carefully selected so as not to reveal sensitive information.

Categories of data subjects

The material includes everyday language use by people living in the region of Satakunta in the 2000s. Three south-western dialect localities (Rauma, Luvia, Honkilahti) and three transitional dialect localities (Pori, Ulvila-Nakkila, Kokemäki) are represented. The recordings and speakers are divided into five groups according to decade of birth (those born in the 1920s and 1930s, those born in the 1940s and 1950s, those born in the 1960s and 1970s, those born in the 1980s and those born in the 1990s). The speakers are also divided into urban speakers (Pori and Rauma) and rural speakers (Luvia, Honkilahti, Ulvila-Nakkila and Kokemäki).

Data protection terms and conditions for this Resource

In these data protection terms and conditions, End-User means the party acting as the Controller for the Resource received, in accordance with the General Data Protection Regulation (EU) 2016/679. Depending on the case and the purpose of Resource use, End-User may therefore mean the CLARIN service user’s employer or organisation (e.g., a university, university of applied sciences or other research organisation) or the service user personally.

The End-User understands that when receiving the Resource, it becomes a Controller, as referred to in the data protection legislation. The End-User must ensure that it complies with the applicable data protection legislation when processing personal data.

The purpose of use of personal data

  • The Resource may only be used for the research purpose described in the research plan approved by the Controller.
  • The Resource may only be used for scholarly, non-commercial research purposes in the field of language research.

Location and transfer of the personal data

  • Personal data may not be processed outside Finland.

Other conditions for data processing

The material may not be used to identify speakers. The recordings contained in the material may not be combined with personal data available elsewhere, nor may the recordings contained in the material be compared with recordings available elsewhere in order to determine whether the speaker is the same person.

Publish a link to your Privacy Notice

When you start using this Resource, share a title for your project that is understandable to the general public, together with a link to your publicly available privacy notice (see instructions). This information will be published on the website of the Language Bank of Finland.

Updates

This page was last updated on 31.3.2025.


Persistent identifier of this page: http://urn.fi/urn:nbn:fi:lb-2022092124