
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Location | Cite | Resource group and help | Apply | Publication year | Support level |
These resource versions are not yet available in the Language Bank of Finland.
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
|---|---|---|---|---|---|---|---|---|
| Shortname | Name and metadata | License | Formats | Support level | Contact Person | Resource group and help | Location | Other information |
The Sapu corpus is a sociolinguistic corpus representing the spoken language of Satakunta in the 21st century (and, more broadly, contemporary Finnish spoken language), which has been lemmatized and annotated morphologically and syntactically. The corpus contains samples of the spoken language of six localities (Rauma, Honkilahti, Luvia, Pori, Ulvila-Nakkila, Kokemäki). The records consist of audio recordings and the corresponding transcripts.
Further information about the recordings
Further details of each version of the resource are maintained in the metadata record, findable via the persistent identifier (see the link at the resource title).
A general description of the corpus can be found, e.g., in the following publication:
Kurki, Tommi, Huhtala, Atte, Koivunen, Tomi & Mäkitalo, Nelli (2022). Satakuntalaisuus puheessa -korpus ja siitä tehtyjä synkretismihavaintoja: Syncretism in Colloquial Finnish – Observations of the Satakunta corpus. AFinLA-teema, 14, pp. 103-134. doi:10.30660/afinla.111247
This page has a persistent identifier: http://urn.fi/urn:nbn:fi:lb-2025091121
Last modified on 2025-11-10