The downloadable corpus contains a file, metadata.tsv, that lists the attributes of each recording. There are 20 attributes in total. They are listed below with the field number, the name of the attribute and a short description. For some attributes, the possible values and their frequencies are also given in descending order of frequency, either in full or for the most common values only (”…” marks a break in the list). Empty values are marked with an underscore ’_’.
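As a rough illustration of the layout described above, the following Python sketch reads tab-separated rows into dictionaries keyed by the 20 attribute names and maps the underscore marker to `None`. The function name `read_metadata` and the sample row are made up for this example. Whether metadata.tsv starts with a header row, and its text encoding, are not stated here, so the sketch simply skips a header row if one is present.

```python
import csv
import io

# Attribute names in field order, as listed on this page.
FIELDS = [
    "recordingId", "topic", "L1", "gender", "age", "dialect",
    "placeOfResidence", "education", "occupation", "clientId",
    "sessionId", "itemId", "recordingTimestamp", "recordingDuration",
    "recordingSampleRate", "recordingBitDepth",
    "recordingNumberOfChannels", "contentType",
    "clientPlatformName", "clientPlatformVersion",
]


def read_metadata(handle):
    """Yield one dict per recording; the empty marker '_' becomes None."""
    # QUOTE_NONE keeps quote characters inside values untouched.
    reader = csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row or row == FIELDS:
            # Skip blank lines and a header row (assumption: there may be one).
            continue
        if len(row) != len(FIELDS):
            raise ValueError(f"expected {len(FIELDS)} fields, got {len(row)}")
        yield {name: (None if value == "_" else value)
               for name, value in zip(FIELDS, row)}


# Made-up sample row for demonstration only; not taken from the corpus.
sample_row = "\t".join([
    "1", "Mitt husdjur", "['svenska']", "['Kvinna']", "['21-30 år']",
    "_", "_", "_", "_", "7", "s1", "i1", "2021-01-01T12:00:00.000Z",
    "3.5", "48000", "16", "1", "audio/wave", "_", "_",
])
rows = list(read_metadata(io.StringIO(sample_row + "\n")))
```

For the real file, the same function could be called on a handle opened with `open("metadata.tsv", encoding="utf-8", newline="")`, assuming the file is UTF-8 encoded.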
1 ’recordingId’ Recording id, replaced with shuffled consecutive numbering for anonymity
2 ’topic’ Topic of the recording
335 Mitt husdjur
237 Ute i naturen
222 Djur som berör
190 Jaga eller inte?
185 I grannskapet
181 Hjälp djuret
180 Klimathot
179 På bondgården
158 Best of
153 Min drömpartner
145 Ditt eget språk
128 Förklara ordet
125 Vänsterprassel
115 Snacka snaps
114 Gissa hemorten
111 Monogami
108 Evig kärlek?
104 Klarar man sig?
98 Språkpoliser
98 På resa i Sverige
97 Fester att minnas
92 Bästa evenemanget
91 Vänner för livet
87 Småbarnsliv
86 Mitt gulisår
85 Mitt bästa ligg
78 Distansstudier
76 Mina idoler
60 Dramatiska händelser
55 Udda grenar
53 Fina minnen
52 Stafettkarnevalen
50 Skoljumppa
48 Första frågan
48 Fejk eller inte?
46 Drömreklamen
42 Falska nyheter
32 Första ämnet
17 Viktiga nyheter?
15 Bilder som ljuger
14 Tro det eller ej
13 Skenet bedrar
8 Tolka somespråk
7 Redigerade bilder
6 Botar på some
3 ’L1’ First language(s) (L1) of the speaker
1997 [’svenska’]
1735 _
332 [’svenska’, ’finska’]
81 [’finska’, ’svenska’]
46 [’finska’]
34 [’svenska’, ’Finska’]
19 [’svenska’, ’Engelska’]
13 [’svenska’, ’engelska’]
…
4 ’gender’ Gender of the speaker
1968 [’Kvinna’]
1814 _
549 [’Man’]
52 [’Annan’]
41 [’Vill inte säga’]
5 ’age’ Age group of the speaker
1798 _
764 [’21-30 år’]
585 [’31-40 år’]
432 [’51-60 år’]
300 [’11-20 år’]
286 [’41-50 år’]
146 [’61-70 år’]
73 [’71-80 år’]
18 [’1-10 år’]
16 [’101+ år’]
6 [’81-90 år’]
6 ’dialect’ Dialect of the speaker
1775 _
773 [’Mellersta Österbotten’]
471 [’Mellersta Nyland’]
426 [’Västra Nyland’]
288 [’Östra Nyland’]
179 [’Norra Österbotten’]
145 [’Södra Österbotten’]
116 [’Västra Åboland’]
103 [’Övriga Finland’]
50 [’Västra Åland’]
36 [’Östra Åboland’]
35 [’Något annat land’]
27 [’Östra Åland’]
7 ’placeOfResidence’ Place of residence of the speaker
2138 _
439 [’Helsingfors’]
174 [’Vasa’]
142 [’Esbo’]
115 [’Jakobstad’]
95 [’Åbo’]
87 [’Borgå’]
67 [’Raseborg’]
56 [’Korsholm’]
56 []
…
8 ’education’ Education of the speaker
2000 _
1615 [’Högskoleexamen’]
306 [’Yrkesexamen’]
189 [’Utbildning efter grundskola’]
169 [’Forskarutbildning’]
145 [’Grundskola’]
9 ’occupation’ Occupation of the speaker
3552 _
261 [’Expert eller specialist, t.ex. lärare’]
114 [’Kontors- och kundtjänstpersonal’]
112 [’Övrig arbetstagare’]
71 [’Chef’]
55 [’Service- och försäljningspersonal’]
29 [’Skokelev’]
17 [’Jordbrukare, skogsarbetare m.fl.’]
11 [’Studerande’]
…
10 ’clientId’ Client id, replaced with shuffled consecutive numbering for anonymity
11 ’sessionId’ Session id
12 ’itemId’ Item id
13 ’recordingTimestamp’ Recording timestamp in ISO 8601 format, [YYYY]-[MM]-[DD]T[hh]:[mm]:[ss.sss]Z
14 ’recordingDuration’ Recording duration in seconds, with varying precision
15 ’recordingSampleRate’ Recording sample rate
4408 48000
16 44100
16 ’recordingBitDepth’ Recording bit depth, ’16’ for all recordings
17 ’recordingNumberOfChannels’ Number of channels for the recording
4415 1
9 2
18 ’contentType’ Original content type (flac files have been converted to wav)
4408 audio/wave
16 audio/flac
19 ’clientPlatformName’ Client platform name, showing the operating system and browser
20 ’clientPlatformVersion’ Client platform version, showing the operating system and browser
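Several attributes above hold bracketed lists (for example the ’L1’ values), and the timestamp follows a fixed pattern. The Python sketch below shows one possible way to decode them. It assumes the file writes the lists as Python-style literals with either straight or typographic quotes, and that the trailing Z denotes UTC; the helper names `parse_list_field`, `language_set` and `parse_timestamp` are hypothetical and not part of the corpus.

```python
import ast
from datetime import datetime, timezone


def parse_list_field(value):
    """Turn a value such as "['svenska', 'finska']" into a Python list.

    Returns None for the empty marker '_' (or for None).
    """
    if value is None or value == "_":
        return None
    # Assumption: typographic quotes, if present, only delimit the strings.
    normalized = value.replace("\u2019", "'").replace("\u2018", "'")
    items = ast.literal_eval(normalized)
    if not isinstance(items, list):
        raise ValueError(f"not a list value: {value!r}")
    return items


def language_set(value):
    """Case- and order-insensitive view of a list field, e.g. for 'L1'."""
    items = parse_list_field(value)
    if items is None:
        return None
    return frozenset(item.casefold() for item in items)


def parse_timestamp(value):
    """Parse [YYYY]-[MM]-[DD]T[HH]:[MM]:[SS.SSS]Z, treating Z as UTC."""
    parsed = datetime.strptime(value, "%Y-%m-%dT%H:%M:%S.%fZ")
    return parsed.replace(tzinfo=timezone.utc)
```

Folding case and ignoring order makes variants such as [’svenska’, ’Finska’] and [’finska’, ’svenska’] compare equal. That is an analysis choice rather than something the corpus prescribes, since the order of the listed languages may carry meaning.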
Last modified on 2025-06-11