Grant agreement: Academy of Finland no. 345610
Start date: 01-01-2022
Duration: 24 months
WP 1.2: Report on Forced-Alignment Service
Date of reporting: 2022-09
Report author: Martin Matthiesen (CSC)
Contributors: Juho Leinonen (Aalto), Sam Hardwick, Mietta Lennes (UHEL)
Deliverable location: Language Bank Tools Demos (kielipankki.fi) | Forced Alignment
The forced alignment tool produces time stamps for the transcribed words or utterances of an audio file. It can be run on puhti.csc.fi, and a web interface is available on the Language Bank Demo Tools page, which is included in the list of tools at kielipankki.fi.
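As an illustration of how word-level and utterance-level time stamps relate, the following minimal Python sketch derives an utterance span from aligned words. The AlignedWord structure, the utterance_span helper and the example timings are hypothetical and chosen for illustration only; they do not describe the actual output format of the tool.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class AlignedWord:
    """One transcribed word with its time stamps (hypothetical structure)."""
    word: str
    start: float  # seconds from the beginning of the audio file
    end: float    # seconds from the beginning of the audio file


def utterance_span(words: List[AlignedWord]) -> Tuple[float, float]:
    """Return (start, end) of an utterance from its word-level time stamps."""
    if not words:
        raise ValueError("an utterance needs at least one aligned word")
    return (min(w.start for w in words), max(w.end for w in words))


# Illustrative values only, not real aligner output.
words = [
    AlignedWord("hyvää", 0.42, 0.81),
    AlignedWord("päivää", 0.85, 1.37),
]

start, end = utterance_span(words)
print(f"utterance: {start:.2f}-{end:.2f} s")
```

Under these assumptions, the utterance time stamp is simply the earliest word start and the latest word end.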
The source code for the original forced aligner is available on GitHub at https://github.com/aalto-speech/finnish-forced-alignment, and the Docker image on which the tool is based can be found on Docker Hub at https://hub.docker.com/r/juholeinonen/kaldi-align. The specific endpoints for the forced aligner versions installed in the Language Bank of Finland are included in the code repository at https://github.com/Traubert/kielipankki-services, under services/finnish-forced-align.
finnish-forced-alignment: J. Leinonen, S. Virpioja and M. Kurimo. "Grapheme-Based Cross-Language Forced Alignment: Results with Uralic Languages." NoDaLiDa, 2021.