Tools

Suomeksi

The tools and services maintained by the Language Bank may be accessible via a web interface, or they can be installed via download from e.g. GitHub or Korp. You can also find other tools developed by member organizations of FIN-CLARIN / CLARIN ERIC.

Service levels

Our language resources have three different levels of support.

A: The resource is under active development. The Language Bank of Finland fixes any issues as soon as possible.
B: The resource is developed only upon user request. The Language Bank of Finland aims to fix issues concerning the resource, but external contributions may be required.
C: The resource is available ”as is”. The Language Bank of Finland does not fix nor develop the resource.

If you are looking for a tool not listed here, please have a look in META-SHARE or CLARIN Virtual Language Observatory (VLO).

Please find an overview of all our resources sorted by resource families on Resource families Fin-Clarin.

StartNameDescriptionInstructionsInstallInfoAdministratorService level
Korp at the Language Bank of FinlandKorpA web-based concordance tool that can be used for corpus queries based on morphosyntactic analysis and various other features.Instructionsicon-question-circleThe Language Bank of FinlandA
DownloadDownload serviceDownload certain corpora.icon-question-circleThe Language Bank of FinlandA
META-SHAREMETA-SHAREMetadata repository of all the language resources at the Language Bank of Finland.icon-question-circleThe Language Bank of FinlandA
MyllyMyllyVersatile data analysis platform with interactive visualizations and workflows.Instructionsicon-question-circleThe Language Bank of FinlandC
SanatSanatA platform for publishing lexica and word lists.icon-question-circleThe Language Bank of FinlandB
FinTagFinnish TagtoolsA part-of-speech and morphology tagger and a named entity recogniser for Finnish.Installicon-question-circleThe Language Bank of FinlandA
DemoDemo tools at the Language Bank of FinlandDemos of tools that are in development at the Language Bank of Finland: FinTag and FiNER, FinSentiment, FinnWordNet, HFST POS taggers, HFST morphological analyzers, Lemmamatch, etc. (In Finnish)The Language Bank of FinlandC
WebAnnoText annotation tool.User GuideStandalone
installation
icon-question-circleThe Language Bank of FinlandA
SignbankLexical database of Finnish Sign Language.icon-question-circleUniversity of JyväskyläA
OPUSAn interface for open source parallel corpora.icon-question-circleUniversity of Helsinki
The Helsinki Term Bank for the Arts and SciencesA multidisciplinary project that aims to gather a permanent terminological database for all fields of research in Finland.icon-question-circleUniversity of HelsinkiA
LääketutkaLääketutka, "the Medicine Radar", provides analytics about health, medicine and symptom-related discussions in the Suomi24 discussion forum.icon-question-circleUniversity of HelsinkiC
ANEE lex­ical portals of Akka­dianANEE lex­ical portals of Akka­dianThe ANEE Lexical Portal is a graphic semantic dictionary represented as a network. You can use the portal for exploring the meanings of singular Akkadian words in a visual way.icon-question-circleUniversity of Helsinki
Proto-Indo-European LexiconA generative etymological dictionary of Indo-European languagesicon-question-circleUniversity of Helsinki
WancaWancaWanca is a portal for websites in Uralic languages. icon-question-circleUniversity of HelsinkiA
Turku Neural Parser
Pipeline
Turku Neural Parser PipelineA tool developed by the Turku NLP group for parsing Finnish text.Install (GitHub)icon-question-circleUniversity of Turku
word2vecSemantic similarity of words (word2vec)A tool developed by the Turku NLP group for analyzing the semantic similarity of words.Documentationicon-question-circleUniversity of Turku
Finnish Internet Parsebank:
SETS
Syntax-based search (SETS)
from the Finnish Internet Parsebank
Syntax-based search (SETS)
from parts of the Finnish Internet Parsebank.
DocumentationUniversity of Turku
FinBERTFinBERTBERT model trained from scratch on Finnish.Install (GitHub)icon-question-circleUniversity of Turku
TexthammerTexthammerA search and analysis toolkit for parallel corpora provided by the University of Tampere.Documentation (PDF)icon-question-circleUniversity of Tampere
nimiarkisto.fiNimiarkistoNimiarkisto.fi is a portal with the most important digital resources of names and named entities collected from and archived in Finland.icon-question-circleInstitute for the Languages of Finland
Terminology ForumTerminology ForumTerminology Forum – A collection of links to special field glossaries, University of Vaasaicon-question-circleUniversity of Vaasa
SparvSparvA multilingual toolkit provided by the Swedish Språkbanken for parsing and annotating text in various languages.icon-question-circleSWE-CLARIN (Språkbanken)
TranskribusTranskribusA toolkit for transcribing and managing historical documents (e.g., images and scanned text).Instructions (PDF)Installicon-question-circleUniversity of Innsbruck
Aalto-ASRAalto University Automatic
Speech Recognition System
An automatic speech recognition toolkit that can be used in the CSC computing environment. Some features are available via the Mylly service.InstructionsInstall (GitHub)icon-question-circleAalto University
ELANELANELAN is a program for transcribing and annotating audio and video files. It can also be used for searching locally stored collections of annotated material.InstructionsInstallicon-question-circleThe Language Archive
PraatPraatPraat is a comprehensive toolkit for annotating, processing, analyzing and visualizing speech. Praat includes a scripting language.InstructionsInstallicon-question-circleUniversity of Amsterdam
CLARIN Federated Content SearchRun a centralized query from all the resources provided by CLARIN centers.icon-question-circleCLARIN ERIC
GephiGephiA program for network analysis and visualization.Install (GitHub)
LAT at the Language Bank of FinlandLAT (Language Archive Tools)A toolkit for browsing and querying annotated speech and video corpora.
NB: The LAT service will be discontinued as of 30 November 2020, see details.
InstructionsThe Language Bank of FinlandC
digi.kansalliskirjasto.fiDigital collectionsA search and download service for digital collections from the National Library of Finland. In addition to newspapers and magazines, the collections include, e.g., books, pictures and maps. Note that a large proportion of the newspapers and magazines can also be used via the Korp service in the Language Bank (see KLK).icon-question-circle
textreuse.sls.fiText reuse in the Swedish-language press, 1645-1918A search engine for searching and analyzing clusters of text reuse in the Swedish-language press from 1645 to 1918.icon-question-circle
FinnONTOFinnONTOFinnish and international ontologies, vocabularies and thesauri needed for publishing content cost-efficiently on the Semantic Web.icon-question-circle
Dictionaries of NeahttadigisánitDictionaries of NeahttadigisánitA collection of free digital dictionaries for small languages.icon-question-circle
Dictionary of Contemporary FinnishDictionary of Contemporary FinnishDictionary of standard Finnish made by the Institute for the Languages of Finland.icon-question-circle
Dictionary of Finnish dialectsDictionary of Finnish dialectsThe dictionary presents the entire vocabulary of all the Finnish dialects.icon-question-circleInstitute for the Languages of Finland
Dictionary of Old Literary FinnishDictionary of Old Literary FinnishThe dictionary presents all the words of the Finnish literary sources from 1543-1810.icon-question-circleInstitute for the Languages of Finland
Etymological Database of the Sami LanguagesEtymological Database of the Sami LanguagesEtymological database of the Saami languagesicon-question-circleInstitute for the Languages of Finland
Etymological Reference DatabaseEtymological Reference DatabaseReferences to texts on the etymology of Finnish words.icon-question-circleInstitute for the Languages of Finland
Names of Countries in Seven LanguagesNames of Countries in Seven LanguagesThe website contains the names of the independent states of the world and their geographically separate regions in 7 languages.icon-question-circleInstitute for the Languages of Finland
Frequencies of Early Modern Finnish WordsFrequencies of Early Modern Finnish WordsThe list includes the word forms included in the corpus of old literary Finnish of the Institute for the Languages of Finland together with their frequency information.icon-question-circleInstitute for the Languages of Finland
Frequencies of Old Literary Finnish WordsFrequencies of Old Literary Finnish WordsList of frequencies of old literary Finnish words and information about their frequency.icon-question-circleInstitute for the Languages of Finland
Frequency list of Written Finnish Word FormsFrequency list of Written Finnish Word FormsA ranked frequency list of Finnish word forms as they appear in the Finnish Parole text corpus of 17 million written tokens.icon-question-circleInstitute for the Languages of Finland
Headword List of the Karelian DictionaryHeadword List of the Karelian DictionaryLists compiled of the headwords of the Karelian dictionary released by the Institute for the Languages of Finland and Finno-Ugrian Society in 1968-2005.icon-question-circleInstitute for the Languages of Finland
Modern Finnish Word ListModern Finnish Word ListThe entries of the word list indicate the lemma and inflection type for basic words.icon-question-circleInstitute for the Languages of Finland

Vastaa