Transkribus is a comprehensive platform for the digitisation, AI-powered text recognition, transcription and searching of historical documents.
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021110305
The tool is developed by the Turku NLP group for analyzing the semantic similarity of words.
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021110304
WebAnno is a multi-user tool supporting different roles such as annotator, curator, and project manager. The progress and quality of annotation projects can be monitored and measured in terms of inter-annotator agreement. Multiple annotation projects can be conducted in parallel.
The Language Bank of Finland’s instance of WebAnno
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021110303
Mylly is a versatile data analysis platform with interactive visualizations and workflows. It can be used to build workflows with a variety of tools, including morphosyntactic parsing, character set conversion and speech recognition.
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021110302
Sparv is a multilingual text analysis toolkit provided by the Swedish Språkbanken for parsing and annotating text in various languages.
Latest Sparv release on GitHub
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021110301
A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more than 50 languages.
The pipeline is installed in CSC’s computing environment as a Singularity container for the languages Finnish, Swedish and English.
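The pipeline produces its analyses in the CoNLL-U format (ten tab-separated columns per token, blank lines between sentences, comment lines starting with #). The following is a minimal Python sketch for reading such output in a downstream script. It is not part of the pipeline itself: the function name read_conllu and the sample sentence are illustrative assumptions, and the sample was written by hand rather than produced by the parser.

```python
from typing import Dict, Iterator, List

# The ten column names defined by the CoNLL-U format.
CONLLU_FIELDS = [
    "id", "form", "lemma", "upos", "xpos",
    "feats", "head", "deprel", "deps", "misc",
]


def read_conllu(text: str) -> Iterator[List[Dict[str, str]]]:
    """Yield one sentence at a time as a list of token dictionaries."""
    sentence: List[Dict[str, str]] = []
    for line in text.splitlines():
        if not line.strip():
            # A blank line closes the current sentence.
            if sentence:
                yield sentence
                sentence = []
            continue
        if line.startswith("#"):
            # Comment lines carry sentence-level metadata; skipped here.
            continue
        columns = line.split("\t")
        if len(columns) != len(CONLLU_FIELDS):
            raise ValueError(f"expected 10 columns, got {len(columns)}: {line!r}")
        sentence.append(dict(zip(CONLLU_FIELDS, columns)))
    if sentence:
        # Handle input that does not end with a blank line.
        yield sentence


# Hand-written illustrative sample, NOT real parser output.
SAMPLE = "\n".join([
    "# text = Koira nukkuu.",
    "\t".join(["1", "Koira", "koira", "NOUN", "_", "Case=Nom|Number=Sing", "2", "nsubj", "_", "_"]),
    "\t".join(["2", "nukkuu", "nukkua", "VERB", "_", "Mood=Ind|Number=Sing|Person=3", "0", "root", "_", "SpaceAfter=No"]),
    "\t".join(["3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"]),
    "",
])

for tokens in read_conllu(SAMPLE):
    for token in tokens:
        print(token["form"], token["lemma"], token["upos"], token["head"], token["deprel"])
```

All values are kept as strings, so range IDs for multiword tokens (such as 1-2) pass through unchanged; convert head to an integer only where needed.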
Latest version:
Turku Neural Parser Pipeline Metadata and license | Access to Puhti
Look for all versions of this tool in META-SHARE
On Puhti you can see a list of all installed versions and languages using:
module use /appl/soft/ai/singularity/modulefiles/
module spider turku-neural-parser
For more information on this tool, see the following links:
GitHub source
Turku-neural-parser-pipeline manual
TurkuNLP DockerHub
CSC’s Singularity installation of the Turku Neural Parser
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021101102
This software package provides finnish-postag, a part-of-speech and morphology tagger for Finnish, and finnish-nertag, a named entity recogniser for Finnish.
This software is also installed in CSC’s computing environment (module load finnish-tagtools).
Both tools take running text from standard input and produce tabular output (one token per line) to standard output. See the --help messages for more details.
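Because both tools read standard input and write one token per line to standard output, they can be driven from a script. The following is a minimal Python sketch under stated assumptions: run_tagger is a hypothetical helper name, the command is assumed to be on the PATH (for example after module load finnish-tagtools), and the columns are assumed to be tab-separated, which should be checked against the tool's --help output.

```python
import subprocess
from typing import List, Sequence


def run_tagger(command: Sequence[str], text: str) -> List[List[str]]:
    """Send text to a tagger on stdin and return its output rows.

    Each non-empty line of stdout becomes one row, split into columns
    on tab characters (the separator is an assumption, see above).
    """
    result = subprocess.run(
        list(command),
        input=text,
        capture_output=True,
        text=True,
        encoding="utf-8",
        check=True,  # raise CalledProcessError if the tool exits with an error
    )
    return [line.split("\t") for line in result.stdout.splitlines() if line.strip()]


# Example call (requires the tool to be installed and on the PATH):
# rows = run_tagger(["finnish-postag"], "Koira nukkuu.")
# for row in rows:
#     print(row)
```

Passing the command as a list means the same helper can be pointed at finnish-nertag instead, or at a stand-in script when testing without the tools installed.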
An installer is provided in the form of a Makefile. More information can be found in the README file in the download folder.
Latest version:
Finnish Tagtools 1.5 Metadata and license
Download the resource
Look for all versions of this tool in META-SHARE
This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2021101101