Mink

Suomeksi          

At kielipankki.fi/future/mink, a browser-based tool called Mink is available, where users logged in via Haka can upload their own text materials for processing. The file formats supported by Mink include plain text (UTF-8), XML (where the analysis pipeline preserves the structures), Microsoft Word (.docx), Open Document (.odt), PDF, and CoNLL-U.

You can perform advanced searches on your own text corpora within the Korp environment accessible through the Mink service. If necessary, texts can first be automatically parsed and annotated in Mink, which improves the search capabilities in Korp. For now, the Mink platform supports lemmatization (i.e., the analysis of the base forms of the words) as well as morphological and dependency-based syntactic analysis for Finnish, Swedish, and English text, and the recognition of named phrases in English text. In addition to using your corpus via Korp, you can also save the analyzed texts to your own computer.

With Mink, users can prepare, test, and explore their own Korp corpus. For now, only the user themselves can access the materials they have transferred to the Korp environment within Mink. At a later stage, the plan is to make it possible to share the data stored in Mink with the members of the user’s own research group, for example. Separate arrangements can also be made to make the finalized corpus available to other researchers through the public Korp service of the Language Bank. 

For now, more detailed instructions on how to use Mink can be found on the Swedish Språkbanken website. Please note that the Mink environment developed by Språkbanken has been slightly adapted for users of the Language Bank of Finland, so not all features work in exactly the same way in both Mink services.

The Mink platform is currently being further developed, and the Language Bank welcomes feedback on its functionality; see contact information.

Access Mink

Mink (Språkbanken Text)

 


This resource group page has a Persistent Identifier: http://urn.fi/urn:nbn:fi:lb-2026042421

Last modified on 2026-05-25