Korp data in Mylly

TODO search interface

TODO JSON-form data, either obtained through the API from within Mylly, or saved in Korp and uploaded to Mylly (or obtained in some other way)

KWIC data

TODO Key Word in Context

TODO sentence-like fragments that matched a query; tokens carry annotations and metadata as key-value pairs (positional attributes, structural attributes, all referred to by name)
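
To make the shape of such data concrete, here is a small hand-written illustration of what one KWIC hit might look like once the JSON has been parsed in Python. The key names ("match", "structs", "tokens") and the attribute names ("word", "lemma", "pos", "text_title", "sentence_id") are assumptions made for this sketch and should be checked against the actual Korp output; the values are invented.

```python
# A hypothetical, hand-written KWIC hit; not real Korp output.
hit = {
    # Assumed: the matched span as token offsets (start inclusive, end exclusive).
    "match": {"start": 1, "end": 2},
    # Assumed: structural attributes (metadata) as name-value pairs.
    "structs": {"text_title": "Example text", "sentence_id": "7"},
    # Assumed: one dict per token, positional attributes as name-value pairs.
    "tokens": [
        {"word": "The", "lemma": "the", "pos": "DET"},
        {"word": "cat", "lemma": "cat", "pos": "NOUN"},
        {"word": "sleeps", "lemma": "sleep", "pos": "VERB"},
    ],
}

# The key word in context: the tokens inside the matched span.
start, end = hit["match"]["start"], hit["match"]["end"]
key_words = [token["word"] for token in hit["tokens"][start:end]]
```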

KWIC tools

TODO turn JSON-form concordance into data and meta relations that can be joined on the sentence number
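
A minimal sketch of such a conversion, assuming the concordance has a top-level "kwic" list whose hits each contain "match", "structs" and "tokens" (these key names are assumptions to be verified against real data). The function name kwic_to_relations and the column names "sen", "tok" and "match" are made up for this illustration and are not existing Mylly tools.

```python
def kwic_to_relations(concordance):
    """Split a parsed JSON concordance into a data and a meta relation.

    Returns (data_rows, meta_rows): one data row per token, one meta row
    per sentence. Both carry the sentence number "sen" as the join key.
    """
    data_rows = []
    meta_rows = []
    for sen, hit in enumerate(concordance.get("kwic", []), start=1):
        # Meta relation: structural attributes of the sentence.
        meta_row = dict(hit.get("structs", {}))
        meta_row["sen"] = sen
        meta_rows.append(meta_row)

        match = hit.get("match", {})
        start, end = match.get("start", 0), match.get("end", 0)
        for tok, token in enumerate(hit.get("tokens", []), start=1):
            # Data relation: keep only flat (scalar) token attributes.
            row = {name: value for name, value in token.items()
                   if not isinstance(value, (dict, list))}
            # The bookkeeping columns overwrite same-named attributes.
            row["sen"] = sen
            row["tok"] = tok
            row["match"] = start <= tok - 1 < end
            data_rows.append(row)
    return data_rows, meta_rows


# Usage with a tiny invented concordance.
concordance = {"kwic": [{
    "match": {"start": 1, "end": 2},
    "structs": {"text_title": "Example"},
    "tokens": [{"word": "The", "pos": "DET"},
               {"word": "cat", "pos": "NOUN"},
               {"word": "sleeps", "pos": "VERB"}],
}]}
data, meta = kwic_to_relations(concordance)
```

With real data, the parsed concordance would come from something like json.load on the saved file; the two lists of rows could then be written out as tab-separated relations.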

TODO extract 2-grams, 3-grams, and dependency triples of attribute combinations from sentences, also joinable to meta (if they have the sentence number?) (dependencies require the relevant annotation)
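
The extraction could be sketched as follows, assuming a data relation given as a list of dicts ordered by sentence, each with a sentence number "sen". The function names and the "sen" and "tok" columns are invented for this sketch; the dependency attribute names "ref", "dephead" and "deprel" are assumptions about the annotation and may differ between corpora. Keeping "sen" in every output row is what makes the result joinable to meta.

```python
from itertools import groupby
from operator import itemgetter


def ngrams(data_rows, n, attrs=("word",)):
    """Extract n-grams of attribute combinations, sentence by sentence.

    Rows must be grouped by "sen" and in token order within a sentence.
    """
    result = []
    for sen, group in groupby(data_rows, key=itemgetter("sen")):
        items = [tuple(row.get(a) for a in attrs) for row in group]
        for i in range(len(items) - n + 1):
            result.append({"sen": sen, "tok": i + 1,
                           "ngram": tuple(items[i:i + n])})
    return result


def dependency_triples(data_rows, attr="word",
                       ref="ref", head="dephead", rel="deprel"):
    """Extract (head, relation, dependent) triples within each sentence.

    Tokens without the dependency annotation, and tokens whose head is
    not a token of the same sentence (such as the root), are skipped.
    """
    result = []
    for sen, group in groupby(data_rows, key=itemgetter("sen")):
        tokens = list(group)
        by_ref = {str(t[ref]): t for t in tokens if ref in t}
        for t in tokens:
            if head not in t:
                continue
            parent = by_ref.get(str(t[head]))
            if parent is None:
                continue
            result.append({"sen": sen, "head": parent.get(attr),
                           "rel": t.get(rel), "dep": t.get(attr)})
    return result


# Usage with invented rows.
rows = [
    {"sen": 1, "word": "The", "ref": "1", "dephead": "2", "deprel": "det"},
    {"sen": 1, "word": "cat", "ref": "2", "dephead": "3", "deprel": "nsubj"},
    {"sen": 1, "word": "sleeps", "ref": "3", "dephead": "0", "deprel": "root"},
    {"sen": 2, "word": "Hello", "ref": "1", "dephead": "0", "deprel": "root"},
]
bigrams = ngrams(rows, 2)
triples = dependency_triples(rows)
```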

Other data

TODO Korp also provides other kinds of data sets; Mylly needs to become conversant with them

