Mylly category: Korp data | fi
Korp data in Mylly
TODO korp.csc.fi search interface
TODO JSON-form data either obtained through API in Mylly or saved in Korp and uploaded to Mylly (or some otherwise)
KWIC data
TODO Key Word in Context
TODO sentence-like fragments that matched a query; tokens have annotations and metadata as key-value pairs (positional, structural, all by name)
KWIC tools
TODO turn JSON-form concordance into data and meta relations that can be joined on the sentence number
TODO extract 2-grams, 3-grams, dependency triples of attribute combinations from sentences, also joinable to meta (if have sentence number?) (dependencies require the relevant annotation)
Other data
TODO Korp also provides other kinds of data sets; Mylly needs to become conversant