Mylly category: Korp data | fi

Korp data in Mylly

TODO korp.csc.fi search interface

TODO JSON-form data either obtained through API in Mylly or saved in Korp and uploaded to Mylly (or some otherwise)

KWIC data

TODO Key Word in Context

TODO sentence-like fragments that matched a query; tokens have annotations and metadata as key-value pairs (positional, structural, all by name)

KWIC tools

TODO turn JSON-form concordance into data and meta relations that can be joined on the sentence number

TODO extract 2-grams, 3-grams, dependency triples of attribute combinations from sentences, also joinable to meta (if have sentence number?) (dependencies require the relevant annotation)

Other data

TODO Korp also provides other kinds of data sets; Mylly needs to become conversant

See also

Hae Kielipankki-portaalista:
Tommi Kurki
Kuukauden tutkija: Tommi Kurki

 

Yhteystiedot

Kielipankin tekninen ylläpito:
kielipankki (ät) csc.fi
p. 09 4572001

Aineistoihin ja muuhun sisältöön liittyvät asiat:
fin-clarin (ät) helsinki.fi
p. 029 4144036 / 029 4129317