finnish-tokenize demo

Split running text into tokens.

Help

Entering input

You can provide input in one of three ways: enter text in the text box, choose a demo text, or upload a file. Supported upload formats are plain UTF-8 text (.txt) and, unless the formatting is especially convoluted, .pdf, .doc, .docx, .csv, .epub, .html, .odt, .rtf and .xls files.

Understanding output

The output is presented as one row per token, with empty rows marking sentence boundaries. Output may be downloaded as text or as a spreadsheet.
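
To work with the downloaded text outside the demo, the rows can be regrouped into sentences. The following is a minimal Python sketch, assuming the text download contains exactly one token per line and an empty line between sentences; the function name read_sentences and the sample string are illustrative and not part of the tool.

```python
def read_sentences(lines):
    """Group token rows into sentences, splitting on empty rows.

    Assumes one token per row and an empty row between sentences.
    """
    sentences = []
    current = []
    for line in lines:
        token = line.strip()
        if token:
            # Non-empty row: one token of the current sentence.
            current.append(token)
        elif current:
            # Empty row: the current sentence is complete.
            sentences.append(current)
            current = []
    if current:
        # Keep the last sentence if the text lacks a trailing empty row.
        sentences.append(current)
    return sentences


# Hypothetical sample in the assumed one-token-per-row layout.
sample = "Hei\n,\nmaailma\n!\n\nToinen\nvirke\n.\n"
for sentence in read_sentences(sample.splitlines()):
    print(" ".join(sentence))
```

To read a saved download instead of the sample string, open the file with encoding="utf-8" and pass the file object to read_sentences.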
