proycon / python-uctoView on GitHub
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is regular-expression based, extensible, and advanced tokeniser writt…
31Feb 2, 2026Updated 3 weeks ago

Alternatives and similar repositories for python-ucto

Users that are interested in python-ucto are comparing it to the libraries listed below

Sorting:

Are these results useful?