clipperhouse / uax29

A tokenizer based on Unicode text segmentation (UAX #29), for Go. Split words, sentences and graphemes.
54 · Updated 4 months ago

Alternatives and similar repositories for uax29:

Users interested in uax29 are comparing it to the libraries listed below.