csebuetnlp / normalizer

This Python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batch filtering, and new datasets for Bengali-English machine translation". It is intended for normalizing and cleaning Bengali and English text.
35 · Updated 9 months ago
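
To make concrete what "normalizing / cleaning" text can mean, here is a minimal sketch that uses only the Python standard library. It is not the module's implementation or API: `basic_normalize` is a hypothetical helper, and the choice of NFKC plus whitespace collapsing is an assumption made purely for illustration.

```python
import re
import unicodedata


def basic_normalize(text: str) -> str:
    """Hypothetical helper: a rough illustration of text normalization.

    This is NOT the normalizer module's code; it only shows the general idea.
    """
    # Unify Unicode representations, so that visually identical strings
    # (e.g. precomposed vs. decomposed Bengali letters, full-width Latin)
    # end up as the same code point sequence.
    text = unicodedata.normalize("NFKC", text)
    # Collapse runs of whitespace (including newlines and tabs) to one space.
    text = re.sub(r"\s+", " ", text)
    # Drop leading and trailing whitespace.
    return text.strip()


if __name__ == "__main__":
    print(basic_normalize("Hello\u00a0   world \n"))
    # The two inputs below are different encodings of the same Bengali letter;
    # after normalization they compare equal.
    print(basic_normalize("\u09df") == basic_normalize("\u09af\u09bc"))
```

A real normalizer for Bengali and English would likely handle more cases (punctuation variants, URLs, emoji, and so on); consult the repository itself for what the module actually does.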

Alternatives and similar repositories for normalizer:

Users who are interested in normalizer compare it to the libraries listed below.