riotu-lab / aranizer

Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling
15Updated 3 months ago

Related projects

Alternatives and complementary repositories for aranizer