Unsupervised text tokenizer focused on computational efficiency
☆980Mar 29, 2024Updated 2 years ago
Alternatives and similar repositories for YouTokenToMe
Users that are interested in YouTokenToMe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fast BPE☆678Jun 18, 2024Updated 2 years ago
- Unsupervised text tokenizer for Neural Network-based text generation.☆11,925Updated this week
- A list of pretrained Transformer models for the Russian language.☆177Feb 3, 2020Updated 6 years ago
- Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)☆1,220Oct 1, 2024Updated last year
- Unsupervised Word Segmentation for Neural Machine Translation and Text Generation☆2,272Aug 7, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- PyTorch original implementation of Cross-lingual Language Model Pretraining.