EleutherAI / tokengrams
Efficiently computing & storing token n-grams from large corpora
☆19Updated 5 months ago
Alternatives and similar repositories for tokengrams:
Users that are interested in tokengrams are comparing it to the libraries listed below
- Training hybrid models for dummies.☆20Updated last month
- Minimum Description Length probing for neural network representations☆19Updated last month
- URL downloader supporting checkpointing and continuous checksumming.☆19Updated last year
- Submission to the inverse scaling prize☆23Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆31Updated last year