EleutherAI / tokengrams

Efficiently computing & storing token n-grams from large corpora
15Updated last month

Related projects

Alternatives and complementary repositories for tokengrams