Efficiently computing & storing token n-grams from large corpora
☆27Oct 6, 2024Updated last year
Alternatives and similar repositories for tokengrams
Users that are interested in tokengrams are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pile Deduplication Code☆18May 15, 2023Updated 2 years ago
- ☆18Mar 31, 2026Updated last week
- ☆17Aug 30, 2025Updated 7 months ago
- ☆14Jul 7, 2024Updated last year
- Engine for collecting, uploading, and downloading model activations☆28Apr 2, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Mapping out the "memory" of neural nets with data attribution☆52Updated this week
- Scalable Kubernetes-native implementation of the Open Data Fabric protocol for global collaborative data processing☆23Mar 28, 2026Updated 2 weeks ago
- Landing page for MIB: A Mechanistic Interpretability Benchmark☆24Aug 15, 2025Updated 7 months ago
- A library for mechanistic anomaly detection☆22Jan 9, 2025Updated last year
- ☆44Nov 17, 2024Updated last year
- Minimum Description Length probing for neural network representations☆20Jan 28, 2025Updated last year
- Explorations into the proposed SDFT, Self-Distillation Enables Continual Learning, from Shenfeld et al. of MIT☆30Feb 6, 2026Updated 2 months ago
- See https://github.com/cuda-mode/triton-index/ instead!☆11May 8, 2024Updated last year
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Sep 13, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆25Feb 27, 2023Updated 3 years ago
- Fork of kingoflolz/mesh-transformer-jax with memory usage optimizations and support for GPT-Neo, GPT-NeoX, BLOOM, OPT and fairseq dense L…☆22Nov 14, 2022Updated 3 years ago
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆33Jun 19, 2024Updated last year