An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)
☆33Jun 19, 2024Updated last year
Alternatives and similar repositories for infini-gram
Users that are interested in infini-gram are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆101Jan 24, 2026Updated 2 months ago
- Code for the paper "Closing the Curious Case of Neural Text Degeneration"☆12Apr 9, 2025Updated 11 months ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆41Oct 17, 2023Updated 2 years ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 2 months ago
- ☆13Dec 15, 2025Updated 3 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆42Oct 24, 2023Updated 2 years ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆23Apr 30, 2025Updated 10 months ago
- Schedule free optimiser implemented in JAX using Optimistix☆15May 29, 2024Updated last year
- ☆19Dec 4, 2025Updated 3 months ago
- ☆23Nov 6, 2022Updated 3 years ago
- Efficiently computing & storing token n-grams from large corpora☆27Oct 6, 2024Updated last year
- XAI based human-in-the-loop framework for automatic rule-learning.☆49Jul 7, 2024Updated last year
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 9 months ago
- Resa: Transparent Reasoning Models via SAEs☆48Sep 23, 2025Updated 6 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Meta-learning inductive biases in the form of useful conserved quantities.☆40Nov 19, 2022Updated 3 years ago
- Code for the paper "Model Agnostic Interpretability for Multiple Instance Learning".☆13Jan 28, 2022Updated 4 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- Conversational Language model toolkit for training against human preferences.☆41Apr 9, 2024Updated last year
- ☆21Jan 15, 2024Updated 2 years ago
- Official Repository for Task-Circuit Quantization☆24Jun 1, 2025Updated 9 months ago
- An experiment workflow and organization tool.☆18Mar 11, 2026Updated 2 weeks ago
- Library for the Zotero API☆15Jan 15, 2024Updated 2 years ago
- MLIR backend for Nx☆14May 24, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Weighted multiple-instance learning algorithm based on stochastic gradient descent☆12Feb 22, 2019Updated 7 years ago
- Web app using Pyodide to demo different types of Scikit-learn classifiers☆12Apr 16, 2022Updated 3 years ago
- A LNCS template for typst☆15Jan 26, 2026Updated 2 months ago
- Finding semantically meaningful and accurate prompts.☆47Oct 30, 2023Updated 2 years ago
- Offline-first, decentralized graph database of collaborative Web apps☆15May 12, 2017Updated 8 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated 10 months ago
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Sep 24, 2023Updated 2 years ago
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 8 months ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆12Jan 12, 2021Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- TBC☆28Nov 2, 2022Updated 3 years ago
- ☆12Mar 20, 2026Updated last week
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 3 years ago
- A multiprocessing-friendly Python mock object☆10Aug 31, 2017Updated 8 years ago
- coloring terminal text with intensities (used for plotting probability, entropy with tokens)☆12Oct 11, 2024Updated last year
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆43Nov 10, 2025Updated 4 months ago
- A few models converted from caffe to CoreMLs format.☆15Jun 6, 2017Updated 8 years ago