An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)
☆33Jun 19, 2024Updated last year
Alternatives and similar repositories for infini-gram
Users that are interested in infini-gram are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆106Jan 24, 2026Updated 4 months ago
- Code for the paper "Closing the Curious Case of Neural Text Degeneration"☆12Apr 9, 2025Updated last year
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated 4 months ago
- ☆14Dec 15, 2025Updated 5 months ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆42Oct 24, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆24Apr 30, 2025Updated last year
- Schedule free optimiser implemented in JAX using Optimistix☆15May 29, 2024Updated last year
- ☆19Dec 4, 2025Updated 5 months ago
- ☆23Nov 6, 2022Updated 3 years ago
- Efficiently computing & storing token n-grams from large corpora☆27Oct 6, 2024Updated last year
- Resa: Transparent Reasoning Models via SAEs☆49Sep 23, 2025Updated 8 months ago
- The fore client package☆13Jul 16, 2024Updated last year
- Code for the paper "Model Agnostic Interpretability for Multiple Instance Learning".☆13Jan 28, 2022Updated 4 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"☆59Jan 12, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Conversational Language model toolkit for training against human preferences.☆41Apr 9, 2024Updated 2 years ago
- ☆92Aug 18, 2024Updated last year
- ☆21Jan 15, 2024Updated 2 years ago
- This is a text based fantasy AI game☆13Dec 15, 2024Updated last year
- Code for NeurIPS 2024 Paper - Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass☆21Aug 22, 2024Updated last year
- An experiment workflow and organization tool.☆18Mar 11, 2026Updated 2 months ago
- Official Repository for Task-Circuit Quantization☆27Jun 1, 2025Updated 11 months ago
- MLIR backend for Nx☆14May 24, 2024Updated 2 years ago
- Weighted multiple-instance learning algorithm based on stochastic gradient descent☆12Feb 22, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Web app using Pyodide to demo different types of Scikit-learn classifiers☆12Apr 16, 2022Updated 4 years ago
- Offline-first, decentralized graph database of collaborative Web apps☆15May 12, 2017Updated 9 years ago
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.☆24Sep 24, 2023Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆99Apr 26, 2023Updated 3 years ago
- ☆138May 29, 2025Updated 11 months ago
- TBC☆28Nov 2, 2022Updated 3 years ago
- manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices☆10Jan 12, 2021Updated 5 years ago
- Implementation of Recurrent Hidden Semi-Markov Model http://www.cc.gatech.edu/~lsong/papers/DaiDaiZhaLietal17.pdf☆12Mar 31, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆12Mar 18, 2023Updated 3 years ago
- The first dense retrieval model that can be prompted like an LM☆92May 8, 2025Updated last year
- data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015☆43Nov 10, 2020Updated 5 years ago
- A few models converted from caffe to CoreMLs format.☆15Jun 6, 2017Updated 8 years ago
- A logical, reasonably standardized, but flexible project structure for conducting ml research 🍪☆19Apr 9, 2026Updated last month
- Weighted multiple-instance learning algorithm☆18Oct 9, 2018Updated 7 years ago
- Scratchpad/Chain-of-Thought Prompts☆12Jun 6, 2022Updated 3 years ago