AlexWan0/infini-gram

AlexWan0 / infini-gram

An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)

☆33

Alternatives and similar repositories for infini-gram

Users who are interested in infini-gram are comparing it to the libraries listed below.

liujch1998 / infini-gram
☆113 · Jan 24, 2026 · Updated 6 months ago
gmftbyGMFTBY / Rep-Dropout
[NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
☆41 · Oct 17, 2023 · Updated 2 years ago
samblouir / birdie
☆15 · Jun 8, 2026 · Updated last month
GSYfate / knnlm-limits
Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"
☆24 · Apr 30, 2025 · Updated last year
alexa / ramen
A software for transferring pre-trained English models to foreign languages
☆20 · Mar 20, 2023 · Updated 3 years ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆43 · Dec 29, 2025 · Updated 7 months ago
ethansmith2000 / TransformerExperiments
☆19 · Dec 4, 2025 · Updated 7 months ago
EleutherAI / tokengrams
Efficiently computing & storing token n-grams from large corpora
☆28 · Jun 15, 2026 · Updated last month
ghrua / NgramRes
☆23 · Nov 6, 2022 · Updated 3 years ago
edwardmilsom / function-space-learning-rates-paper
Code for the paper "Function-Space Learning Rates"
☆23 · Jun 3, 2025 · Updated last year
stephenkyang / mean-reversion-pairs-trading
manipulating cointegrated pairs to achieve a market-neutral strategy that outperforms indices
☆11 · Jan 12, 2021 · Updated 5 years ago
JAEarly / MILLI
Code for the paper "Model Agnostic Interpretability for Multiple Instance Learning".
☆13 · Jan 28, 2022 · Updated 4 years ago
foreai-co / fore
The fore client package
☆13 · Jul 16, 2024 · Updated 2 years ago
cloudygoose / blindspot_nlg
☆21 · Jan 15, 2024 · Updated 2 years ago
harubaru / convogpt
Conversational Language model toolkit for training against human preferences.
☆42 · Apr 9, 2024 · Updated 2 years ago
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆59 · Jan 12, 2023 · Updated 3 years ago
myrho / bright-db
Offline-first, decentralized graph database of collaborative Web apps
☆15 · May 12, 2017 · Updated 9 years ago
beaver-lodge / manx
MLIR backend for Nx
☆14 · May 24, 2024 · Updated 2 years ago
idiap / wmil-sgd
Weighted multiple-instance learning algorithm based on stochastic gradient descent
☆12 · Feb 22, 2019 · Updated 7 years ago
alipala / ai_fantasy_rpg
This is a text based fantasy AI game
☆14 · Dec 15, 2024 · Updated last year
csinva / cookiecutter-ml-research
A logical, reasonably standardized, but flexible project structure for conducting ml research 🍪
☆19 · Apr 9, 2026 · Updated 3 months ago
thakur-nandan / income
INCOME: An Easy Repository for Training and Evaluation of Index Compression Methods in Dense Retrieval. Includes BPR and JPQ.
☆24 · Sep 24, 2023 · Updated 2 years ago
incjung / cl-swagger-codegen
lisp code generator for swagger
☆10 · Jan 12, 2019 · Updated 7 years ago
joeljang / ELM
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
☆99 · Apr 26, 2023 · Updated 3 years ago
hjian42 / CommunityLM
[COLING 2022]: CommunityLM: Probing Partisan Worldviews from Language Models
☆14 · Jan 31, 2023 · Updated 3 years ago
swj0419 / kNN_prompt
TBC
☆28 · Nov 2, 2022 · Updated 3 years ago
simsal0r / mixture-of-decision-trees
Mixture of Decision Trees for Interpretable Machine Learning
☆11 · Sep 2, 2021 · Updated 4 years ago
ORNL / curifactory
An experiment workflow and organization tool.
☆18 · Jul 10, 2026 · Updated 2 weeks ago
gandalfnicolas / SADCAT
☆13 · Jun 18, 2026 · Updated last month
Hanjun-Dai / r-hsmm
Implementation of Recurrent Hidden Semi-Markov Model http://www.cc.gatech.edu/~lsong/papers/DaiDaiZhaLietal17.pdf
☆13 · Mar 31, 2019 · Updated 7 years ago
yikee / FLIP
Small Reward Models via Backward Inference
☆21 · May 25, 2026 · Updated 2 months ago
da03 / criticize_text_generation
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆12 · Mar 18, 2023 · Updated 3 years ago
orionw / promptriever
The first dense retrieval model that can be prompted like an LM
☆93 · May 8, 2025 · Updated last year
angrysky56 / llada_gui
GUI for LLaDA Diffusion LLM with Quantization for low end GPU and CPU options.
☆25 · Mar 7, 2025 · Updated last year
IBPA / LOVE
Learning Ontologies Via Embeddings
☆12 · Jul 6, 2023 · Updated 3 years ago
dallascard / DWAC
Deep Weighted Averaging Classifiers
☆22 · Feb 4, 2019 · Updated 7 years ago
cocoxu / SemEval-PIT2015
data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015
☆43 · Nov 10, 2020 · Updated 5 years ago
swairshah / Intensify
coloring terminal text with intensities (used for plotting probability, entropy with tokens)
☆12 · Oct 11, 2024 · Updated last year
mdering / CoreMLZoo
A few models converted from caffe to CoreMLs format.
☆15 · Jun 6, 2017 · Updated 9 years ago