AlexWan0 / infini-gramView external linksLinks
An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)
☆33Jun 19, 2024Updated last year
Alternatives and similar repositories for infini-gram
Users that are interested in infini-gram are comparing it to the libraries listed below
Sorting:
- ☆13Dec 15, 2025Updated last month
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling☆42Dec 29, 2025Updated last month
- ☆97Jan 24, 2026Updated 2 weeks ago
- Schedule free optimiser implemented in JAX using Optimistix☆15May 29, 2024Updated last year
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆42Oct 24, 2023Updated 2 years ago
- Scaling Sparse Fine-Tuning to Large Language Models☆18Jan 31, 2024Updated 2 years ago
- ☆23Jan 27, 2025Updated last year
- TorrentPier. Alternative compiled announcer (Ocelot)☆22Jul 12, 2024Updated last year
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 8 months ago
- Official Repository for Task-Circuit Quantization☆24Jun 1, 2025Updated 8 months ago
- ☆15Apr 26, 2022Updated 3 years ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆30Nov 12, 2024Updated last year
- Scalable and Stable Parallelization of Nonlinear RNNS☆28Oct 21, 2025Updated 3 months ago
- ☆91Aug 18, 2024Updated last year
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- Official repository of Sparse ISO-FLOP Transformations for Maximizing Training Efficiency☆25Jul 31, 2024Updated last year
- Resa: Transparent Reasoning Models via SAEs☆47Sep 23, 2025Updated 4 months ago
- Efficiently computing & storing token n-grams from large corpora☆26Oct 6, 2024Updated last year
- Code for Principal Masked Autoencoders☆30Feb 4, 2026Updated last week
- Raku library for the Mustache template format☆23Oct 11, 2022Updated 3 years ago
- A fast implementation of T5/UL2 in PyTorch using Flash Attention☆113Oct 30, 2025Updated 3 months ago
- Few-shot Learning with Auxiliary Data☆31Dec 8, 2023Updated 2 years ago
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆65May 14, 2023Updated 2 years ago
- How to use the Flax Linen API to build a convolutional neural network model and train it for image classification (using TensorFlow Datas…☆25Aug 16, 2023Updated 2 years ago
- Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More☆34May 17, 2025Updated 8 months ago
- SemClinBr - a multi-institutional and multi-specialty semantically annotated corpus for Portuguese clinical NLP tasks☆30Mar 12, 2024Updated last year
- Experiments for efforts to train a new and improved t5☆76Apr 15, 2024Updated last year
- Official Implementation of APB (ACL 2025 main Oral) and Spava.☆32Jan 30, 2026Updated 2 weeks ago
- This repository contains the code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron …☆33Jun 14, 2023Updated 2 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆40Mar 2, 2023Updated 2 years ago
- A framework for adversarial attacks against token classification models☆33Nov 6, 2021Updated 4 years ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆39Oct 17, 2023Updated 2 years ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"☆251Jan 31, 2025Updated last year
- ☆42Jun 15, 2023Updated 2 years ago
- ☆41Jun 19, 2024Updated last year
- ☆16Jul 23, 2023Updated 2 years ago
- Redis distributed lock implementation for Python based on Pub/Sub messaging☆11Nov 15, 2025Updated 2 months ago