☆101Jan 24, 2026Updated last month
Alternatives and similar repositories for infini-gram
Users that are interested in infini-gram are comparing it to the libraries listed below
Sorting:
- ☆96Dec 19, 2025Updated 2 months ago
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆33Jun 19, 2024Updated last year
- ☆22Apr 2, 2025Updated 11 months ago
- A language server implementation for pysen☆10Nov 14, 2021Updated 4 years ago
- Chunk Dedupe Estimation☆20Nov 5, 2024Updated last year
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- ☆18Feb 19, 2024Updated 2 years ago
- Awesome List of Sources of Japanese Censored Words☆19Sep 11, 2022Updated 3 years ago
- Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]☆18Apr 14, 2025Updated 10 months ago
- Code for "Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem" (NAACL 2022)☆111May 14, 2025Updated 9 months ago
- Understanding how features learned by neural networks evolve throughout training☆41Oct 24, 2024Updated last year
- ☯️ AllenNLP training configurations for promising models on Named Entity Recognition. (BiLSTM-CRF, BiLSTM-CNN-CRF, BERT, BERT-CRF)☆15Nov 26, 2020Updated 5 years ago
- Experiments for efforts to train a new and improved t5☆76Apr 15, 2024Updated last year
- Learning to Model Editing Processes☆26Aug 3, 2025Updated 7 months ago
- Extracts plain text, language identification and more metadata from WARC records☆23Oct 1, 2025Updated 5 months ago
- Collection of academic works in natural language processing, computational linguistics, and computational cognitive science that study th…☆22Mar 20, 2024Updated last year
- Code for "FactKB: Generalizable Factuality Evaluation using Language Models Enhanced with Factual Knowledge". EMNLP 2023.☆20Dec 25, 2023Updated 2 years ago
- Hugging Face and Pyserini interoperability☆19May 18, 2023Updated 2 years ago
- 🖋 Resource and Tool for Writing System Identification (Unicode 17.0) -- LREC 2024☆21Feb 17, 2026Updated 2 weeks ago
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.☆184May 13, 2022Updated 3 years ago
- ☆22Sep 2, 2025Updated 6 months ago
- sequence tagging for NER for ULMFiT☆20Nov 4, 2020Updated 5 years ago
- 🚀 A demonstration of hyperparameter optimization using Optuna for models implemented with AllenNLP.☆16Nov 28, 2020Updated 5 years ago
- Reinforcement learning (RL) is an effective method to find reasoning pathways in incomplete knowledge graphs (KGs). To overcome the chall…☆24Oct 13, 2024Updated last year
- ☆23Jan 7, 2025Updated last year
- ☆23Nov 6, 2022Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆28Apr 17, 2024Updated last year
- ☆19May 23, 2024Updated last year
- The geometry of multilingual language model representations (EMNLP 2022).☆22Oct 21, 2022Updated 3 years ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆64Jul 6, 2025Updated 8 months ago
- Pipeline parallelism for the minimalist☆41Aug 6, 2025Updated 7 months ago
- ❄️ Nix-based dotfiles, claude code configs, and system settings for macOS & NixOS, which makes everyday software development fun!☆32Updated this week
- Ongoing research training transformer models at scale☆43Updated this week
- The "GPT-API-Accelerate" project provides a set of Python classes for accelerating the process of generating responses to prompts using t…☆23Oct 12, 2024Updated last year
- allennlp-light is a port of AllenNLP's core modules and nn portions into a standalone package with minimum dependencies☆56Oct 12, 2022Updated 3 years ago
- Python library for building and running distributed data pipelines using Ray☆54Dec 16, 2025Updated 2 months ago
- Utilities and boilerplate code to use wandb with allennlp☆21May 22, 2023Updated 2 years ago
- Code for SaGe subword tokenizer (EACL 2023)☆27Nov 30, 2024Updated last year
- Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.☆26Jun 20, 2023Updated 2 years ago