Code for NeurIPS 2024 Paper - Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass
☆21Aug 22, 2024Updated last year
Alternatives and similar repositories for SuperposedDecoding
Users that are interested in SuperposedDecoding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆21Sep 2, 2025Updated 9 months ago
- ArxivDaily☆13Jun 24, 2026Updated last week
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆29Nov 11, 2025Updated 7 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆24Apr 30, 2025Updated last year
- ☆10Oct 28, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code & Data for our Paper "RobustGEC: Robust Grammatical Error Correction Against Subtle Context Perturbation" (EMNLP 2023)☆17Jan 23, 2024Updated 2 years ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆36Feb 26, 2026Updated 4 months ago
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models☆17Jul 17, 2024Updated last year
- Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs☆13Feb 13, 2024Updated 2 years ago
- A recipe for constituency parsing, disfluency tagging and obtaining the fluent transcripts of English Fisher dataset☆13May 2, 2021Updated 5 years ago
- ☆14Oct 17, 2024Updated last year
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆33Jun 19, 2024Updated 2 years ago
- Test equality between a black-box LLM API and a reference distribution☆18Oct 29, 2024Updated last year
- Source code for the frontend of chesshq.com☆11Apr 29, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- code for paper "Discerning and Resolving Knowledge Conflicts through Adaptive Decoding with Contextual Information-Entropy Constraint"☆11Sep 29, 2024Updated last year
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 8 months ago
- ☆13Oct 10, 2023Updated 2 years ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆32Oct 9, 2025Updated 8 months ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆41Oct 17, 2023Updated 2 years ago
- ☆13Oct 14, 2020Updated 5 years ago
- A very limited implementation of arXiv:1904.00759☆13Dec 2, 2019Updated 6 years ago
- Repoformer: Selective Retrieval for Repository-Level Code Completion (ICML 2024)☆67Jun 17, 2025Updated last year
- experiment☆12Jan 1, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICML 2022] Official implementation of "Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems".☆12Jul 19, 2022Updated 3 years ago
- This project allows you to plug in a GitHub repository URL, generate vectors for a LLM and use ChatGPT models to interact. The main frame…☆19Jun 4, 2023Updated 3 years ago
- Official code for the paper "Attention as a Hypernetwork"☆57Feb 24, 2026Updated 4 months ago
- Well documented examples of running distributed training jobs on Modal☆29Jun 24, 2026Updated last week
- ☆19Oct 12, 2024Updated last year
- Private Adaptive Optimization with Side Information (ICML '22)☆16Jun 23, 2022Updated 4 years ago
- Tree-Based Diffusion Schrödinger Bridge with Applications to Wasserstein Barycenters☆10Mar 5, 2024Updated 2 years ago
- ☆39Apr 17, 2024Updated 2 years ago
- [ICML 2023] Official repository of paper: Dividing and Conquering a BlackBox to a Mixture of Interpretable Models: Route, Interpret, Repe…☆25Aug 2, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 🚧 SimpleXMQ - JavaScript SMP protocol client and agent 🏗☆13Jan 4, 2022Updated 4 years ago
- Nix-friendly fork of: Optimized Stable Diffusion modified to run on lower GPU VRAM☆10Sep 11, 2022Updated 3 years ago
- ☆20Mar 19, 2023Updated 3 years ago
- LLM verified with Monte Carlo Tree Search☆10Nov 15, 2023Updated 2 years ago
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)☆67Sep 28, 2024Updated last year
- Python interface for SPAMS (SPArse Modeling Software)☆21Oct 24, 2024Updated last year