Code for NeurIPS 2024 Paper - Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass
☆21Aug 22, 2024Updated last year
Alternatives and similar repositories for SuperposedDecoding
Users that are interested in SuperposedDecoding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Experiments Notebook of "Understanding the Skill Gap in Recurrent Language Models: The Role of the Gather-and-Aggregate Mechanism"☆15Apr 30, 2025Updated last year
- ☆12Sep 7, 2024Updated last year
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆28Nov 11, 2025Updated 5 months ago
- Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"☆24Apr 30, 2025Updated last year
- ☆10Oct 28, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official codebase for permutation self-consistency.☆19Feb 11, 2024Updated 2 years ago
- ☆23Nov 6, 2022Updated 3 years ago
- LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation☆33Feb 26, 2026Updated 2 months ago
- [NeurIPS 2024 D&B] Evaluating Copyright Takedown Methods for Language Models☆17Jul 17, 2024Updated last year
- Conversational Language model toolkit for training against human preferences.☆41Apr 9, 2024Updated 2 years ago
- ☆21Jan 15, 2024Updated 2 years ago
- ☆12Apr 25, 2025Updated last year
- ☆15Jul 24, 2022Updated 3 years ago
- Text generation using language models with multiple exit heads☆16Sep 18, 2025Updated 7 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆14Oct 17, 2024Updated last year
- An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)☆33Jun 19, 2024Updated last year
- This repository is the official implementation of our EMNLP 2022 paper ELMER: A Non-Autoregressive Pre-trained Language Model for Efficie…☆26Oct 27, 2022Updated 3 years ago
- Corresponding code to "Improving Robustness of ML Classifiers against Realizable Evasion Attacks Using Conserved Features" @ USENIX Secur…☆11Aug 5, 2019Updated 6 years ago
- TBC☆28Nov 2, 2022Updated 3 years ago
- Source code for the frontend of chesshq.com☆11Apr 29, 2022Updated 4 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 6 months ago
- ☆13Oct 10, 2023Updated 2 years ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆32Oct 9, 2025Updated 6 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Corresponding code to "FACESEC: A Fine-grained Robustness Evaluation Framework for Face Recognition Systems" @ CVPR 2021☆13Jun 22, 2021Updated 4 years ago
- ☆14Jun 6, 2023Updated 2 years ago
- Conversational Recommender System Evaluation via Simulation☆19Apr 21, 2026Updated last week
- ☆19Mar 31, 2024Updated 2 years ago
- Test-time-training on nearest neighbors for large language models☆50Apr 18, 2024Updated 2 years ago
- ☆13Oct 14, 2020Updated 5 years ago
- A very limited implementation of arXiv:1904.00759☆13Dec 2, 2019Updated 6 years ago
- experiment☆12Jan 1, 2023Updated 3 years ago
- Official code for the paper "Attention as a Hypernetwork"☆56Feb 24, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Parallel Self-Adjusting Computation☆16Jul 5, 2021Updated 4 years ago
- Chess ELO Discord Bot☆11Aug 13, 2025Updated 8 months ago
- [ICML 2022] Official implementation of "Score-Guided Intermediate Layer Optimization: Fast Langevin Mixing for Inverse Problems".☆12Jul 19, 2022Updated 3 years ago
- Bayesian scaling laws for in-context learning.☆15Mar 12, 2025Updated last year
- Well documented examples of running distributed training jobs on Modal☆26Apr 24, 2026Updated last week
- ☆38Apr 17, 2024Updated 2 years ago
- ☆19Oct 12, 2024Updated last year