Implementation of "Decoding-time Realignment of Language Models", ICML 2024.
☆21Jun 17, 2024Updated 2 years ago
Alternatives and similar repositories for decoding-time-realignment
Users that are interested in decoding-time-realignment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codebase for decoding compressed trust.☆27May 7, 2024Updated 2 years ago
- A method of ensemble learning for heterogeneous large language models.☆62Aug 7, 2024Updated last year
- The code used to train and run inference with MMDocIR☆34May 29, 2025Updated last year
- Test-time-training on nearest neighbors for large language models☆50Apr 18, 2024Updated 2 years ago
- ☆18Oct 16, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆24Mar 7, 2025Updated last year
- ☆13Mar 16, 2025Updated last year
- arXiv 2024 | ZIP: entropy-law data selection for efficient LLM alignment.☆28Jun 10, 2026Updated 2 weeks ago
- Multi-Candidate Speculative Decoding☆41Apr 22, 2024Updated 2 years ago
- Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation☆21Jun 11, 2025Updated last year
- PFLoRA-lib: Personalized Federated Learning with LoRA Algorithm Library focusing on privacy-protection, federated-learning, Citation, Ext…☆14Sep 19, 2024Updated last year
- Official implementation of DapperFL.☆13Oct 29, 2024Updated last year
- ☆64Apr 9, 2024Updated 2 years ago
- Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)☆47Dec 9, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.☆28Apr 15, 2025Updated last year
- [ICCV2025] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding☆60Apr 4, 2026Updated 2 months ago
- ☆12Oct 17, 2024Updated last year
- ☆16Jul 17, 2025Updated 11 months ago
- ☆37Sep 24, 2024Updated last year
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆70Mar 27, 2025Updated last year
- ☆31Feb 27, 2025Updated last year
- Video packaging platform - this will build a Docker with a web API that will let you upload, encrypt and serve videos as MPEG DASH files☆10Sep 6, 2020Updated 5 years ago
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…☆41Sep 9, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10Jan 18, 2024Updated 2 years ago
- Confidence-aware Personalized Federated Learning via Variational Expectation Maximization [Accepted at CVPR 2023]☆16Nov 8, 2023Updated 2 years ago
- A trainable user simulator☆34Jun 30, 2025Updated 11 months ago
- Official codes for "Understanding Deep Gradient Leakage via Inversion Influence Functions", NeurIPS 2023☆15Oct 13, 2023Updated 2 years ago
- Unofficial implementations of block/layer-wise pruning methods for LLMs.☆78Apr 29, 2024Updated 2 years ago
- ☆46Feb 8, 2024Updated 2 years ago
- SuperGS: Super-Resolution 3D Gaussian Splatting Enhanced by Variational Residual Features and Uncertainty-Augmented Learning☆11May 24, 2025Updated last year
- ☆36Jun 5, 2025Updated last year
- StyleSwin: Transformer-based GAN for High-resolution Image Generation☆11Dec 21, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆68Nov 4, 2024Updated last year
- Parallel Self-Adjusting Computation☆16Jul 5, 2021Updated 4 years ago
- ☆16Jul 29, 2025Updated 11 months ago
- Code for our NeurIPS 2024 paper Improved Generation of Adversarial Examples Against Safety-aligned LLMs☆12Nov 7, 2024Updated last year
- Disrupting Diffusion: Token-Level Attention Erasure Attack against Diffusion-based Customization(ACM MM2024)☆18Mar 31, 2025Updated last year
- [NeurIPS 2025] RESAnything: Attribute Prompting for Arbitrary Referring Segmentation☆17May 26, 2026Updated last month
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated 2 years ago