Implementation of "Decoding-time Realignment of Language Models", ICML 2024.
☆21Jun 17, 2024Updated last year
Alternatives and similar repositories for decoding-time-realignment
Users that are interested in decoding-time-realignment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Codebase for decoding compressed trust.☆27May 7, 2024Updated last year
- A method of ensemble learning for heterogeneous large language models.☆64Aug 7, 2024Updated last year
- The code used to train and run inference with MMDocIR☆33May 29, 2025Updated 11 months ago
- Test-time-training on nearest neighbors for large language models☆50Apr 18, 2024Updated 2 years ago
- ☆18Oct 16, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆23Mar 7, 2025Updated last year
- Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"☆11Oct 11, 2024Updated last year
- ☆28Jul 11, 2024Updated last year
- Multi-Candidate Speculative Decoding☆40Apr 22, 2024Updated 2 years ago
- Accelerate Video Diffusion Inference via Sketching-Rendering Cooperation☆20Jun 11, 2025Updated 10 months ago
- Official implementation of DapperFL.☆13Oct 29, 2024Updated last year
- Codes for our paper "Speculative Decoding: Exploiting Speculative Execution for Accelerating Seq2seq Generation" (EMNLP 2023 Findings)☆47Dec 9, 2023Updated 2 years ago
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.☆28Apr 15, 2025Updated last year
- direct preference optimization with only 1 model copy :)☆14Oct 2, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆14Sep 14, 2023Updated 2 years ago
- ☆12Oct 17, 2024Updated last year
- Understanding Self-Supervised Learning in a non-IID Setting☆21Oct 21, 2022Updated 3 years ago
- Exploiting Label Skew in Federated Learning with Model Concatenation (AAAI 2024)☆13Dec 16, 2023Updated 2 years ago
- ☆16Jul 17, 2025Updated 9 months ago
- ☆36Sep 24, 2024Updated last year
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆68Mar 27, 2025Updated last year
- ☆30Feb 27, 2025Updated last year
- Video packaging platform - this will build a Docker with a web API that will let you upload, encrypt and serve videos as MPEG DASH files☆10Sep 6, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…☆41Sep 9, 2025Updated 7 months ago
- Cost-Sensitive Toolpath Agent for Multi-turn Image Editing☆31Mar 26, 2025Updated last year
- Confidence-aware Personalized Federated Learning via Variational Expectation Maximization [Accepted at CVPR 2023]☆16Nov 8, 2023Updated 2 years ago
- Unofficial implementations of block/layer-wise pruning methods for LLMs.☆78Apr 29, 2024Updated 2 years ago
- Official codes for "Understanding Deep Gradient Leakage via Inversion Influence Functions", NeurIPS 2023☆15Oct 13, 2023Updated 2 years ago
- ☆46Feb 8, 2024Updated 2 years ago
- ☆28May 24, 2025Updated 11 months ago
- ☆33Jul 4, 2024Updated last year
- SuperGS: Super-Resolution 3D Gaussian Splatting Enhanced by Variational Residual Features and Uncertainty-Augmented Learning☆11May 24, 2025Updated 11 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Dec 13, 2022Updated 3 years ago
- Code accompanying our paper "Finding trainable sparse networks through Neural Tangent Transfer" to be published at ICML-2020.☆13Jun 14, 2020Updated 5 years ago
- ☆35Jun 5, 2025Updated 10 months ago
- StyleSwin: Transformer-based GAN for High-resolution Image Generation☆11Dec 21, 2021Updated 4 years ago
- ☆68Nov 4, 2024Updated last year
- Parallel Self-Adjusting Computation☆16Jul 5, 2021Updated 4 years ago
- Code for paper "Fast and Complete: Enabling Complete Neural Network Verification with Rapid and Massively Parallel Incomplete Verifiers"☆17Jan 27, 2023Updated 3 years ago