liutianlin0121 / decoding-time-realignment
Implementation of "Decoding-time Realignment of Language Models", ICML 2024.
☆18Updated 10 months ago
Alternatives and similar repositories for decoding-time-realignment:
Users that are interested in decoding-time-realignment are comparing it to the libraries listed below
- [NeurIPS 2024] Fast Best-of-N Decoding via Speculative Rejection☆42Updated 6 months ago
- Code for preprint "Metadata Conditioning Accelerates Language Model Pre-training (MeCo)"☆36Updated last month
- ☆98Updated 6 months ago
- ☆14Updated last year
- Codebase for Instruction Following without Instruction Tuning☆34Updated 7 months ago
- Long Context Extension and Generalization in LLMs☆53Updated 7 months ago
- Towards Systematic Measurement for Long Text Quality☆34Updated 7 months ago
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…☆51Updated 2 years ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆59Updated 9 months ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"☆53Updated last month
- Official Code Repository for [AutoScale–Automatic Prediction of Compute-optimal Data Compositions for Training LLMs]☆12Updated 2 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆74Updated 10 months ago
- Suri: Multi-constraint instruction following for long-form text generation (EMNLP’24)☆22Updated 5 months ago
- ☆18Updated 4 months ago
- ☆19Updated 2 years ago
- [NAACL 2025] A Closer Look into Mixture-of-Experts in Large Language Models