The reproduct of the paper - Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction
☆22May 29, 2024Updated last year
Alternatives and similar repositories for aligner-replication
Users that are interested in aligner-replication are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆191Jan 16, 2025Updated last year
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated last year
- ☆29Oct 8, 2025Updated 5 months ago
- ☆27Aug 30, 2023Updated 2 years ago
- Reimplementation of https://github.com/montemac/algebraic_value_editing in pure PyTorch for efficiency on large models☆11Jun 28, 2023Updated 2 years ago
- Dataset for EMNLP'23 Paper "DocTrack: A Visually-Rich Document Dataset Really Aligned with Human Eye Movement for Machine Reading"☆11Oct 25, 2023Updated 2 years ago
- This project has included related source codes and datasets of our EMNLP2021 paper☆10May 28, 2022Updated 3 years ago
- ☆16Jun 14, 2023Updated 2 years ago
- ☆15May 22, 2025Updated 10 months ago
- Self-hosted LLM chatbot arena, with yourself as the only judge☆41Feb 6, 2024Updated 2 years ago
- ☆17Oct 18, 2022Updated 3 years ago
- ☆10Jan 16, 2024Updated 2 years ago
- The official implementation of Self-Exploring Language Models (SELM)☆63Jun 4, 2024Updated last year
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆23Mar 15, 2024Updated 2 years ago
- code for "Generative News Recommendation"☆15May 31, 2024Updated last year
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆33Jul 25, 2025Updated 7 months ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆56Jun 16, 2024Updated last year
- Beginner-friendly serverless LLM deployment with Replicate & fly.io☆13Sep 3, 2023Updated 2 years ago
- Improving Math reasoning through Direct Preference Optimization with Verifiable Pairs☆19Mar 20, 2025Updated last year
- ☆16Oct 5, 2022Updated 3 years ago
- Tidy autoregressive inference in JAX☆15Sep 1, 2025Updated 6 months ago
- Universal LLM Telegram chatbot in Python☆17Aug 16, 2024Updated last year
- ☆16Jun 18, 2022Updated 3 years ago
- ☆16Sep 30, 2023Updated 2 years ago
- ☆116Jan 21, 2025Updated last year
- ☆53Apr 17, 2022Updated 3 years ago
- The OlymMATH dataset☆24Jun 1, 2025Updated 9 months ago
- Official repository for ORPO☆473May 31, 2024Updated last year
- Accepted by ACL 2025☆30Aug 13, 2025Updated 7 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆116Feb 9, 2024Updated 2 years ago
- Framework for building VulkanScenGraph related projects together☆15Oct 7, 2024Updated last year
- ☆19Jul 16, 2020Updated 5 years ago
- Autonomous coding agent right in your IDE.☆12Jan 14, 2025Updated last year
- DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents☆24Aug 4, 2025Updated 7 months ago
- JAX implementation of Kolmogorov Arnold Networks (KANs).☆10May 7, 2024Updated last year
- Xwin-LM: Powerful, Stable, and Reproducible LLM Alignment☆1,038May 31, 2024Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Oct 27, 2024Updated last year
- ☆29Jan 23, 2024Updated 2 years ago
- Official repo for BWLer: Barycentric Weight Layer☆29Sep 26, 2025Updated 5 months ago