The reproduct of the paper - Aligner: Achieving Efficient Alignment through Weak-to-Strong Correction
☆21May 29, 2024Updated 2 years ago
Alternatives and similar repositories for aligner-replication
Users that are interested in aligner-replication are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆193Jan 16, 2025Updated last year
- Trust Region Preference Approximation: A simple and stable reinforcement learning algorithm for LLM reasoning☆15Jun 28, 2025Updated last year
- [EMNLP2023]: MIRACLE: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute Control☆12Nov 11, 2023Updated 2 years ago
- The code and data for the paper JiuZhang3.0☆49May 26, 2024Updated 2 years ago
- Official implementation of "OffsetBias: Leveraging Debiased Data for Tuning Evaluators"☆26Sep 11, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Nov 11, 2022Updated 3 years ago
- ☆17Oct 18, 2022Updated 3 years ago
- Complexity Based Prompting for Multi-Step Reasoning☆17Mar 10, 2023Updated 3 years ago
- ☆10Jan 16, 2024Updated 2 years ago
- The official implementation of Self-Exploring Language Models (SELM)☆63Jun 4, 2024Updated 2 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆23Mar 15, 2024Updated 2 years ago
- 4 bits quantization of LLaMa using GPTQ☆12Jun 2, 2023Updated 3 years ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆33Jul 25, 2025Updated 11 months ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆56Jun 16, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Beginner-friendly serverless LLM deployment with Replicate & fly.io☆13Sep 3, 2023Updated 2 years ago
- Improving Math reasoning through Direct Preference Optimization with Verifiable Pairs☆20Mar 20, 2025Updated last year
- ☆16Oct 5, 2022Updated 3 years ago
- Tools for content datamining and NLP at scale☆45Jun 20, 2024Updated 2 years ago
- Universal LLM Telegram chatbot in Python☆17Aug 16, 2024Updated last year
- ☆16Jun 18, 2022Updated 4 years ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆66Jul 8, 2024Updated last year
- ☆16Sep 30, 2023Updated 2 years ago
- ☆116Jan 21, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16Apr 28, 2023Updated 3 years ago
- ☆53Apr 17, 2022Updated 4 years ago
- [NeurIPS 2025] UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents☆58Nov 27, 2025Updated 7 months ago
- Official repository for ORPO☆481May 31, 2024Updated 2 years ago
- Accepted by ACL 2025☆30Aug 13, 2025Updated 10 months ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆116Feb 9, 2024Updated 2 years ago
- Decoupled Neural Interfaces Using Synthetic Gradients - under develeopment☆11Jun 27, 2025Updated last year
- Framework for building VulkanScenGraph related projects together☆15Oct 7, 2024Updated last year
- Stochastic trace estimation using JAX☆19Aug 20, 2025Updated 10 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆19Jul 16, 2020Updated 5 years ago
- DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents☆24Aug 4, 2025Updated 10 months ago
- CypherBench: Towards Precise Retrieval over Full-scale Modern Knowledge Graphs in the LLM Era☆39Jun 18, 2025Updated last year
- Official repository for Mi:dm 2.0, the large language model developed by KT.☆59Oct 29, 2025Updated 8 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆146Oct 27, 2024Updated last year
- ☆29Jan 23, 2024Updated 2 years ago
- Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation. Bingchen Zhao and Kai Han. (NeurIPS 2021)☆12Aug 20, 2023Updated 2 years ago