magpie-align / magpieLinks
[ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data generation pipeline!
☆786Updated 7 months ago
Alternatives and similar repositories for magpie
Users that are interested in magpie are comparing it to the libraries listed below
Sorting:
- ☆963Updated 9 months ago
- Large Reasoning Models☆806Updated 11 months ago
- ☆995Updated 4 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆926Updated 9 months ago
- Official repository for ORPO☆463Updated last year
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆575Updated 11 months ago
- ☆552Updated 11 months ago
- RewardBench: the first evaluation tool for reward models.☆653Updated 5 months ago
- Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.☆750Updated last year
- ☆314Updated last year
- An Open Source Toolkit For LLM Distillation☆779Updated 4 months ago
- Generative Representational Instruction Tuning