allenai / open-instruct
AllenAI's post-training codebase
☆2,657Updated this week
Alternatives and similar repositories for open-instruct:
Users that are interested in open-instruct are comparing it to the libraries listed below
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,448Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMs☆3,387Updated this week
- O1 Replication Journey☆1,947Updated last month
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.☆1,654Updated last month
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,651Updated last month
- Scalable RL solution for advanced reasoning of language models☆1,262Updated this week
- ☆1,326Updated 3 months ago
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,160Updated this week
- Recipes to scale inference-time compute of open models☆1,002Updated last month
- ☆1,006Updated 2 months ago
- A library for advanced large language model reasoning☆1,946Updated this week
- Tools for merging pretrained large language models.☆5,260Updated last week
- Minimalistic large language model 3D-parallelism training☆1,483Updated this week
- An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)☆4,809Updated this week
- An Open Large Reasoning Model for Real-World Solutions☆1,444Updated 2 months ago
- PyTorch native post-training library☆4,856Updated this week
- A bibliography and survey of the papers surrounding o1☆1,155Updated 3 months ago
- Large Reasoning Models☆801Updated 2 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,230Updated last week
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆1,900Updated last year
- ☆890Updated 3 weeks ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,790Updated last year
- Stanford NLP Python library for Representation Finetuning (ReFT)☆1,418Updated 2 weeks ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆821Updated this week
- YaRN: Efficient Context Window Extension of Large Language Models☆1,421Updated 10 months ago
- Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"☆1,018Updated this week
- Reference implementation for DPO (Direct Preference Optimization)☆2,377Updated 6 months ago