allenai / open-instructLinks
AllenAI's post-training codebase
☆3,018Updated this week
Alternatives and similar repositories for open-instruct
Users that are interested in open-instruct are comparing it to the libraries listed below
Sorting:
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,629Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆2,757Updated last week
- Minimalistic large language model 3D-parallelism training☆1,926Updated last week
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.☆1,771Updated 5 months ago
- Democratizing Reinforcement Learning for LLMs☆3,378Updated last month
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,407Updated 2 weeks ago
- Fully open data curation for reasoning models☆1,921Updated 2 weeks ago
- An Open Large Reasoning Model for Real-World Solutions☆1,497Updated 3 weeks ago
- ☆1,225Updated 3 months ago
- O1 Replication Journey☆1,992Updated 5 months ago
- Recipes to scale inference-time compute of open models☆1,095Updated 3 weeks ago
- Tools for merging pretrained large language models.☆5,829Updated this week
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆1,889Updated 10 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆897Updated 4 months ago
- A bibliography and survey of the papers surrounding o1☆1,199Updated 7 months ago
- Data and tools for generating and inspecting OLMo pre-training data.☆1,241Updated last week
- Curated list of datasets and tools for post-training.☆3,158Updated 4 months ago
- A library for advanced large language model reasoning☆2,144Updated last week
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,785Updated 5 months ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆9,710Updated this week
- Scalable RL solution for advanced reasoning of language models☆1,615Updated 3 months ago
- Synthetic data curation for post-training and structured data extraction☆1,404Updated this week
- LIMO: Less is More for Reasoning☆960Updated 2 months ago
- Reference implementation for DPO (Direct Preference Optimization)☆2,609Updated 10 months ago
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,162Updated last year
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆857Updated last week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆2,612Updated 2 weeks ago
- Training Large Language Model to Reason in a Continuous Latent Space☆1,155Updated 4 months ago
- A reading list on LLM based Synthetic Data Generation 🔥☆1,306Updated 2 weeks ago
- Official Repo for Open-Reasoner-Zero☆1,967Updated 2 weeks ago