maitrix-org / llm-reasoners
A library for advanced large language model reasoning
☆2,302 Updated 5 months ago
Alternatives and similar repositories for llm-reasoners
Users who are interested in llm-reasoners are comparing it to the libraries listed below.
- AllenAI's post-training codebase ☆3,294 Updated this week
- A reading list on LLM based Synthetic Data Generation 🔥 ☆1,460 Updated 5 months ago
- ☆1,035 Updated 11 months ago
- ☆963 Updated 9 months ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models ☆1,825 Updated 10 months ago
- ☆1,349 Updated 11 months ago
- An Open Large Reasoning Model for Real-World Solutions ☆1,527 Updated 5 months ago
- O1 Replication Journey ☆2,002 Updated 10 months ago
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast. ☆1,902 Updated 3 months ago
- Recipes to scale inference-time compute of open models ☆1,117 Updated 5 months ago
- Autonomous Agents (LLMs) research papers. Updated Daily. ☆1,058 Updated last week
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆2,390 Updated last week
- Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas" ☆1,388 Updated 8 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆2,927 Updated last week
- List of language agents based on paper "Cognitive Architectures for Language Agents" ☆1,060 Updated 10 months ago
- A bibliography and survey of the papers surrounding o1 ☆1,209 Updated last year
- Awesome things about LLM-powered agents. Papers / Repos / Blogs / ... ☆2,151 Updated 6 months ago
- From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 ☆3,424 Updated 6 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆3,484 Updated 2 weeks ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆892 Updated last month
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24) ☆2,925 Updated last month
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆2,108 Updated this week
- Training Large Language Model to Reason in a Continuous Latent Space ☆1,327 Updated 3 months ago
- Large Reasoning Models ☆807 Updated 11 months ago
- 800,000 step-level correctness labels on LLM solutions to MATH problems ☆2,068 Updated 2 years ago
- Scalable RL solution for advanced reasoning of language models ☆1,769 Updated 8 months ago
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning ☆1,241 Updated 6 months ago
- [COLM 2025] LIMO: Less is More for Reasoning ☆1,045 Updated 3 months ago
- Code for Quiet-STaR ☆741 Updated last year
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,217 Updated last year