maitrix-org / llm-reasonersLinks
A library for advanced large language model reasoning
β2,292Updated 4 months ago
Alternatives and similar repositories for llm-reasoners
Users that are interested in llm-reasoners are comparing it to the libraries listed below
Sorting:
- Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...β2,141Updated 5 months ago
- A reading list on LLM based Synthetic Data Generation π₯β1,441Updated 4 months ago
- Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"β1,373Updated 8 months ago
- β1,035Updated 10 months ago
- List of language agents based on paper "Cognitive Architectures for Language Agents"β1,050Updated 9 months ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Modelsβ1,823Updated 9 months ago
- From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 πβ3,398Updated 5 months ago
- O1 Replication Journeyβ2,003Updated 9 months ago
- Autonomous Agents (LLMs) research papers. Updated Daily.β1,044Updated this week
- An Open Large Reasoning Model for Real-World Solutionsβ1,524Updated 4 months ago
- β1,350Updated 11 months ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)β2,889Updated 2 weeks ago
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.β1,885Updated 2 months ago
- AllenAI's post-training codebaseβ3,263Updated this week
- Must-read Papers on LLM Agents.β2,731Updated 2 weeks ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRLβ3,400Updated 3 weeks ago
- A repo lists papers related to LLM based agentβ2,066Updated 3 months ago
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,β¦β2,225Updated last year
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"β797Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ2,060Updated last year
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β2,373Updated this week
- β964Updated 9 months ago
- [ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.β2,601Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifiβ¦β2,903Updated last week
- Large Reasoning Modelsβ805Updated 10 months ago
- A bibliography and survey of the papers surrounding o1β1,209Updated 11 months ago
- [ICLR 2025] Automated Design of Agentic Systemsβ1,438Updated 9 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).β890Updated 3 weeks ago
- Recipes to scale inference-time compute of open modelsβ1,114Updated 5 months ago
- Code for Quiet-STaRβ739Updated last year