maitrix-org / llm-reasoners
A library for advanced large language model reasoning
☆1,690 · Updated this week
Alternatives and similar repositories for llm-reasoners:
Users who are interested in llm-reasoners are comparing it to the libraries listed below:
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs). ☆791 · Updated this week
- List of language agents based on paper "Cognitive Architectures for Language Agents" ☆852 · Updated last week
- Awesome things about LLM-powered agents. Papers / Repos / Blogs / ... ☆1,787 · Updated 3 weeks ago
- Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓 ☆2,349 · Updated this week
- Code for Quiet-STaR ☆706 · Updated 5 months ago
- Recipes to scale inference-time compute of open models ☆975 · Updated last week
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast. ☆1,625 · Updated last month
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆1,677 · Updated 5 months ago
- 📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥 ☆1,187 · Updated last week
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models ☆1,497 · Updated last week
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI ☆1,359 · Updated 9 months ago
- A bibliography and survey of the papers surrounding o1 ☆1,076 · Updated 2 months ago
- [ACL 2023] Reasoning with Language Model Prompting: A Survey ☆929 · Updated last month
- ☆997 · Updated last month
- O1 Replication Journey ☆1,910 · Updated 2 weeks ago
- ☆2,341 · Updated this week
- A repo lists papers related to LLM based agent ☆1,215 · Updated 5 months ago
- Large Reasoning Models ☆801 · Updated last month
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward ☆803 · Updated 2 months ago
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively. ☆661 · Updated 3 months ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24) ☆2,346 · Updated 2 months ago
- A reading list on LLM based Synthetic Data Generation 🔥 ☆993 · Updated 2 months ago
- YaRN: Efficient Context Window Extension of Large Language Models ☆1,405 · Updated 9 months ago
- Benchmarking large language models' complex reasoning ability with chain-of-thought prompting ☆2,644 · Updated 5 months ago
- ☆868 · Updated this week
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi… ☆2,064 · Updated this week
- TextGrad: Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients. ☆2,017 · Updated this week
- List of papers on hallucination detection in LLMs. ☆750 · Updated last month
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive… ☆903 · Updated 3 months ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models ☆583 · Updated 3 weeks ago