dCaples / AutoDidact
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
☆611 Updated last month
Alternatives and similar repositories for AutoDidact:
Users who are interested in AutoDidact are comparing it to the libraries listed below.
- Atom of Thoughts for Markov LLM Test-Time Scaling ☆560 Updated last week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆2,127 Updated last week
- Synthetic data curation for post-training and structured data extraction ☆1,290 Updated last week
- Search-o1: Agentic Search-Enhanced Large Reasoning Models ☆851 Updated this week
- An Open Large Reasoning Model for Real-World Solutions ☆1,488 Updated 2 months ago
- Verifiers for LLM Reinforcement Learning ☆881 Updated last month
- ☆928 Updated 3 months ago
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning ☆808 Updated last week
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o… ☆347 Updated 3 months ago
- Recipes to scale inference-time compute of open models ☆1,066 Updated 2 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆287 Updated this week
- Dream 7B, a large diffusion language model ☆622 Updated last week
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners ☆540 Updated 2 weeks ago
- Optimizing inference proxy for LLMs ☆2,210 Updated this week
- Scalable RL solution for advanced reasoning of language models ☆1,529 Updated last month
- [ICML 2025 Spotlight] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction ☆515 Updated 2 months ago
- Large Reasoning Models ☆804 Updated 5 months ago
- ☆1,356 Updated 5 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆1,698 Updated this week
- Understanding R1-Zero-Like Training: A Critical Perspective ☆908 Updated 3 weeks ago
- Pretraining code for a large-scale depth-recurrent language model ☆755 Updated 3 weeks ago
- An Open Source Toolkit For LLM Distillation ☆594 Updated last week
- Fully open data curation for reasoning models ☆1,742 Updated last month
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆512 Updated last month
- ☆1,017 Updated 4 months ago
- A-MEM: Agentic Memory for LLM Agents ☆289 Updated last month
- Build your own visual reasoning model ☆357 Updated this week
- LIMO: Less is More for Reasoning ☆927 Updated last month
- CodeScientist: An automated scientific discovery system for code-based experiments ☆245 Updated last month
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs" ☆436 Updated last month