dCaples / AutoDidact
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
☆542Updated this week
Alternatives and similar repositories for AutoDidact:
Users that are interested in AutoDidact are comparing it to the libraries listed below
- ☆514Updated last week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆1,389Updated this week
- Synthetic data curation for post-training and structured data extraction☆1,065Updated this week
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆425Updated 6 months ago
- LIMO: Less is More for Reasoning☆864Updated last month
- Search-o1: Agentic Search-Enhanced Large Reasoning Models☆735Updated 3 weeks ago
- Official Implementation of "KBLaM: Knowledge Base augmented Language Model"☆487Updated 3 weeks ago
- Recipes to scale inference-time compute of open models☆1,044Updated last month
- An Open Source Toolkit For LLM Distillation☆554Updated 2 months ago
- Verifiers for LLM Reinforcement Learning☆686Updated this week
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆474Updated last week
- Optimizing inference proxy for LLMs☆2,112Updated last week
- ☆910Updated 2 months ago
- ☆1,348Updated 4 months ago
- Large Reasoning Models☆799Updated 3 months ago
- A open, local Manus AI alternative. Powered with Deepseek R1. No APIs, no $456 monthly bills. Enjoy an AI agent that reason, code, and br…☆594Updated this week
- Build your own visual reasoning model☆312Updated this week
- CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆480Updated last month
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA☆479Updated 2 months ago
- The Fastest Way to Fine-Tune LLMs Locally☆284Updated last week
- free and open OpenAI Deep Research☆480Updated last month
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o…☆347Updated last month
- 🤠 Agent-as-a-Judge and DevAI dataset☆384Updated 2 months ago
- ☆438Updated 5 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine