dCaples / AutoDidact
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
☆668Updated 7 months ago
Alternatives and similar repositories for AutoDidact
Users who are interested in AutoDidact are comparing it to the libraries listed below.
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆596Updated 4 months ago
- Synthetic data curation for post-training and structured data extraction☆1,547Updated 3 months ago
- ☆1,116Updated last year
- An Open Source Toolkit For LLM Distillation☆777Updated 4 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆558Updated 6 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆411Updated last month
- An Open Large Reasoning Model for Real-World Solutions☆1,527Updated 5 months ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆616Updated 7 months ago
- ☆963Updated 9 months ago
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A…☆933Updated 5 months ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o…☆350Updated 9 months ago
- Build datasets using natural language☆543Updated last month
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆913Updated 5 months ago
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]☆1,084Updated 2 months ago
- ☆1,035Updated 10 months ago
- ☆158Updated 6 months ago
- ☆433Updated last year
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning☆1,241Updated 5 months ago
- Lightweight and portable LLM sandbox runtime (code interpreter) Python library.☆613Updated this week
- Optimizing inference proxy for LLMs☆3,091Updated this week
- ☆1,348Updated 11 months ago
- [COLM 2025] LIMO: Less is More for Reasoning☆1,045Updated 3 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆3,484Updated 2 weeks ago
- Recipes to scale inference-time compute of open models☆1,117Updated 5 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆344Updated last year
- Integrating Tool Use into LLM Reasoning☆692Updated 8 months ago
- The code for NeurIPS 2025 paper "A-MEM: Agentic Memory for LLM Agents"☆667Updated last week
- II-Researcher: a new open-source framework designed to aid building search / research agents☆477Updated 3 months ago
- [NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards☆1,214Updated last month
- Prompt-to-Leaderboard☆260Updated 6 months ago