dCaples / AutoDidact
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
☆632 Updated 3 months ago
Alternatives and similar repositories for AutoDidact
Users who are interested in AutoDidact are comparing it to the libraries listed below.
- Atom of Thoughts for Markov LLM Test-Time Scaling ☆574 Updated this week
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆329 Updated this week
- Synthetic data curation for post-training and structured data extraction ☆1,404 Updated this week
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input ☆649 Updated 2 weeks ago
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning ☆936 Updated last month
- Search-o1: Agentic Search-Enhanced Large Reasoning Models ☆927 Updated last month
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆545 Updated 3 months ago
- Recipes to scale inference-time compute of open models ☆1,095 Updated last month
- ☆1,356 Updated 7 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction ☆534 Updated last month
- ☆938 Updated 4 months ago
- free and open OpenAI Deep Research ☆589 Updated 4 months ago
- Large Reasoning Models ☆804 Updated 6 months ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o… ☆347 Updated 4 months ago
- ☆149 Updated 2 months ago
- Make any LLM think like OpenAI o1 and DeepSeek R1 ☆490 Updated 4 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆2,612 Updated 2 weeks ago
- Verifiers for LLM Reinforcement Learning ☆1,290 Updated last week
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners ☆632 Updated 2 weeks ago
- Lightweight and portable LLM sandbox runtime (code interpreter) Python library. ☆317 Updated this week
- II-Researcher: a new open-source framework designed to aid building search / research agents ☆373 Updated last month
- LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA ☆496 Updated 5 months ago
- An Open Large Reasoning Model for Real-World Solutions ☆1,498 Updated 3 weeks ago
- ☆784 Updated last week
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching ☆1,010 Updated last week
- Scalable RL solution for advanced reasoning of language models ☆1,615 Updated 3 months ago
- Pretraining code for a large-scale depth-recurrent language model ☆782 Updated last week
- ☆478 Updated 2 weeks ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and … ☆345 Updated last year
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆406 Updated 2 weeks ago