dCaples / AutoDidact
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
☆625 Updated 2 months ago
Alternatives and similar repositories for AutoDidact
Users who are interested in AutoDidact are comparing it to the libraries listed below.
- Atom of Thoughts for Markov LLM Test-Time Scaling ☆567 Updated this week
- Synthetic data curation for post-training and structured data extraction ☆1,364 Updated this week
- Minimal Linux OS with a Model Context Protocol (MCP) gateway to expose local capabilities to LLMs. ☆225 Updated this week
- An Open Large Reasoning Model for Real-World Solutions ☆1,494 Updated this week
- Verifiers for LLM Reinforcement Learning ☆1,057 Updated this week
- Search-o1: Agentic Search-Enhanced Large Reasoning Models ☆892 Updated 2 weeks ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL ☆2,453 Updated last week
- Recipes to scale inference-time compute of open models ☆1,087 Updated last week
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆530 Updated 2 months ago
- LIMO: Less is More for Reasoning ☆953 Updated last month
- free and open OpenAI Deep Research ☆565 Updated 3 months ago
- II-Researcher: a new open-source framework designed to aid building search / research agents ☆343 Updated 3 weeks ago
- Large Reasoning Models ☆804 Updated 5 months ago
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A… ☆723 Updated this week
- Lightweight and portable LLM sandbox runtime (code interpreter) Python library. ☆279 Updated this week
- ☆1,354 Updated 6 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆315 Updated this week
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning ☆888 Updated 2 weeks ago
- ☆934 Updated 4 months ago
- An Open Source Toolkit For LLM Distillation ☆612 Updated last month
- ☆1,024 Updated 5 months ago
- HawkinsDB is our take on giving AI systems a more human-like way to store and recall information, inspired by how our own brains work. Ba… ☆286 Updated 5 months ago
- ☆142 Updated last month
- Scaling Deep Research via Reinforcement Learning in Real-world Environments. ☆409 Updated last month
- A-MEM: Agentic Memory for LLM Agents ☆369 Updated 2 weeks ago
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners ☆607 Updated this week
- Automatic evals for LLMs ☆399 Updated this week
- Pretraining code for a large-scale depth-recurrent language model ☆770 Updated this week
- Fully open data curation for reasoning models ☆1,796 Updated last week
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling ☆345 Updated 2 weeks ago