dCaples / AutoDidactLinks
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
☆682Updated 10 months ago
Alternatives and similar repositories for AutoDidact
Users that are interested in AutoDidact are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆640Updated 2 months ago
- Synthetic data curation for post-training and structured data extraction☆1,618Updated 2 weeks ago
- An Open Source Toolkit For LLM Distillation☆859Updated last month
- 🤗 Benchmark Large Language Models Reliably On Your Data☆426Updated last month
- An Open Large Reasoning Model for Real-World Solutions☆1,533Updated this week
- Recipes to scale inference-time compute of open models☆1,124Updated 8 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆938Updated 8 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆566Updated 9 months ago
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]☆1,164Updated 2 months ago
- Optimizing inference proxy for LLMs☆3,317Updated last week
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆346Updated last year
- ☆1,193Updated last month
- ☆159Updated 9 months ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o…☆350Updated last year
- ☆970Updated last year
- ☆1,033Updated last year
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆673Updated 10 months ago
- ☆1,346Updated last year
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A…☆966Updated 8 months ago
- Build datasets using natural language☆566Updated 4 months ago
- A benchmark for emotional intelligence in large language models☆398Updated last year
- The code for NeurIPS 2025 paper "A-Mem: Agentic Memory for LLM Agents"☆775Updated last month
- Large Reasoning Models☆807Updated last year
- ☆434Updated last year
- Integrating Tool Use into LLM Reasoning☆712Updated 11 months ago
- [COLM 2025] LIMO: Less is More for Reasoning☆1,061Updated 6 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆487Updated last year
- II-Researcher: a new open-source framework designed to aid building search / research agents☆493Updated 6 months ago
- Prompt-to-Leaderboard☆271Updated 8 months ago
- Code and data for the Chain-of-Draft (CoD) paper☆339Updated 10 months ago