dCaples / AutoDidactLinks
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
☆682Updated 10 months ago
Alternatives and similar repositories for AutoDidact
Users that are interested in AutoDidact are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆640Updated 2 months ago
- Synthetic data curation for post-training and structured data extraction☆1,618Updated 2 weeks ago
- An Open Large Reasoning Model for Real-World Solutions☆1,533Updated 8 months ago
- An Open Source Toolkit For LLM Distillation☆859Updated last month
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o…☆350Updated last year
- ☆1,033Updated last year
- ☆971Updated last year
- ☆434Updated last year
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A…☆966Updated 8 months ago
- A benchmark for emotional intelligence in large language models☆398Updated last year
- Recipes to scale inference-time compute of open models☆1,124Updated 8 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆938Updated 7 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆566Updated 9 months ago
- A compact LLM pretrained in 9 days by using high quality data☆339Updated 9 months ago
- Plug-and-play tree search for agents☆271Updated 6 months ago
- ☆159Updated 9 months ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆677Updated 7 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆426Updated last month
- ☆1,193Updated last month
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆346Updated last year
- Prompt-to-Leaderboard☆271Updated 8 months ago
- ☆1,346Updated last year
- Large Reasoning Models☆807Updated last year
- [COLM 2025] LIMO: Less is More for Reasoning☆1,061Updated 6 months ago
- Optimizing inference proxy for LLMs☆3,317Updated last week
- Pretraining and inference code for a large-scale depth-recurrent language model☆861Updated last month
- Scalable RL solution for advanced reasoning of language models☆1,803Updated 10 months ago
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆673Updated 10 months ago
- II-Researcher: a new open-source framework designed to aid building search / research agents☆493Updated 6 months ago
- A collection of prompts to challenge the reasoning abilities of large language models in presence of misguiding information☆456Updated 6 months ago