dCaples / AutoDidactLinks
Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
☆679Updated 9 months ago
Alternatives and similar repositories for AutoDidact
Users that are interested in AutoDidact are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆631Updated last month
- Synthetic data curation for post-training and structured data extraction☆1,594Updated last week
- An Open Large Reasoning Model for Real-World Solutions☆1,535Updated 7 months ago
- Recipes to scale inference-time compute of open models☆1,123Updated 7 months ago
- An Open Source Toolkit For LLM Distillation☆819Updated 3 weeks ago
- ☆968Updated 11 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data☆423Updated 2 weeks ago
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A…☆960Updated 7 months ago
- multi1: create o1-like reasoning chains with multiple AI providers (and locally). Supports LiteLLM as backend too for 100+ providers at o…☆351Updated 11 months ago
- ☆1,344Updated last year
- Prompt-to-Leaderboard☆271Updated 8 months ago
- Pretraining and inference code for a large-scale depth-recurrent language model☆859Updated 2 weeks ago
- ☆158Updated 8 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only textual task description as the input☆936Updated 7 months ago
- ☆1,032Updated last year
- ☆1,173Updated 3 weeks ago
- ☆433Updated last year
- [NeurIPS'25] Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution"☆654Updated 9 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆566Updated 8 months ago
- Optimizing inference proxy for LLMs☆3,266Updated 2 weeks ago
- [COLM 2025] LIMO: Less is More for Reasoning☆1,061Updated 5 months ago
- A benchmark for emotional intelligence in large language models☆396Updated last year
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆346Updated last year
- Code and data for the Chain-of-Draft (CoD) paper☆338Updated 10 months ago
- Large Reasoning Models☆804Updated last year
- Automatic evals for LLMs☆574Updated 3 weeks ago
- Dream 7B, a large diffusion language model☆1,139Updated last month
- Unleashing the Power of Reinforcement Learning for Math and Code Reasoners☆737Updated 7 months ago
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.☆500Updated last year
- Lightweight and portable LLM sandbox runtime (code interpreter) Python library.☆760Updated last month