Autonomously train research-agent LLMs on custom data using reinforcement learning and self-verification.
☆689Mar 22, 2025Updated last year
Alternatives and similar repositories for AutoDidact
Users that are interested in AutoDidact are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆160Apr 17, 2025Updated last year
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,882Nov 13, 2025Updated 6 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆131Jun 11, 2025Updated last year
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,387May 16, 2025Updated last year
- ☆94Jul 7, 2025Updated 11 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- The Fastest Way to Fine-Tune LLMs Locally☆339Dec 18, 2025Updated 5 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆716Aug 5, 2025Updated 10 months ago
- ☆56Feb 10, 2025Updated last year
- Democratizing Reinforcement Learning for LLMs☆5,592Updated this week
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆57Feb 10, 2025Updated last year
- ☆226May 7, 2025Updated last year
- ☆21Mar 25, 2025Updated last year
- Optimizing inference proxy for LLMs☆4,135May 7, 2026Updated last month
- Minimal reproduction of DeepSeek R1-Zero☆13,140Feb 27, 2026Updated 3 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- SOTA search powered LLM☆3,826Apr 4, 2025Updated last year
- Create Custom LLMs☆1,851Apr 24, 2026Updated last month
- Local LLM Powered Recursive Search & Smart Knowledge Explorer☆263May 13, 2026Updated 3 weeks ago
- Efforts toward giving Qwen 3 Coder 30B A3B proper agentic tool calling capabilities at or near 100% reliability.☆63Aug 10, 2025Updated 10 months ago
- Train your own SOTA deductive reasoning model☆112Mar 6, 2025Updated last year
- Attend - to what matters.☆17Feb 22, 2025Updated last year
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,260Aug 27, 2025Updated 9 months ago
- Official Implementation of "KBLaM: Knowledge Base augmented Language Model"☆1,447Apr 20, 2026Updated last month
- ☆16Dec 16, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆53Oct 29, 2025Updated 7 months ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆303Apr 3, 2025Updated last year
- Unsloth Studio is a web UI for training and running open models like Gemma 4, Qwen3.6, DeepSeek, gpt-oss locally.☆66,153Updated this week
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models☆20Apr 15, 2025Updated last year
- ~95% on SimpleQA (e.g. Qwen3.6-27B on a 3090). Supports all local and cloud LLMs (llama.cpp, Ollama, Google, ...). 10+ search engines - …☆8,381Updated this week
- ☆21Aug 18, 2024Updated last year
- Scalable RL solution for advanced reasoning of language models☆1,861Mar 18, 2025Updated last year
- ☆1,376Updated this week
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling☆663Apr 1, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A repository to store helpful information and emerging insights in regard to LLMs☆21Oct 27, 2023Updated 2 years ago
- Our library for RL environments + evals☆4,167Updated this week
- Manifold is an experimental platform for enabling long horizon workflow automation using teams of AI assistants.☆495Updated this week
- Large-scale LLM inference engine☆1,762May 8, 2026Updated last month
- ☆16Jun 4, 2025Updated last year
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,542Mar 4, 2026Updated 3 months ago
- Synthetic data curation for post-training and structured data extraction☆1,684Apr 18, 2026Updated last month