OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆675 · Mar 21, 2026 · Updated 3 months ago
Alternatives and similar repositories for OpenTinker
Users who are interested in OpenTinker are comparing it to the libraries listed below.
- SkyRL: A Modular Full-stack RL Library for LLMs ☆2,045 · Updated this week
- A Gym for Agentic LLMs ☆497 · Jan 21, 2026 · Updated 5 months ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508) ☆75 · Jun 15, 2026 · Updated 2 weeks ago
- ☆130 · Mar 31, 2026 · Updated 3 months ago
- Vortex: Programmable Sparse Attention for Agents as Algorithm Designers ☆63 · Jun 24, 2026 · Updated last week
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL ☆325 · Jun 17, 2026 · Updated 2 weeks ago
- Ideas for projects related to Tinker ☆178 · Nov 6, 2025 · Updated 7 months ago
- ☆74 · Jan 18, 2026 · Updated 5 months ago
- Ludic – an LLM-RL library for the era of experience ☆66 · Jan 9, 2026 · Updated 5 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. ☆2,717 · Apr 14, 2026 · Updated 2 months ago
- Automate the creation of high quality research papers in latex. Powered by Swarms 🤖 ☆11 · Dec 1, 2025 · Updated 7 months ago
- Our library for RL environments + evals ☆4,233 · Jun 26, 2026 · Updated last week
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning ☆64 · Dec 18, 2025 · Updated 6 months ago
- Data recipes and robust infrastructure for training AI agents ☆205 · Updated this week
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models ☆27 · Dec 21, 2025 · Updated 6 months ago
- Agentic RL Training at Scale ☆1,566 · Updated this week
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents ☆688 · Updated this week
- slime is an LLM post-training framework for RL Scaling. ☆7,099 · Updated this week
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework ☆22,173 · Updated this week
- ☆67 · May 7, 2026 · Updated last month
- Streamline on-policy/off-policy distillation workflows in a few lines of code ☆105 · Updated this week
- Training VLM agents with multi-turn reinforcement learning ☆478 · May 11, 2026 · Updated last month
- [ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments ☆221 · Apr 30, 2026 · Updated 2 months ago
- Agent-OM: Leveraging LLM Agents for Ontology Matching ☆22 · May 2, 2026 · Updated 2 months ago
- A construction kit for reinforcement learning environment management. ☆459 · Jun 25, 2026 · Updated last week
- Training Proactive and Personalized LLM Agents ☆111 · Jan 20, 2026 · Updated 5 months ago
- AndroidSubSystem4GNU/Linux ☆48 · Dec 30, 2025 · Updated 6 months ago
- Open-source release accompanying Gao et al. 2025 ☆526 · Dec 11, 2025 · Updated 6 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning models without training. ☆231 · May 31, 2025 · Updated last year
- Resources for the Enigmata Project. ☆82 · Aug 13, 2025 · Updated 10 months ago
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement… ☆10,198 · Updated this week
- Democratizing Reinforcement Learning for LLMs ☆5,649 · Updated this week
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics ☆76 · Mar 26, 2026 · Updated 3 months ago
- LLMs + Lean, on your laptop or in the cloud ☆213 · Oct 10, 2025 · Updated 8 months ago
- Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature converg… ☆31 · Oct 10, 2025 · Updated 8 months ago
- An interface library for RL post training with environments. ☆2,362 · Updated this week
- Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 training ☆118 · Jul 27, 2025 · Updated 11 months ago
- Post-training with Tinker ☆3,517 · Updated this week
- 💻 SETA: Scaling Environments for Terminal Agents - Environments ☆140 · Feb 16, 2026 · Updated 4 months ago