OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆645Mar 18, 2026Updated this week
Alternatives and similar repositories for OpenTinker
Users that are interested in OpenTinker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Gym for Agentic LLMs☆467Jan 21, 2026Updated 2 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,699Updated this week
- ☆87Feb 12, 2026Updated last month
- Vortex: A Flexible and Efficient Sparse Attention Framework☆49Jan 21, 2026Updated 2 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆292Oct 2, 2025Updated 5 months ago
- ☆70Jan 18, 2026Updated 2 months ago
- Data recipes and robust infrastructure for training AI agents☆111Updated this week
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆65Feb 19, 2026Updated last month
- Ideas for projects related to Tinker☆174Nov 6, 2025Updated 4 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,553Mar 15, 2026Updated last week
- Ludic – an LLM-RL library for the era of experience☆61Jan 9, 2026Updated 2 months ago
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning☆60Dec 18, 2025Updated 3 months ago
- Our library for RL environments + evals☆3,918Updated this week
- Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.☆1,001Updated this week
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents☆597Mar 16, 2026Updated last week
- AndroidSubSystem4GNU/Linux☆36Dec 30, 2025Updated 2 months ago
- Async RL Training at Scale☆1,156Updated this week
- verl: Volcano Engine Reinforcement Learning for LLMs☆20,097Updated this week
- slime is an LLM post-training framework for RL Scaling.☆4,799Updated this week
- Automated GPU Kernel Generation via Co-Evolving Intrinsic World Model☆85Mar 2, 2026Updated 3 weeks ago
- [Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆191Jan 12, 2026Updated 2 months ago
- Training VLM agents with multi-turn reinforcement learning☆431Mar 11, 2026Updated last week
- ☆62Updated this week
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆97Feb 26, 2026Updated 3 weeks ago
- A construction kit for reinforcement learning environment management.☆384Updated this week
- Training Proactive and Personalized LLM Agents☆103Jan 20, 2026Updated 2 months ago
- Agent-OM: Leveraging LLM Agents for Ontology Matching☆19Jan 24, 2026Updated last month
- An interface library for RL post training with environments.☆1,288Updated this week
- Open-source release accompanying Gao et al. 2025☆510Dec 11, 2025Updated 3 months ago
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics☆72Jan 13, 2026Updated 2 months ago
- A holistic framework for advancing LLMs as data science agents☆39Feb 3, 2026Updated last month
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆26Mar 9, 2026Updated 2 weeks ago
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…☆9,050Updated this week
- Post-training with Tinker☆2,942Mar 15, 2026Updated last week
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆224May 31, 2025Updated 9 months ago
- Resources for the Enigmata Project.☆80Aug 13, 2025Updated 7 months ago
- ☆22Dec 29, 2025Updated 2 months ago
- LLMs + Lean, on your laptop or in the cloud☆203Oct 10, 2025Updated 5 months ago
- Democratizing Reinforcement Learning for LLMs☆5,259Updated this week