OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆674 · Mar 21, 2026 · Updated 2 months ago
Alternatives and similar repositories for OpenTinker
Users that are interested in OpenTinker are comparing it to the libraries listed below.
- SkyRL: A Modular Full-stack RL Library for LLMs · ☆1,867 · Updated this week
- A Gym for Agentic LLMs · ☆488 · Jan 21, 2026 · Updated 4 months ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508) · ☆75 · May 13, 2026 · Updated last week
- ☆119 · Mar 31, 2026 · Updated last month
- Vortex: A Flexible and Efficient Sparse Attention Framework · ☆53 · Updated this week
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL · ☆318 · Oct 2, 2025 · Updated 7 months ago
- Ideas for projects related to Tinker · ☆176 · Nov 6, 2025 · Updated 6 months ago
- ☆74 · Jan 18, 2026 · Updated 4 months ago
- Data recipes and robust infrastructure for training AI agents · ☆150 · Updated this week
- Ludic – an LLM-RL library for the era of experience · ☆63 · Jan 9, 2026 · Updated 4 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. · ☆2,668 · Apr 14, 2026 · Updated last month
- Automate the creation of high-quality research papers in LaTeX. Powered by Swarms 🤖 · ☆11 · Dec 1, 2025 · Updated 5 months ago
- Our library for RL environments + evals · ☆4,125 · Updated this week
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning · ☆64 · Dec 18, 2025 · Updated 5 months ago
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models · ☆22 · Dec 21, 2025 · Updated 5 months ago
- Agentic RL Training at Scale · ☆1,384 · Updated this week
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents · ☆649 · Apr 27, 2026 · Updated 3 weeks ago
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework · ☆21,337 · May 16, 2026 · Updated last week
- Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime. · ☆1,340 · Updated this week
- ☆66 · May 7, 2026 · Updated 2 weeks ago
- AndroidSubSystem4GNU/Linux · ☆44 · Dec 30, 2025 · Updated 4 months ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code · ☆103 · Feb 26, 2026 · Updated 2 months ago
- Training VLM agents with multi-turn reinforcement learning · ☆457 · May 11, 2026 · Updated last week
- slime is an LLM post-training framework for RL Scaling. · ☆5,710 · Updated this week
- [ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments · ☆209 · Apr 30, 2026 · Updated 3 weeks ago
- A construction kit for reinforcement learning environment management. · ☆440 · Updated this week
- Agent-OM: Leveraging LLM Agents for Ontology Matching · ☆20 · May 2, 2026 · Updated 3 weeks ago
- Training Proactive and Personalized LLM Agents · ☆110 · Jan 20, 2026 · Updated 4 months ago
- Open-source release accompanying Gao et al. 2025 · ☆521 · Dec 11, 2025 · Updated 5 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training. · ☆230 · May 31, 2025 · Updated 11 months ago
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement… · ☆9,459 · May 16, 2026 · Updated last week
- Resources for the Enigmata Project. · ☆82 · Aug 13, 2025 · Updated 9 months ago
- Democratizing Reinforcement Learning for LLMs · ☆5,548 · Updated this week
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics · ☆73 · Mar 26, 2026 · Updated last month
- LLMs + Lean, on your laptop or in the cloud