OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆675Mar 21, 2026Updated 2 months ago
Alternatives and similar repositories for OpenTinker
Users that are interested in OpenTinker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,993Updated this week
- A Gym for Agentic LLMs☆493Jan 21, 2026Updated 4 months ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆75May 13, 2026Updated 3 weeks ago
- ☆125Mar 31, 2026Updated 2 months ago
- Vortex: Programmable Sparse Attention for Agents as Algorithm Designers☆59Updated this week
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆322Oct 2, 2025Updated 8 months ago
- Ideas for projects related to Tinker☆176Nov 6, 2025Updated 7 months ago
- ☆74Jan 18, 2026Updated 4 months ago
- Ludic – an LLM-RL library for the era of experience☆63Jan 9, 2026Updated 5 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,689Apr 14, 2026Updated last month
- Automate the creation of high quality research papers in latex. Powered by Swarms 🤖☆11Dec 1, 2025Updated 6 months ago
- Our library for RL environments + evals☆4,167Updated this week
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning☆64Dec 18, 2025Updated 5 months ago
- Data recipes and robust infrastructure for training AI agents☆161Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Schoenfeld’s Anatomy of Mathematical Reasoning by Language Models☆26Dec 21, 2025Updated 5 months ago
- Agentic RL Training at Scale☆1,455Updated this week
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents☆672Updated this week
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework☆21,850Updated this week
- ☆67May 7, 2026Updated last month
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆105Feb 26, 2026Updated 3 months ago
- Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.☆1,523Updated this week
- slime is an LLM post-training framework for RL Scaling.☆6,052Updated this week
- [ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments☆216Apr 30, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Agent-OM: Leveraging LLM Agents for Ontology Matching☆21May 2, 2026Updated last month
- A construction kit for reinforcement learning environment management.☆451Updated this week
- Training Proactive and Personalized LLM Agents☆111Jan 20, 2026Updated 4 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆230May 31, 2025Updated last year
- Resources for the Enigmata Project.☆81Aug 13, 2025Updated 9 months ago
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement…☆9,953Updated this week
- Democratizing Reinforcement Learning for LLMs☆5,592Updated this week
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics☆76Mar 26, 2026Updated 2 months ago
- An interface library for RL post training with environments.☆1,932Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- LLMs + Lean, on your laptop or in the cloud☆211Oct 10, 2025Updated 8 months ago
- Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature converg…☆31Oct 10, 2025Updated 8 months ago
- 🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation…☆138Nov 10, 2025Updated 7 months ago
- Post-training with Tinker☆3,442Updated this week
- 💻 SETA: Scaling Environments for Terminal Agents - Environments☆139Feb 16, 2026Updated 3 months ago
- LMCache: Supercharge Your LLM with the Fastest KV Cache Layer☆8,466Updated this week
- ☆30Mar 24, 2025Updated last year