OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆668 · Mar 21, 2026 · Updated last month
Alternatives and similar repositories for OpenTinker
Users that are interested in OpenTinker are comparing it to the libraries listed below.
- SkyRL: A Modular Full-stack RL Library for LLMs · ☆1,790 · Updated this week
- A Gym for Agentic LLMs · ☆478 · Jan 21, 2026 · Updated 3 months ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508) · ☆74 · Updated this week
- ☆100 · Mar 31, 2026 · Updated last month
- Vortex: A Flexible and Efficient Sparse Attention Framework · ☆52 · Apr 26, 2026 · Updated last week
- ☆73 · Jan 18, 2026 · Updated 3 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL · ☆298 · Oct 2, 2025 · Updated 7 months ago
- Ideas for projects related to Tinker · ☆173 · Nov 6, 2025 · Updated 5 months ago
- Data recipes and robust infrastructure for training AI agents · ☆123 · Updated this week
- Ludic – an LLM-RL library for the era of experience · ☆62 · Jan 9, 2026 · Updated 3 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments. · ☆2,642 · Apr 14, 2026 · Updated 2 weeks ago
- Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning · ☆63 · Dec 18, 2025 · Updated 4 months ago
- Our library for RL environments + evals · ☆4,057 · Updated this week
- Agentic RL Training at Scale · ☆1,323 · Updated this week
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents · ☆632 · Apr 20, 2026 · Updated last week
- Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime. · ☆1,145 · Apr 26, 2026 · Updated last week
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework · ☆21,046 · Updated this week
- ☆66 · Updated this week
- AndroidSubSystem4GNU/Linux · ☆46 · Dec 30, 2025 · Updated 4 months ago
- slime is an LLM post-training framework for RL Scaling. · ☆5,490 · Updated this week
- Streamline on-policy/off-policy distillation workflows in a few lines of code · ☆101 · Feb 26, 2026 · Updated 2 months ago
- [ICML 2026] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments · ☆205 · Updated this week
- A construction kit for reinforcement learning environment management. · ☆425 · Updated this week
- Agent-OM: Leveraging LLM Agents for Ontology Matching · ☆20 · Jan 24, 2026 · Updated 3 months ago
- Training Proactive and Personalized LLM Agents · ☆107 · Jan 20, 2026 · Updated 3 months ago
- Open-source release accompanying Gao et al. 2025 · ☆515 · Dec 11, 2025 · Updated 4 months ago
- A holistic framework for advancing LLMs as data science agents · ☆40 · Feb 3, 2026 · Updated 2 months ago
- Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics · ☆73 · Mar 26, 2026 · Updated last month
- Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement… · ☆9,382 · Updated this week
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training. · ☆228 · May 31, 2025 · Updated 11 months ago
- Resources for the Enigmata Project. · ☆81 · Aug 13, 2025 · Updated 8 months ago
- Democratizing Reinforcement Learning for LLMs · ☆5,462 · Updated this week
- Post-training with Tinker · ☆3,158 · Apr 26, 2026 · Updated last week
- LLMs + Lean, on your laptop or in the cloud · ☆207 · Oct 10, 2025 · Updated 6 months ago
- Archer2.0 evolves from its predecessor by introducing ASPO, which overcomes fundamental PPO-Clip limitations to prevent premature converg… · ☆31 · Oct 10, 2025 · Updated 6 months ago
- A self-hosted, browser-based AI CSV analyzer · ☆76 · Updated this week
- 💻 SETA: Scaling Environments for Terminal Agents - Environments · ☆129 · Feb 16, 2026 · Updated 2 months ago
- Automated High-Performance GPU Kernel Generation · ☆101 · Apr 20, 2026 · Updated last week
- Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings · ☆117 · Jul 27, 2025 · Updated 9 months ago