OpenTinker is an RL-as-a-Service infrastructure for foundation models
☆632Feb 20, 2026Updated last week
Alternatives and similar repositories for OpenTinker
Users that are interested in OpenTinker are comparing it to the libraries listed below
Sorting:
- ☆65Feb 12, 2026Updated 2 weeks ago
- A Gym for Agentic LLMs☆452Jan 21, 2026Updated last month
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,628Updated this week
- Vortex: A Flexible and Efficient Sparse Attention Framework☆48Jan 21, 2026Updated last month
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆288Oct 2, 2025Updated 5 months ago
- ☆69Jan 18, 2026Updated last month
- Agent-OM: Leveraging LLM Agents for Ontology Matching☆18Jan 24, 2026Updated last month
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆64Feb 19, 2026Updated last week
- Our library for RL environments + evals☆3,869Updated this week
- [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents☆577Updated this week
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,522Updated this week
- Ludic – an LLM-RL library for the era of experience☆60Jan 9, 2026Updated last month
- Resources for the Enigmata Project.☆77Aug 13, 2025Updated 6 months ago
- slime is an LLM post-training framework for RL Scaling.☆4,381Updated this week
- A construction kit for reinforcement learning environment management.☆352Updated this week
- THEORY OF SPACE: a benchmark for evaluating whether foundation models can actively explore under partial observability efficiently to bui…☆36Updated this week
- Streamline on-policy/off-policy distillation workflows in a few lines of code☆95Updated this week
- Few-shot Learning with Auxiliary Data☆31Dec 8, 2023Updated 2 years ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆19,519Updated this week
- A benchmark for evaluating LLMs on open-ended CS problems. Exploring the Next Frontier of Computer Science.☆146Feb 23, 2026Updated last week
- Minimal open-source implementation of AlphaProof and HyperTree Proof Search.☆66Jan 31, 2026Updated last month
- Training Proactive and Personalized LLM Agents☆100Jan 20, 2026Updated last month
- ☆46Jun 11, 2025Updated 8 months ago
- Async RL Training at Scale☆1,096Updated this week
- An interface library for RL post training with environments.☆1,201Updated this week
- Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.☆906Feb 24, 2026Updated last week
- Dynadiff: Single-stage Decoding of Images from Continuously Evolving fMRI☆42May 13, 2025Updated 9 months ago
- AllenAI's post-training codebase☆3,592Feb 24, 2026Updated last week
- Training VLM agents with multi-turn reinforcement learning☆416Feb 24, 2026Updated last week
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.co…☆13Jan 16, 2026Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Oct 18, 2025Updated 4 months ago
- Data recipes and robust infrastructure for training AI agents☆104Updated this week
- ☆41Mar 26, 2025Updated 11 months ago
- Ideas for projects related to Tinker☆170Nov 6, 2025Updated 3 months ago
- LLMs + Lean, on your laptop or in the cloud☆202Oct 10, 2025Updated 4 months ago
- Run, deploy and monitor CLI agents in secure cloud sandboxes.☆45Updated this week
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,085Nov 13, 2025Updated 3 months ago
- ASTRA is an end-to-end system for synthesizing agentic trajectories and rule-verifiable environments for SFT and RL training, developed b…☆114Jan 30, 2026Updated last month
- An open-source reinforcement learning framework for training LLM-based agents — supporting GRPO, PPO, RLHF, multi-turn reasoning, tool us…☆284Feb 3, 2026Updated last month