sii-research / siiRLLinks
siiRL: Shanghai Innovation Institute RL Framework for Advanced LLMs and Multi-Agent Systems
☆330Updated this week
Alternatives and similar repositories for siiRL
Users that are interested in siiRL are comparing it to the libraries listed below
Sorting:
- Training VLM agents with multi-turn reinforcement learning☆381Updated this week
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆330Updated 9 months ago
- Implementation for FP8/INT8 Rollout for RL training without performence drop.☆288Updated 2 months ago
- Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.☆304Updated this week
- ☆132Updated 2 months ago
- Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme☆147Updated 9 months ago
- Official PyTorch implementation of the paper "dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching" (dLLM-Cache…☆197Updated 2 months ago
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithm…☆98Updated 5 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆191Updated 10 months ago
- A Telegram bot to recommend arXiv papers☆302Updated 2 months ago
- Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934☆204Updated 3 months ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆350Updated 2 weeks ago
- TraceRL & TraDo-8B: Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models☆398Updated last month
- ☆61Updated 9 months ago
- Towards a Unified View of Large Language Model Post-Training☆199Updated 4 months ago
- repo for paper https://arxiv.org/abs/2504.13837☆323Updated last month
- 青稞Talk☆189Updated last week
- ☆222Updated last month
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆411Updated 6 months ago
- A set of examples based on verl for end-to-end RL training recipes.☆139Updated last week
- VLA-Arena is an open-source benchmark for systematic evaluation of Vision-Language-Action (VLA) models.☆104Updated 2 weeks ago
- ☆110Updated 4 months ago
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.☆154Updated 3 weeks ago
- Official implementation of "Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding"☆799Updated 2 months ago
- Papers on integrating large language models with embodied AI☆36Updated 2 years ago
- ☆101Updated last month
- ☆209Updated 3 months ago
- MrlX: A Multi-Agent Reinforcement Learning Framework☆186Updated last week
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.☆225Updated 5 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆404Updated 3 months ago