A set of examples based on verl for end-to-end RL training recipes.
☆228Mar 25, 2026Updated this week
Alternatives and similar repositories for verl-recipe
Users that are interested in verl-recipe are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated 2 months ago
- Official code repository of Shuffle-R1☆25Feb 23, 2026Updated last month
- ☆17Nov 3, 2024Updated last year
- [TMLR] Process Reward Models That Think☆83Nov 29, 2025Updated 4 months ago
- A version of verl to support diverse tool use☆923Mar 2, 2026Updated 3 weeks ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Code for "Variational Reasoning for Language Models"☆58Sep 29, 2025Updated 6 months ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆68Mar 5, 2026Updated 3 weeks ago
- [ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"☆25Feb 7, 2026Updated last month
- Saving Dense Retriever from Shortcut Dependency in Conversational Search (EMNLP 2022)☆18Nov 24, 2022Updated 3 years ago
- The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]☆186Jun 5, 2025Updated 9 months ago
- Code for "Adaptive Self-improvement LLM Agentic System for ML Library Development" (ICML 2025)☆15Jan 6, 2026Updated 2 months ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆58Nov 5, 2025Updated 4 months ago
- ☆11Jul 21, 2024Updated last year
- ☆13Nov 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Jul 20, 2023Updated 2 years ago
- Vortex: A Flexible and Efficient Sparse Attention Framework☆49Jan 21, 2026Updated 2 months ago
- ConvGQR: Generative Query Reformulation for Conversational Search. A codebase for ACL 2023 accepted paper.☆34Mar 5, 2024Updated 2 years ago
- An Ultra-Long Output Reinforcement Learning Approach☆23Jul 31, 2025Updated 8 months ago
- Accelerating RL for LLM Reasoning with Optimal Advantage Regression☆40May 30, 2025Updated 10 months ago
- Qualifying Exam Preparing☆17May 7, 2025Updated 10 months ago
- Ideas for projects related to Tinker☆173Nov 6, 2025Updated 4 months ago
- An AI-powered coding assistant plugin for the Eclipse IDE.☆13Oct 28, 2025Updated 5 months ago
- DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation.☆131Feb 10, 2026Updated last month
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆10Apr 23, 2021Updated 4 years ago
- Demo for Qwen2.5-VL-3B-Instruct on Axera device.☆16Sep 3, 2025Updated 6 months ago
- {DeepL, Google, WMT-Best, davinci-003, turbo, gpt-4} × {En-De, En-Cs, En-Ru, En-Zh, De-Fr, En-Ja, Uk-En, Uk-Cs, En-Hr, En-Ha, En-Is}☆14Jun 18, 2023Updated 2 years ago
- AFlow & MathAI☆19Feb 24, 2025Updated last year
- RLLaVA is a user-friendly framework for multi-modal RL research and optimized for resource-constrained teams.☆58Mar 18, 2026Updated last week
- Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities☆1,171Jul 15, 2025Updated 8 months ago
- DOMAINEVAL is an auto-constructed benchmark for multi-domain code generation that consists of 2k+ subjects (i.e., description, reference …☆14Dec 12, 2024Updated last year
- ☆10Sep 18, 2017Updated 8 years ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆20,286Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [ICCV 2025] Dynamic-VLM☆28Dec 16, 2024Updated last year
- OfficeBench: Benchmarking Language Agents across Multiple Applications for Office Automation☆34May 23, 2025Updated 10 months ago
- Indonesian speech/phoneme recognizer powered by Kaldi 2.0 (lhotse, icefall, sherpa).☆15Jun 30, 2023Updated 2 years ago
- The code for paper "EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning"☆37Oct 1, 2025Updated 5 months ago
- 2025.01:从零到一实现了一个多模态大模型,并命名为Reyes(睿视),R:睿,eyes:眼。Reyes的参数量为8B,视觉编码器使用的是InternViT-300M-448px-V2_5,语言模型侧使用的 是Qwen2.5-7B-Instruct,Reyes也通过一个两…☆33Feb 10, 2026Updated last month
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆4,776Updated this week
- This is the implementation for the paper: Sequential Recommender System based on Hierarchical Attention Network☆11Mar 13, 2021Updated 5 years ago