agentica-project / verl-pipeline
Async pipelined version of Verl
☆124Updated 9 months ago
Alternatives and similar repositories for verl-pipeline
Users who are interested in verl-pipeline are comparing it to the libraries listed below.
- Homepage for ProLong (Princeton long-context language models) and paper "How to Train Long-Context Language Models (Effectively)"☆245Updated 4 months ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆226Updated 6 months ago
- (best/better) practices of megatron on veRL and tuning guide☆124Updated 4 months ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆116Updated 5 months ago
- Implementation of FP8/INT8 rollout for RL training without performance drop.☆287Updated 2 months ago
- [COLM 2025] Code for Paper: Learning Adaptive Parallel Reasoning with Language Models☆138Updated last month
- A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.☆249Updated 9 months ago
- Reproducing R1 for Code with Reliable Rewards☆282Updated 8 months ago
- [NeurIPS'24] Official code for *🎯DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving*☆120Updated last year
- The HELMET Benchmark☆197Updated last month
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆182Updated 6 months ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆218Updated 7 months ago
- Official repository for ACL 2025 paper "ProcessBench: Identifying Process Errors in Mathematical Reasoning"☆183Updated 8 months ago
- Repository of LV-Eval Benchmark☆73Updated last year
- CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings☆61Updated 11 months ago
- The code for creating the iGSM datasets in papers "Physics of Language Models Part 2.1, Grade-School Math and the Hidden Reasoning Proces…☆84Updated last year
- Repo of paper "Free Process Rewards without Process Labels"☆168Updated 10 months ago
- Bridge Megatron-Core to Hugging Face/Reinforcement Learning☆185Updated this week
- A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models☆71Updated 11 months ago
- A Comprehensive Survey on Long Context Language Modeling☆219Updated 2 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆259Updated 8 months ago
- Resources for the Enigmata Project.☆76Updated 5 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆120Updated 8 months ago