lmgame-org / GRL
Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning
☆59 Updated last month
Alternatives and similar repositories for GRL
Users who are interested in GRL are comparing it to the libraries listed below.
- Defeating the Training-Inference Mismatch via FP16 ☆182 Updated 2 months ago
- The evaluation framework for training-free sparse attention in LLMs ☆117 Updated 2 weeks ago
- ☆105 Updated 11 months ago
- DPO, but faster 🚀 ☆47 Updated last year
- Kinetics: Rethinking Test-Time Scaling Laws ☆86 Updated 7 months ago
- Memory optimized Mixture of Experts ☆73 Updated 6 months ago
- Esoteric Language Models ☆111 Updated this week
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters ☆131 Updated last year
- ☆111 Updated 5 months ago
- ☆77 Updated this week
- ☆270 Updated 8 months ago
- 🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation… ☆116 Updated 3 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling ☆42 Updated last month
- An efficient implementation of the NSA (Native Sparse Attention) kernel ☆128 Updated 7 months ago
- ☆63 Updated 8 months ago
- ☆112 Updated last year
- PyTorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support ☆288 Updated this week
- Implementation of FP8/INT8 rollout for RL training without performance drop. ☆289 Updated 3 months ago
- ☆84 Updated 3 months ago
- Flash-Muon: An Efficient Implementation of Muon Optimizer ☆233 Updated 7 months ago
- ☆131 Updated 8 months ago
- [Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments ☆177 Updated last month
- Spectral Sphere Optimizer ☆96 Updated 3 weeks ago
- ☆54 Updated last year
- The official GitHub repo for "Diffusion Language Models are Super Data Learners". ☆221 Updated 3 months ago
- ☆221 Updated 2 months ago
- Benchmarking Optimizers for LLM Pretraining ☆49 Updated last month
- P1: Mastering Physics Olympiads with Reinforcement Learning ☆73 Updated last month
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508) ☆65 Updated 2 weeks ago
- Physics of Language Models: Part 4.2, Canon Layers at Scale where Synthetic Pretraining Resonates in Reality ☆317 Updated last month