lmgame-org / GRLLinks
Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning
β47Updated this week
Alternatives and similar repositories for GRL
Users that are interested in GRL are comparing it to the libraries listed below
Sorting:
- DPO, but faster πβ45Updated 10 months ago
- β89Updated 7 months ago
- The evaluation framework for training-free sparse attention in LLMsβ101Updated 3 months ago
- β54Updated 4 months ago
- An efficient implementation of the NSA (Native Sparse Attention) kernelβ119Updated 3 months ago
- Kinetics: Rethinking Test-Time Scaling Lawsβ80Updated 3 months ago
- Flash-Muon: An Efficient Implementation of Muon Optimizerβ193Updated 3 months ago
- Accelerate LLM preference tuning via prefix sharing with a single line of codeβ43Updated 3 months ago
- β96Updated last month
- β251Updated 4 months ago
- Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clustersβ130Updated 10 months ago
- Fast and memory-efficient exact attentionβ70Updated 7 months ago
- The official repository for SkyLadder: Better and Faster Pretraining via Context Window Schedulingβ35Updated last month
- Official implementation for Training LLMs with MXFP4β96Updated 5 months ago
- Linear Attention Sequence Parallelism (LASP)β87Updated last year
- β129Updated 4 months ago
- The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS β¦β60Updated last year
- Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)β161Updated 5 months ago
- Memory optimized Mixture of Expertsβ68Updated 2 months ago
- β52Updated 11 months ago
- M1: Towards Scalable Test-Time Compute with Mamba Reasoning Modelsβ40Updated 2 months ago
- β86Updated 8 months ago
- Using FlexAttention to compute attention with different masking patternsβ45Updated last year
- Fast and memory-efficient exact kmeansβ100Updated last week
- β85Updated last year
- Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDPβ82Updated last month
- Odysseus: Playground of LLM Sequence Parallelismβ77Updated last year
- A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithmβ¦β68Updated last month
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.β49Updated 11 months ago
- Here we will test various linear attention designs.β61Updated last year