lmgame-org / GRLLinks

Multi-Turn RL Training System with AgentTrainer for Language Model Game Reinforcement Learning

☆51

Alternatives and similar repositories for GRL

Users that are interested in GRL are comparing it to the libraries listed below

Sorting:

frankxwang / dpo-prefix-sharing
DPO, but faster 🚀
☆46Updated 11 months ago
sail-sg / Precision-RL
Defeating the Training-Inference Mismatch via FP16
☆154Updated last week
Infini-AI-Lab / gsm_infinite
☆55Updated 5 months ago
Infini-AI-Lab / Multiverse
☆103Updated 2 months ago
sail-sg / SkyLadder
The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
☆40Updated last month
sustcsonglin / linear-attention-and-beyond-slides
☆95Updated 8 months ago
tilde-research / MoMoE-impl
Memory optimized Mixture of Experts
☆69Updated 3 months ago
RLsys-Foundation / TritonForge
🔥 LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilation…
☆99Updated 2 weeks ago
Infini-AI-Lab / Kinetics
Kinetics: Rethinking Test-Time Scaling Laws
☆82Updated 4 months ago
PiotrNawrot / sparse-frontier
The evaluation framework for training-free sparse attention in LLMs
☆104Updated last month
li-plus / flash-preference
Accelerate LLM preference tuning via prefix sharing with a single line of code
☆51Updated 4 months ago
shreyansh26 / Attention-Mask-Patterns
Using FlexAttention to compute attention with different masking patterns
☆47Updated last year
NVIDIA-NeMo / Automodel
Pytorch Distributed native training library for LLMs/VLMs with OOTB Hugging Face support
☆179Updated this week
nil0x9 / flash-muon
Flash-Muon: An Efficient Implementation of Muon Optimizer
☆206Updated 5 months ago
OpenNLPLab / LASP
Linear Attention Sequence Parallelism (LASP)
☆87Updated last year
HanGuo97 / log-linear-attention
☆254Updated 5 months ago
Dao-AILab / grouped-latent-attention
☆132Updated 5 months ago
mit-han-lab / flash-moba
☆143Updated last week
sail-sg / SimLayerKV
The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.
☆50Updated last year
yaof20 / Flash-RL
Implementation for FP8/INT8 Rollout for RL training without performence drop.
☆275Updated 2 weeks ago
Zyphra / tree_attention
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
☆130Updated 11 months ago
itsnamgyu / block-transformer
Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)
☆162Updated 7 months ago
liangyuwang / Tiny-FSDP
Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP
☆90Updated 3 months ago
Infini-AI-Lab / GRESO
☆66Updated 4 months ago
ScalingIntelligence / large_language_monkeys
☆109Updated last year
sail-sg / scaling-with-vocab
[NeurIPS-2024] 📈 Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies https://arxiv.org/abs/2407.13623
☆89Updated last year
tilde-research / nsa-impl
An efficient implementation of the NSA (Native Sparse Attention) kernel
☆126Updated 5 months ago
RobertCsordas / moeut
☆88Updated last year
Leooyii / LCEG
Long Context Extension and Generalization in LLMs
☆62Updated last year
SalesforceAIResearch / GemFilter
☆85Updated 2 weeks ago