rlite-project / RLiteLinks
A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithms with minimal intrusion.
β98Updated 5 months ago
Alternatives and similar repositories for RLite
Users that are interested in RLite are comparing it to the libraries listed below
Sorting:
- Implementation for FP8/INT8 Rollout for RL training without performence drop.β287Updated 2 months ago
- π₯ LLM-powered GPU kernel synthesis: Train models to convert PyTorch ops into optimized Triton kernels via SFT+RL. Multi-turn compilationβ¦β115Updated 2 months ago
- Bridge Megatron-Core to Hugging Face/Reinforcement Learningβ185Updated last week
- Odysseus: Playground of LLM Sequence Parallelismβ79Updated last year
- Async pipelined version of Verlβ124Updated 9 months ago
- Accelerate LLM preference tuning via prefix sharing with a single line of codeβ51Updated 6 months ago
- (best/better) practices of megatron on veRL and tuning guideβ124Updated 4 months ago
- β133Updated 7 months ago
- APRIL: Active Partial Rollouts in Reinforcement Learning to Tame Long-tail Generation. A system-level optimization for scalable LLM traβ¦β46Updated 3 months ago
- β129Updated 7 months ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automatonβ39Updated 11 months ago
- [Archived] For the latest updates and community contribution, please visit: https://github.com/Ascend/TransferQueue or https://gitcode.coβ¦β13Updated last week
- [ASPLOS'26] Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafterβ128Updated last month
- Super-Efficient RLHF Training of LLMs with Parameter Reallocationβ330Updated 9 months ago
- [NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoningβ61Updated 2 months ago
- Estimate MFU for DeepSeekV3β26Updated last year
- β110Updated 4 months ago
- β220Updated 2 months ago
- Nano repo for RL training of LLMsβ70Updated 2 months ago
- Toolchain built around the Megatron-LM for Distributed Trainingβ84Updated last month
- Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMsβ203Updated last month
- Triton implementation of Flash Attention2.0β47Updated 2 years ago
- ByteCheckpoint: An Unified Checkpointing Library for LFMsβ264Updated last month
- Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDPβ93Updated 5 months ago
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.β52Updated last year
- MiroRL is an MCP-first reinforcement learning framework for deep research agent.β225Updated 5 months ago
- [ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Lengthβ147Updated last month
- β117Updated 8 months ago
- [ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Trainingβ257Updated 5 months ago
- NexRL is an ultra-loosely-coupled LLM post-training framework.β68Updated last week