A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithms with minimal intrusion.
☆101Aug 25, 2025Updated 6 months ago
Alternatives and similar repositories for RLite
Users that are interested in RLite are comparing it to the libraries listed below
Sorting:
- Asynchronous pipeline parallel optimization☆19Feb 2, 2026Updated last month
- Reproducing R1 for Code with Reliable Rewards☆290May 5, 2025Updated 10 months ago
- EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store"☆19Mar 8, 2025Updated 11 months ago
- Implementation for the paper: CMoE: Fast Carving of Mixture-of-Experts for Efficient LLM Inference☆35Mar 6, 2025Updated last year
- An LLM leaderboard for stateful agents☆20Oct 16, 2025Updated 4 months ago
- ☆11Jan 10, 2025Updated last year
- Nex Venus Communication Library☆72Nov 17, 2025Updated 3 months ago
- NeurIPS 2020 paper: UnModNet: Learning to Unwrap a Modulo Image for High Dynamic Range Imaging☆10Oct 24, 2021Updated 4 years ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆165Feb 11, 2026Updated 3 weeks ago
- Paper-reading notes for Berkeley OS prelim exam.☆14Aug 28, 2024Updated last year
- ☆63Oct 17, 2023Updated 2 years ago
- ☆90Oct 30, 2025Updated 4 months ago
- Less Is More: Training-Free Sparse Attention with Global Locality for Efficient Reasoning☆29Sep 12, 2025Updated 5 months ago
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆3,586Updated this week
- DeeperGEMM: crazy optimized version☆74May 5, 2025Updated 10 months ago
- ☆335May 24, 2025Updated 9 months ago
- Triton Implementation of Flash Attention with Bias.☆21Apr 16, 2025Updated 10 months ago
- Vortex: A Flexible and Efficient Sparse Attention Framework☆48Jan 21, 2026Updated last month
- [NeurIPS 2024] Search for Efficient LLMs☆16Jan 16, 2025Updated last year
- A neural network layer API and library for sequence modeling, designed for easy creation of sequence models that can be executed layerwis…☆56Feb 20, 2026Updated 2 weeks ago
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆248Jul 13, 2025Updated 7 months ago
- Tiny-Megatron, a minimalistic re-implementation of the Megatron library☆23Sep 1, 2025Updated 6 months ago
- AI model training on heterogeneous, geo-distributed resources☆38Nov 24, 2025Updated 3 months ago
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning☆77Nov 4, 2025Updated 4 months ago
- ☆54Mar 15, 2025Updated 11 months ago
- crystalnet -- a mini core AI library (being refactored, see https://github.com/lgarithm/stdnn-ops)☆17Oct 1, 2019Updated 6 years ago
- Scalable toolkit for efficient model reinforcement☆1,372Updated this week
- qwen-nsa☆87Oct 14, 2025Updated 4 months ago
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆129Jun 24, 2025Updated 8 months ago
- Rust crate for some audio utilities☆27Mar 8, 2025Updated 11 months ago
- ☆20Nov 21, 2017Updated 8 years ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆331Apr 24, 2025Updated 10 months ago
- [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.☆59Jul 21, 2025Updated 7 months ago
- This repo is used for archiving my notes, codes and materials of cs learning.☆80Updated this week
- extensible collectives library in triton☆96Mar 31, 2025Updated 11 months ago
- ☆21Aug 30, 2025Updated 6 months ago
- Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP☆99Aug 20, 2025Updated 6 months ago
- ☆34Jun 22, 2024Updated last year
- A practical way of learning Swizzle☆37Feb 3, 2025Updated last year