A lightweight reinforcement learning framework that integrates seamlessly into your codebase, empowering developers to focus on algorithms with minimal intrusion.
☆102Aug 25, 2025Updated 7 months ago
Alternatives and similar repositories for RLite
Users that are interested in RLite are comparing it to the libraries listed below.
- EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store"☆19Mar 8, 2025Updated last year
- Asynchronous pipeline parallel optimization☆19Feb 2, 2026Updated last month
- Paper-reading notes for Berkeley OS prelim exam.☆14Aug 28, 2024Updated last year
- Reproducing R1 for Code with Reliable Rewards☆297May 5, 2025Updated 10 months ago
- Nex Venus Communication Library☆74Nov 17, 2025Updated 4 months ago
- Cute layout visualization☆33Jan 18, 2026Updated 2 months ago
- ☆87Aug 16, 2025Updated 7 months ago
- ☆21Aug 30, 2025Updated 6 months ago
- An LLM leaderboard for stateful agents☆21Oct 16, 2025Updated 5 months ago
- NVSHMEM‑Tutorial: Build a DeepEP‑like GPU Buffer☆172Feb 11, 2026Updated last month
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆4,855Mar 20, 2026Updated last week
- [COLM 2025] Official repository for R2E-Gym: Procedural Environment Generation and Hybrid Verifiers for Scaling Open-Weights SWE Agents☆254Jul 13, 2025Updated 8 months ago
- Tiny-FSDP, a minimalistic re-implementation of the PyTorch FSDP☆101Aug 20, 2025Updated 7 months ago
- Official Repo for Open-Reasoner-Zero☆2,086Jun 2, 2025Updated 9 months ago
- A neural network layer API and library for sequence modeling, designed for easy creation of sequence models that can be executed layerwis…☆56Mar 11, 2026Updated 2 weeks ago
- ☆93Oct 30, 2025Updated 4 months ago
- ☆44Nov 1, 2025Updated 4 months ago
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning☆79Nov 4, 2025Updated 4 months ago
- Vortex: A Flexible and Efficient Sparse Attention Framework☆49Jan 21, 2026Updated 2 months ago
- [ACL25' Findings] SWE-Dev is an SWE agent with a scalable test case construction pipeline.☆59Jul 21, 2025Updated 8 months ago
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆132Jun 24, 2025Updated 9 months ago
- ☆338May 24, 2025Updated 10 months ago
- ☆453Aug 10, 2025Updated 7 months ago
- Super-Efficient RLHF Training of LLMs with Parameter Reallocation☆333Apr 24, 2025Updated 11 months ago
- Triton Implementation of Flash Attention with Bias.☆22Apr 16, 2025Updated 11 months ago
- Penn CIS 5650 (GPU Programming and Architecture) Final Project☆44Dec 11, 2023Updated 2 years ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- (WIP) Long-form speech generation☆31Apr 2, 2025Updated 11 months ago
- Rust crate for some audio utilities☆27Mar 8, 2025Updated last year
- TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.☆106Jun 28, 2025Updated 8 months ago
- Scalable toolkit for efficient model reinforcement☆1,447Updated this week
- ☆29Jul 24, 2025Updated 8 months ago
- Measuring how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours☆240Mar 10, 2026Updated 2 weeks ago
- JAX Implementations of Descript Audio Codec and EnCodec☆34Mar 30, 2025Updated 11 months ago
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,713Updated this week
- Safe SLAC, an algorithm for safe cost-constrained reinforcement learning in high-dimensional POMDPs.☆11Mar 1, 2023Updated 3 years ago
- ☆34Jul 29, 2025Updated 7 months ago
- a website from fduers and for fduers☆18Mar 9, 2025Updated last year
- A fast communication-overlapping library for tensor/expert parallelism on GPUs.☆1,273Aug 28, 2025Updated 6 months ago