Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
☆45May 29, 2024Updated last year
Alternatives and similar repositories for CleanRL
Users that are interested in CleanRL are comparing it to the libraries listed below
Sorting:
- RL-based MPC for Discrete-Time Nonlinear Systems (Python)☆102Dec 2, 2025Updated 3 months ago
- awesome-edge-computing,边缘计算各种资料汇总,相关技术资料汇总☆23Nov 8, 2021Updated 4 years ago
- Benchmark for Multi-robot Cleaning Task Allocation☆14Aug 13, 2023Updated 2 years ago
- Code for Policy Bifurcation in Safe Reinforcement Learning☆10Jul 4, 2025Updated 8 months ago
- Deep Q Network for Multi-agent RL☆15Oct 18, 2020Updated 5 years ago
- Simple implementation of the model presented in Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic …☆17Jan 22, 2019Updated 7 years ago
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models☆17Oct 22, 2024Updated last year
- 萌妹yande.re 正在开发☆11Oct 4, 2018Updated 7 years ago
- A SITL guide for setting up Ardupilot, Gazebo & ROS☆16Jul 27, 2020Updated 5 years ago
- Thesis in Federated Learning using an Edge/Cloud Computing architecture☆10Feb 26, 2021Updated 5 years ago
- Some methods to sampling data points from a given distribution.☆17Jul 16, 2018Updated 7 years ago
- MIGSAA Project 2 - Langevin Monte Carlo Algorithms☆15Jul 25, 2023Updated 2 years ago
- [KDD 2021] Energy-Efficient 3D Vehicular Crowdsourcing for Disaster Response by Distributed Deep Reinforcement Learning☆19May 18, 2022Updated 3 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- The code for task allocation and the simulation system based on ROS and Gazebo for task allocation are included☆18Jul 15, 2024Updated last year
- A Monte-Carlo simulator for Mobile Edge/Cloud Computing☆12Aug 22, 2023Updated 2 years ago
- DeepSeek R1 distilled into smaller OSS models☆17Dec 2, 2025Updated 3 months ago
- Official Implementation of Paper: WMPO: World Model-based Policy Optimization for Vision-Language-Action Models☆184Jan 4, 2026Updated 2 months ago
- Graph Attention-Guided Search for Dense Multi-Agent Pathfinding (AAAI-26)☆30Feb 13, 2026Updated last month
- RL and MARL from Mobile Edge Computing Load Optimization☆12Jun 28, 2023Updated 2 years ago
- LVI-SAM for easier using (更简单的使用LVI-SAM的方法)☆11Dec 5, 2023Updated 2 years ago
- LVI-SAM Chinese comments(LVI-SAM中文注释)☆14Dec 31, 2022Updated 3 years ago
- Dynamic Attention Encoder-Decoder model to learn and design heuristics to solve capacitated vehicle routing problems☆50Jan 7, 2021Updated 5 years ago
- [CVPR2025] Hand-held Object Reconstruction from RGB Video with Dynamic Interaction☆33Sep 1, 2025Updated 6 months ago
- Autonomous driving agent in Carla simulator leveraging IL and RL techniques.☆25Dec 31, 2024Updated last year
- On-Policy Model-free Reinforcement Learning for simplified Blackjack (David Silver Assignement)☆11Nov 20, 2017Updated 8 years ago
- ☆25Jan 20, 2022Updated 4 years ago
- The model for edge classification by transforming edges to nodes.☆15Dec 22, 2020Updated 5 years ago
- Source code for paper "Decomposition-based Hierarchical Task Allocation and Planning for Multi-Robots under Hierarchical Temporal Logic S…☆23Dec 4, 2025Updated 3 months ago
- A fork of ns3 LTE module for reinforcement learning experiments☆13Feb 20, 2017Updated 9 years ago
- Collection of URDF files and generation scripts for various object sets.☆14Apr 21, 2023Updated 2 years ago
- Simulator for evaluating cloud/edge requests from connected vehicles and computing statistical analysis of the input network☆11May 7, 2018Updated 7 years ago
- 一个微信图形界面调试工具,免去你将程序部署到服务器的麻烦。☆35Jul 4, 2017Updated 8 years ago
- Gaussian Blief Propagation☆24Mar 5, 2023Updated 3 years ago
- Spatial Transformer Networks in PyTorch☆20Oct 10, 2021Updated 4 years ago
- This code simulate effect of using edge computing for NFV.☆12Jan 16, 2020Updated 6 years ago
- [CVPR 2025] UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References☆26Apr 28, 2025Updated 10 months ago
- My notes on reinforcement learning papers☆15Jun 14, 2018Updated 7 years ago
- A dataset for traffic accident analysis in the US☆29Apr 17, 2025Updated 11 months ago