Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
☆46May 29, 2024Updated last year
Alternatives and similar repositories for CleanRL
Users that are interested in CleanRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- On-Policy Policy Gradient Algorithms in JAX☆42Jan 25, 2024Updated 2 years ago
- Code for Policy Bifurcation in Safe Reinforcement Learning☆10Jul 4, 2025Updated 9 months ago
- Deep Q Network for Multi-agent RL☆15Oct 18, 2020Updated 5 years ago
- Simple implementation of the model presented in Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic …☆16Jan 22, 2019Updated 7 years ago
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models☆17Oct 22, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Relative gradient optimization of the Jacobian term in unsupervised deep learning, NeurIPS 2020☆21Apr 27, 2021Updated 4 years ago
- NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization☆13Apr 10, 2023Updated 3 years ago
- Thesis in Federated Learning using an Edge/Cloud Computing architecture☆10Feb 26, 2021Updated 5 years ago
- ☆20Jan 15, 2024Updated 2 years ago
- Some methods to sampling data points from a given distribution.☆17Jul 16, 2018Updated 7 years ago
- [KDD 2021] Energy-Efficient 3D Vehicular Crowdsourcing for Disaster Response by Distributed Deep Reinforcement Learning☆19May 18, 2022Updated 3 years ago
- Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding (RSS 2021)☆12Oct 22, 2021Updated 4 years ago
- The code for task allocation and the simulation system based on ROS and Gazebo for task allocation are included☆18Jul 15, 2024Updated last year
- [CoRL 2022] Official implementation of the publication Residual Skill Policies: Learning an Adaptable Skill-based Action Space for Reinfo…☆26Jan 3, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- A Monte-Carlo simulator for Mobile Edge/Cloud Computing☆12Aug 22, 2023Updated 2 years ago
- DeepSeek R1 distilled into smaller OSS models☆17Dec 2, 2025Updated 4 months ago
- Python binding to the CavalierContours C++ library☆13Nov 14, 2020Updated 5 years ago
- ☆13May 4, 2023Updated 2 years ago
- Dubin's Vehicle Model in Gym Environment for Path Tracking using RL Algorithms☆16Jun 30, 2021Updated 4 years ago
- Atari-style POMDPs☆27Mar 31, 2026Updated last week
- Distributed Uplink Beamforming in Cell-Free Networks Using Deep Reinforcement Learning☆10Mar 20, 2021Updated 5 years ago
- RL and MARL from Mobile Edge Computing Load Optimization☆12Jun 28, 2023Updated 2 years ago
- LVI-SAM Chinese comments(LVI-SAM中文注释)☆14Dec 31, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The CLI & python API for the well-known project gpt-academic.☆19Sep 22, 2024Updated last year
- Dynamic Attention Encoder-Decoder model to learn and design heuristics to solve capacitated vehicle routing problems☆50Jan 7, 2021Updated 5 years ago
- Autonomous driving agent in Carla simulator leveraging IL and RL techniques.☆25Dec 31, 2024Updated last year
- On-Policy Model-free Reinforcement Learning for simplified Blackjack (David Silver Assignement)☆11Nov 20, 2017Updated 8 years ago
- ☆25Jan 20, 2022Updated 4 years ago
- The model for edge classification by transforming edges to nodes.☆15Dec 22, 2020Updated 5 years ago
- Source code for paper "Decomposition-based Hierarchical Task Allocation and Planning for Multi-Robots under Hierarchical Temporal Logic S…☆23Dec 4, 2025Updated 4 months ago
- A set of communication oriented environments☆32Updated this week
- 多水面无人船的鲁棒协同控制☆25Nov 18, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Demo for the subjective interface☆14Mar 4, 2018Updated 8 years ago
- 基于PPO算法的轨迹规划☆19Apr 11, 2024Updated last year
- 论文一体化写作神器(Python)☆17Apr 11, 2020Updated 5 years ago
- The folder contains NS-3 simulations for Mobility Robustness Optimization in Small Cell Networks☆11Dec 15, 2020Updated 5 years ago
- This repository contains a Reinforcement learning algorithm for task scheduling in edge-cloud computing.☆13Feb 10, 2024Updated 2 years ago
- This code simulate effect of using edge computing for NFV.☆12Jan 16, 2020Updated 6 years ago
- My notes on reinforcement learning papers☆15Jun 14, 2018Updated 7 years ago