Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
☆51May 29, 2024Updated 2 years ago
Alternatives and similar repositories for CleanRL
Users that are interested in CleanRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- awesome-edge-computing,边缘计算各种资料汇总,相关技术资料汇总☆23Nov 8, 2021Updated 4 years ago
- Simple implementation of the model presented in Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic …☆16Jan 22, 2019Updated 7 years ago
- NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization☆13Apr 10, 2023Updated 3 years ago
- Thesis in Federated Learning using an Edge/Cloud Computing architecture☆10Feb 26, 2021Updated 5 years ago
- [KDD 2021] Energy-Efficient 3D Vehicular Crowdsourcing for Disaster Response by Distributed Deep Reinforcement Learning☆19May 18, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [CoRL 2022] Official implementation of the publication Residual Skill Policies: Learning an Adaptable Skill-based Action Space for Reinfo…☆26Jan 3, 2023Updated 3 years ago
- A system for running Multi-Agent Path Finding (MAPF) experiments, with multiple implemented algorithms.☆37Apr 18, 2026Updated last month
- DeepSeek R1 distilled into smaller OSS models for hobbyist☆17Dec 2, 2025Updated 6 months ago
- ☆13May 4, 2023Updated 3 years ago
- The CLI & python API for the well-known project gpt-academic.☆19Sep 22, 2024Updated last year
- Distributed Uplink Beamforming in Cell-Free Networks Using Deep Reinforcement Learning☆10Mar 20, 2021Updated 5 years ago
- RL and MARL from Mobile Edge Computing Load Optimization☆12Jun 28, 2023Updated 2 years ago
- LVI-SAM Chinese comments(LVI-SAM中文注释)☆17Dec 31, 2022Updated 3 years ago
- On-Policy Model-free Reinforcement Learning for simplified Blackjack (David Silver Assignement)☆11Nov 20, 2017Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Autonomous driving agent in Carla simulator leveraging IL and RL techniques.☆28Dec 31, 2024Updated last year
- Research project on Resource-elastic tasks for edge cloud computing☆12Aug 12, 2021Updated 4 years ago
- The model for edge classification by transforming edges to nodes.☆15Dec 22, 2020Updated 5 years ago
- Deep Reinforcement Learning for Dynamic Multicahnnel Access in Wireless Networks☆14Oct 1, 2017Updated 8 years ago
- A fork of ns3 LTE module for reinforcement learning experiments☆13Feb 20, 2017Updated 9 years ago
- 一个微信图形界面调试工具,免去你将程序部署到服务器的麻烦。☆35Jul 4, 2017Updated 8 years ago
- This repository contains a Reinforcement learning algorithm for task scheduling in edge-cloud computing.☆13Feb 10, 2024Updated 2 years ago
- Knowledge makes up the brain☆82Updated this week
- Spatial Transformer Networks in PyTorch☆20Oct 10, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This code simulate effect of using edge computing for NFV.☆13Jan 16, 2020Updated 6 years ago
- Official implementation of the paper "Guidance Graph Optimization for Lifelong Multi-Agent Path Finding", published in IJCAI 2024.☆23Mar 10, 2026Updated 3 months ago
- My notes on reinforcement learning papers☆15Jun 14, 2018Updated 8 years ago
- [CVPR 2025] UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References☆30Apr 28, 2025Updated last year
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆22Apr 17, 2024Updated 2 years ago
- A DRL implementation repo☆23May 7, 2025Updated last year
- This is a repository for paper Sequential Attention Learning for End-to-end Driving☆28Feb 2, 2026Updated 4 months ago
- ☆76Jul 6, 2025Updated 11 months ago
- simulation of "A novel reinforcement learning algorithm for virtual network emb e dding" paper☆18Jan 16, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)☆17Jul 7, 2020Updated 5 years ago
- Genetic algorithms for the placement of services in Fog domains☆14Apr 4, 2022Updated 4 years ago
- 用RLHF可选LoRA对LLaMA和MOSS进行训练|Training LLaMA or MOSS with RLHF [LoRA]☆21May 16, 2023Updated 3 years ago
- 双十一淘宝秒杀☆13Nov 10, 2018Updated 7 years ago
- Apprenticeship Learning with Inverse Reinforcement Learning☆28Aug 14, 2021Updated 4 years ago
- [CVPR'25] UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image☆40May 29, 2025Updated last year
- Pytorch Implementation for First Order Constrained Optimization in Policy Space (FOCOPS).☆29Dec 9, 2021Updated 4 years ago