Reinforcement Learning algorithms and use-cases, including DQN, PG, A3C, PPO etc. and RLHF, AlphaZero implementations. Designed for clarity, ease of use, and educational purposes.
☆47May 29, 2024Updated last year
Alternatives and similar repositories for CleanRL
Users that are interested in CleanRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- awesome-edge-computing,边缘计算各种资料汇总,相关技术资料汇总☆23Nov 8, 2021Updated 4 years ago
- Deep Q Network for Multi-agent RL☆15Oct 18, 2020Updated 5 years ago
- NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models☆17Oct 22, 2024Updated last year
- Relative gradient optimization of the Jacobian term in unsupervised deep learning, NeurIPS 2020☆21Apr 27, 2021Updated 5 years ago
- NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization☆13Apr 10, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Thesis in Federated Learning using an Edge/Cloud Computing architecture☆10Feb 26, 2021Updated 5 years ago
- ☆20Jan 15, 2024Updated 2 years ago
- MIGSAA Project 2 - Langevin Monte Carlo Algorithms☆15Jul 25, 2023Updated 2 years ago
- [KDD 2021] Energy-Efficient 3D Vehicular Crowdsourcing for Disaster Response by Distributed Deep Reinforcement Learning☆19May 18, 2022Updated 3 years ago
- [CoRL 2022] Official implementation of the publication Residual Skill Policies: Learning an Adaptable Skill-based Action Space for Reinfo…☆26Jan 3, 2023Updated 3 years ago
- A system for running Multi-Agent Path Finding (MAPF) experiments, with multiple implemented algorithms.☆37Apr 18, 2026Updated 3 weeks ago
- A Monte-Carlo simulator for Mobile Edge/Cloud Computing☆12Aug 22, 2023Updated 2 years ago
- Use Muon optimizer instead of AdamW.☆50Mar 2, 2026Updated 2 months ago
- ☆14Nov 13, 2025Updated 5 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- DeepSeek R1 distilled into smaller OSS models☆17Dec 2, 2025Updated 5 months ago
- Offline RLHF codebase implementation for "Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human …☆42Mar 26, 2024Updated 2 years ago
- Graph Attention-Guided Search for Dense Multi-Agent Pathfinding (AAAI-26)☆35Feb 13, 2026Updated 2 months ago
- The CLI & python API for the well-known project gpt-academic.☆19Sep 22, 2024Updated last year
- Distributed Uplink Beamforming in Cell-Free Networks Using Deep Reinforcement Learning☆10Mar 20, 2021Updated 5 years ago
- LVI-SAM Chinese comments(LVI-SAM中文注释)☆17Dec 31, 2022Updated 3 years ago
- Dynamic Attention Encoder-Decoder model to learn and design heuristics to solve capacitated vehicle routing problems☆50Jan 7, 2021Updated 5 years ago
- Research project on Resource-elastic tasks for edge cloud computing☆12Aug 12, 2021Updated 4 years ago
- ☆25Jan 20, 2022Updated 4 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Deep Reinforcement Learning for Dynamic Multicahnnel Access in Wireless Networks☆14Oct 1, 2017Updated 8 years ago
- [CVPR2025] Hand-held Object Reconstruction from RGB Video with Dynamic Interaction☆32Sep 1, 2025Updated 8 months ago
- Collection of URDF files and generation scripts for various object sets.☆15Apr 21, 2023Updated 3 years ago
- Demo for the subjective interface☆14Mar 4, 2018Updated 8 years ago
- 一个微信图形界面调试工具,免去你将程序部署到服务器的麻烦。☆35Jul 4, 2017Updated 8 years ago
- 论文一体化写作神器(Python)☆17Apr 11, 2020Updated 6 years ago
- The folder contains NS-3 simulations for Mobility Robustness Optimization in Small Cell Networks☆11Dec 15, 2020Updated 5 years ago
- Official implementation of the paper "Guidance Graph Optimization for Lifelong Multi-Agent Path Finding", published in IJCAI 2024.☆20Mar 10, 2026Updated last month
- Official code for the ICLR 2025 paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"☆29Dec 1, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Using Reinforcement Learning method to realize load balancing control in dynamic cellular network☆15Sep 22, 2018Updated 7 years ago
- Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization☆23Apr 17, 2024Updated 2 years ago
- Noise Contrastive Estimation (NCE) in PyTorch☆32Mar 2, 2025Updated last year
- ☆75Jul 6, 2025Updated 10 months ago
- TAPAS is a tool for rapid prototyping of adaptive streaming algorithms and video streaming traffic generation☆17May 7, 2019Updated 7 years ago
- simulation of "A novel reinforcement learning algorithm for virtual network emb e dding" paper☆18Jan 16, 2020Updated 6 years ago
- Exploring algorithms in the domain of offline reinforcement learning (REM, Ensemble-DQN, DQN, ...)☆17Jul 7, 2020Updated 5 years ago