An introduction to deep reinforcement learning algorithms, with PyTorch implementations
☆77Jul 18, 2024Updated last year
Alternatives and similar repositories for rl-notebook
Users that are interested in rl-notebook are comparing it to the libraries listed below.
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆159Jul 10, 2024Updated last year
- robopal: a multi-platform, modular robot simulation framework based on MuJoCo, mainly used for reinforcement learning and control algori…☆298May 27, 2025Updated last year
- Implementation of Soft Actor-Critic with Hindsight Experience Replay☆21Oct 23, 2020Updated 5 years ago
- ☆16Aug 6, 2024Updated last year
- Path Planning with Reinforcement Learning algorithms in an unknown environment☆21Jan 29, 2026Updated 4 months ago
- ☆33Mar 20, 2025Updated last year
- 2D Grid Environment with common utils (raytracing) and quadrotor dynamics. With Exponential Control Barrier Functions☆13Jun 3, 2020Updated 5 years ago
- Add Flexiv robots to Isaac Sim and control them using Flexiv Elements Studio or Flexiv RDK with the actual force/torque controller used o…☆28Nov 20, 2025Updated 6 months ago
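- A reinforcement learning algorithm library containing code for the current mainstream reinforcement learning algorithms (value-based and policy-based); all code has been debugged and runs☆120Nov 2, 2023Updated 2 years ago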
- The purpose of this project is to implement machine learning methods to study resource allocation problems, that is how to share limited …☆16Jun 7, 2022Updated 3 years ago
- An implementation of many different types of schedules for the learning rate.☆16Nov 7, 2022Updated 3 years ago
- Control inverted pendulum by LQR in OpenAI Gym☆12Oct 2, 2024Updated last year
- RL for path planning☆13Aug 4, 2018Updated 7 years ago
- Exploration of techniques to solve tasks with a Panda robotic arm. Simulation based on PyBullet physics engine and gymnasium.☆10Mar 17, 2025Updated last year
- Quick backbone of Diffusion Policy on Robot Arm by PyBullet☆32Nov 30, 2025Updated 5 months ago
- Code for PolyTask: Learning Unified Policies through Behavior Distillation☆11Oct 13, 2023Updated 2 years ago
- pybullet WBC quadruped robot☆13Apr 5, 2022Updated 4 years ago
- Optimization Plan for Emergency Evacuation of Personnel Based on Optimization Algorithm☆13Sep 4, 2024Updated last year
- MFO 3D path planning☆15Aug 14, 2019Updated 6 years ago
- code for the paper Offline Prioritized Experience Replay☆12Jun 13, 2023Updated 2 years ago
- Mobile Robot Path Planning and Obstacle Avoidance Using PSO in Python☆54Mar 13, 2023Updated 3 years ago
- DRLib: a Concise Deep Reinforcement Learning Library, Integrating HER, PER and D2SR for Almost Off-Policy RL Algorithms.☆561Apr 2, 2024Updated 2 years ago
- Deep reinforcement learning path planning, SAC path planning, Soft Actor-Critic algorithm, SAC-pytorch, Lidar obstacle avoidance, Lidar simulation, Adaptive-SAC☆571Dec 3, 2025Updated 5 months ago
- Python 3.x implementation of the retargeting system presented in the paper "Task Oriented Hand Motion Retargeting for Dexterous Manipulat…☆13Jul 10, 2022Updated 3 years ago
- A Neural Network Architecture for the Analysis of Unlabeled Time-Series Data☆10Jun 25, 2019Updated 6 years ago
- Multi-Agent Deep Recurrent Q-Learning with Bayesian epsilon-greedy on AirSim simulator☆13Apr 1, 2022Updated 4 years ago
- gym_fetch_env with insert drawer open door☆13Mar 22, 2022Updated 4 years ago
- Deep Recurrent Q-Network with different exploration strategies for self-driving cars (using AirSim)☆10Sep 5, 2024Updated last year
- ☆11Feb 17, 2025Updated last year
- Custom Franka Panda packages for pick and place operations☆20Mar 11, 2022Updated 4 years ago
- A cell counter using computer vision techniques.☆10May 13, 2022Updated 4 years ago
- A pathfinding application of the GWO heuristic algorithm☆11Feb 4, 2020Updated 6 years ago
- This repo refers to paper Invariant Transform Experience Replay. And this repo is built on top of OpenAI Baseline. For more information p…☆12Feb 2, 2021Updated 5 years ago
- a flexibility oriented stochastic scheduling framework is presented to evaluate short-term reliability and economic of islanded microgri…☆12Apr 29, 2022Updated 4 years ago
- Motion retarget for legged robots.☆42Aug 24, 2025Updated 9 months ago
- The test code for the paper "Attention-based advantage actor-critic algorithm with prioritized experience replay for complex 2-D robotic …☆10Aug 7, 2022Updated 3 years ago
- environments for reinforcement learning based on panda-gym☆19Aug 22, 2022Updated 3 years ago
- This project utilizes deep reinforcement learning techniques to train a robot, which combines a mobile platform and a Panda robotic arm, …☆11Jun 7, 2023Updated 2 years ago
- trajectory optimization☆14Jun 11, 2021Updated 4 years ago