深度强化学习各算法介绍与Pytorch实现
☆78Jul 18, 2024Updated last year
Alternatives and similar repositories for rl-notebook
Users that are interested in rl-notebook are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆159Jul 10, 2024Updated last year
- robopal: a multi-platform, modular robot simulation framework based on MuJoCo, mainly used for reinforcement learning and control algori…☆298May 27, 2025Updated 10 months ago
- 基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)☆151Jan 23, 2026Updated 2 months ago
- RLBench simulation project for autonomous bin picking using Pandas robot arm☆10Mar 1, 2021Updated 5 years ago
- Implementation of Soft Actor-Critic with Hindsight Experience Replay☆21Oct 23, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆16Aug 6, 2024Updated last year
- Repository for "Latent Conditioned Loco-Manipulation Using Motion Priors"☆27Sep 23, 2025Updated 6 months ago
- Path Planning with Reinforcement Learning algorithms in an unknown environment☆19Jan 29, 2026Updated 2 months ago
- ☆31Mar 20, 2025Updated last year
- 2D Grid Environment with common utils (raytracing) and quadrotor dynamics. With Exponential Control Barrier Functions☆13Jun 3, 2020Updated 5 years ago
- Add Flexiv robots to Isaac Sim and control them using Flexiv Elements Studio or Flexiv RDK with the actual force/torque controller used o…☆26Nov 20, 2025Updated 4 months ago
- 利用深度强化学习的方法实现多智能体间离散无交流的障碍避免。其中强化学习算法训练模型所需的数据集由最优互惠碰撞避免(Optimal Reciprocal Collision Avoidance, ORCA)算法生成。☆90Mar 14, 2019Updated 7 years ago
- The purpose of this project is to implement machine learning methods to study resource allocation problems, that is how to share limited …☆16Jun 7, 2022Updated 3 years ago
- An implementation of many different types of schedules for the learning rate.☆16Nov 7, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Control inverted pendulum by LQR in OpenAI Gym☆12Oct 2, 2024Updated last year
- 本仓库包含了完整的深度学习应用开发流程,以经典的手写字符识别为例,基于LeNet网络构建。推理部分使用torch、onnxruntime以及openvino框架💖☆17Nov 13, 2025Updated 5 months ago
- RL for path planning☆13Aug 4, 2018Updated 7 years ago
- Exploration of techniques to solve tasks with a Panda robotic arm. Simulation based on PyBullet physics engine and gymnasium.☆10Mar 17, 2025Updated last year
- Quick backbone of Diffusion Policy on Robot Arm by PyBullet☆32Nov 30, 2025Updated 4 months ago
- ☆10Oct 20, 2021Updated 4 years ago
- Code for PolyTask: Learning Unified Policies through Behavior Distillation☆11Oct 13, 2023Updated 2 years ago
- pybullet WBC quadruped robot☆13Apr 5, 2022Updated 4 years ago
- MFO 3D path planning☆15Aug 14, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- code for the paper Offline Prioritized Experience Replay☆12Jun 13, 2023Updated 2 years ago
- Mobile Robot Path Planning and Obstacle Avoidance Using PSO in Python☆54Mar 13, 2023Updated 3 years ago
- Python 3.x implementation of the retargeting system presented in the paper "Task Oriented Hand Motion Retargeting for Dexterous Manipulat…☆13Jul 10, 2022Updated 3 years ago
- Multi-Agent Deep Recurrent Q-Learning with Bayesian epsilon-greedy on AirSim simulator☆13Apr 1, 2022Updated 4 years ago
- A set of Matlab/Octave files that performs a method of Nonlinear System Identification.☆25Oct 26, 2018Updated 7 years ago
- gym_fetch_env with insert drawer open door☆13Mar 22, 2022Updated 4 years ago
- ☆11Feb 17, 2025Updated last year
- Deep Recurrent Q-Network with different exploration strategies for self-driving cars (using AirSim)☆10Sep 5, 2024Updated last year
- Custom Franka Panda packages for pick and place operations☆20Mar 11, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- The fkie_husky_manipulation_simulation package simulates a husky robot base and a manipulator arm such as a panda arm.☆10Dec 13, 2022Updated 3 years ago
- A cell counter using computer vision techniques.☆10May 13, 2022Updated 3 years ago
- Source code of "Variational Imitation Learning with Diverse-quality Demonstrations" in ICML 2020. This github repository includes python …☆20Aug 16, 2021Updated 4 years ago
- A pathfinding application of the GWO heuristic algorithm☆11Feb 4, 2020Updated 6 years ago
- This repo refers to paper Invariant Transform Experience Replay. And this repo is built on top of OpenAI Baseline. For more information p…☆12Feb 2, 2021Updated 5 years ago
- Motion retarget for legged robots.☆42Aug 24, 2025Updated 7 months ago
- The test code for the paper "Attention-based advantage actor-critic algorithm with prioritized experience replay for complex 2-D robotic …☆10Aug 7, 2022Updated 3 years ago