Implement PPO algorithm on mujoco environment,such as Ant-v2, Humanoid-v2, Hopper-v2, Halfcheeth-v2.
☆56Jun 30, 2020Updated 5 years ago
Alternatives and similar repositories for PPO-pytorch-Mujoco
Users that are interested in PPO-pytorch-Mujoco are comparing it to the libraries listed below
Sorting:
- PPO, DDPG, SAC implementation on mujoco environment☆125Feb 16, 2022Updated 4 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 2 years ago
- This is pytorch version of maddpg.☆10Jun 23, 2020Updated 5 years ago
- Implementation of Trust Region Policy Optimization and Proximal Policy Optimization algorithms on the objective of Robot Walk.☆12Mar 9, 2021Updated 5 years ago
- PPO implementation of Humanoid-v2 from Open-AI gym☆27Mar 25, 2023Updated 2 years ago
- A minimal example of optimal transport with Input Convex Neural Networks in Pytorch☆24Dec 22, 2021Updated 4 years ago
- Fictitious Self-play & Reinforcement Learning☆18Jan 26, 2018Updated 8 years ago
- SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all…☆30Jul 18, 2024Updated last year
- A PyTorch implementation of Advantage weighted Actor-Critic (AWAC)☆56Mar 30, 2021Updated 4 years ago
- Code for RRL (https://sites.google.com/view/abstractions4rl)☆27Jan 21, 2022Updated 4 years ago
- simple code to reinforcement learning☆20Aug 30, 2020Updated 5 years ago
- OpenAI MountainCar-v0 DeepRL-based solutions (DQN, DuelingDQN, D3QN)☆24Aug 11, 2021Updated 4 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- A MATLAB function library containing encoders, decoders and weight enumerators for Reed-Muller codes.☆11Aug 19, 2023Updated 2 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆28Mar 24, 2023Updated 2 years ago
- Reading List☆35Jul 16, 2023Updated 2 years ago
- (AAAI'2019) The codes, models, logs, and data for an extended paper of the original paper "On Reinforcement Learning for Full-length Game…☆32Oct 5, 2022Updated 3 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch.☆29May 12, 2025Updated 9 months ago
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- 中医智慧诊疗小程序后端☆10Aug 28, 2022Updated 3 years ago
- Assignments for CS294-112.☆30Sep 11, 2019Updated 6 years ago
- 中国机器人及人工智能大赛全地形自适应机器人赛道☆12Apr 26, 2023Updated 2 years ago
- SAC, PPO, A2C implementation on Mujoco environments : Humanoid-v4, Ant-v4, Cheetah-v4 . Includes reward manipulation.☆34Sep 1, 2025Updated 6 months ago
- 超轻量化模型的UltraFace tensorrt部署 ,详细注释☆10Apr 2, 2024Updated last year
- Official URDF and SDF models of the R1 humanoid robot.☆16Dec 6, 2023Updated 2 years ago
- An analog, transistor-level simulation of an 8-bit CPU in SPICE☆13Jul 29, 2021Updated 4 years ago
- 抖音自动发布/上传视频脚本。Pyautogui☆10Aug 7, 2023Updated 2 years ago
- 🚀全流程自己训练一个VLA 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!☆27Oct 16, 2025Updated 4 months ago
- A scheduler to manage a multi tool dual arm robot while avoiding arm-to-arm collisions; considering complex side constraints; and optimiz…☆11Jul 6, 2021Updated 4 years ago
- ☆11Nov 20, 2024Updated last year
- A QA system based on k8s-specific knowledge build on ChatGLM2-6B, serving by Ray.☆10Sep 14, 2023Updated 2 years ago
- ☆10Mar 9, 2023Updated 3 years ago
- 医疗陪诊服务小程序☆12May 25, 2024Updated last year
- To simulate an image based visual servo in Gazebo using a camera mounted to a Doosan m0609 robotic arm manipulator and 4 points in Aruco …☆15Jan 10, 2024Updated 2 years ago
- A Texas Holdem poker framework written in C++ 20.☆11Apr 23, 2023Updated 2 years ago
- 基于adaboost的SVM预测股票价格☆11Mar 4, 2018Updated 8 years ago
- All Final Code to Operate Surena-V Humanoid Robot☆13Aug 26, 2025Updated 6 months ago
- 🌿快速生成文件夹目录结构,支持定义目录层级,支持生成到 markdown 文件。☆13Oct 19, 2022Updated 3 years ago
- ☆10Oct 26, 2022Updated 3 years ago