qingshi9974 / PPO-pytorch-Mujoco
Implements the PPO algorithm on MuJoCo environments such as Ant-v2, Humanoid-v2, Hopper-v2, and HalfCheetah-v2.
☆56 · Jun 30, 2020 · Updated 5 years ago
Alternatives and similar repositories for PPO-pytorch-Mujoco
Users that are interested in PPO-pytorch-Mujoco are comparing it to the libraries listed below
- In this project, I explore some typical value-based and policy-based RL algorithms. I do experiments on DQN and its six variants and thei… ☆12 · Nov 18, 2020 · Updated 5 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019 ☆10 · Mar 24, 2023 · Updated 2 years ago
- Implementation of Trust Region Policy Optimization and Proximal Policy Optimization algorithms on the objective of Robot Walk. ☆12 · Mar 9, 2021 · Updated 4 years ago
- PPO implementation of Humanoid-v2 from Open-AI gym ☆27 · Mar 25, 2023 · Updated 2 years ago
- A minimal codebase for PPO training on MuJoCo environments with some customization supports. ☆17 · May 17, 2022 · Updated 3 years ago
- Transfer learning in deep reinforcement learning for continuous control. Implemented DDPG and TD3 algorithms and evaluated ability to ada… ☆17 · Feb 25, 2025 · Updated 11 months ago
- A minimal example of optimal transport with Input Convex Neural Networks in Pytorch ☆24 · Dec 22, 2021 · Updated 4 years ago
- CFR implementation of a poker bot. ☆12 · Feb 17, 2023 · Updated 3 years ago
- Fictitious Self-play & Reinforcement Learning ☆18 · Jan 26, 2018 · Updated 8 years ago
- SEIKO is a novel reinforcement learning method to efficiently fine-tune diffusion models in an online setting. Our methods outperform all… ☆29 · Jul 18, 2024 · Updated last year
- A PyTorch implementation of Advantage weighted Actor-Critic (AWAC) ☆56 · Mar 30, 2021 · Updated 4 years ago
- simple code to reinforcement learning ☆20 · Aug 30, 2020 · Updated 5 years ago
- Code for RRL (https://sites.google.com/view/abstractions4rl) ☆27 · Jan 21, 2022 · Updated 4 years ago
- Reading List ☆35 · Jul 16, 2023 · Updated 2 years ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code… ☆28 · Mar 24, 2023 · Updated 2 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms ☆13 · Dec 15, 2022 · Updated 3 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat… ☆13 · Nov 27, 2022 · Updated 3 years ago
- A MATLAB function library containing encoders, decoders and weight enumerators for Reed-Muller codes. ☆11 · Aug 19, 2023 · Updated 2 years ago
- A Cassie walking simulation test example based on cassie-mujoco-sim, adapted with reference to gym-cassie ☆28 · May 7, 2023 · Updated 2 years ago
- (AAAI'2019) The codes, models, logs, and data for an extended paper of the original paper "On Reinforcement Learning for Full-length Game… ☆30 · Oct 5, 2022 · Updated 3 years ago
- Proximal Policy Optimization (Continuous Version) in PyTorch. ☆29 · May 12, 2025 · Updated 9 months ago
- Course paper for "Medical and Health Data Analysis and Mining" (2021): a BERT-based news classification experiment on the 20NewsGroups dataset ☆10 · Jun 22, 2021 · Updated 4 years ago
- Assignments for CS294-112. ☆30 · Sep 11, 2019 · Updated 6 years ago
- Some microbenchmarks and design docs before commencement ☆12 · Feb 1, 2021 · Updated 5 years ago
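- Backend for a Traditional Chinese Medicine smart diagnosis and treatment mini program ☆10 · Aug 28, 2022 · Updated 3 years ago
- China Robot and Artificial Intelligence Competition, all-terrain adaptive robot track ☆12 · Apr 26, 2023 · Updated 2 years ago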
- SAC, PPO, A2C implementation on Mujoco environments: Humanoid-v4, Ant-v4, Cheetah-v4. Includes reward manipulation. ☆34 · Sep 1, 2025 · Updated 5 months ago
- Transformer-based World Models ☆88 · Apr 4, 2023 · Updated 2 years ago
- [IEEE ICASSP 2021] "A fast randomized adaptive CP decomposition for streaming tensors". In 46th IEEE International Conference on Acoustic… ☆11 · Feb 16, 2023 · Updated 3 years ago
- Official URDF and SDF models of the R1 humanoid robot. ☆16 · Dec 6, 2023 · Updated 2 years ago
- ☆10 · Jun 29, 2022 · Updated 3 years ago
- Exploring the minimal architecture required for coherent English language generation. ☆12 · Mar 5, 2025 · Updated 11 months ago
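- Mini program for a medical escort (patient accompaniment) service ☆12 · May 25, 2024 · Updated last year
- TensorRT deployment of the ultra-lightweight UltraFace model, with detailed comments ☆10 · Apr 2, 2024 · Updated last year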
- ☆11 · Nov 20, 2024 · Updated last year
- A QA system based on k8s-specific knowledge, built on ChatGLM2-6B and served by Ray. ☆10 · Sep 14, 2023 · Updated 2 years ago
- A scheduler to manage a multi tool dual arm robot while avoiding arm-to-arm collisions; considering complex side constraints; and optimiz… ☆11 · Jul 6, 2021 · Updated 4 years ago
- 🌿 Quickly generate a folder directory tree; supports setting the directory depth and writing the output to a markdown file. ☆13 · Oct 19, 2022 · Updated 3 years ago
- AI path planning and controller for formations of drones. ☆14 · Apr 8, 2021 · Updated 4 years ago