PPO implementation for OpenAI gym environment based on Unity ML Agents
☆150 · Mar 17, 2018 · Updated 8 years ago
Alternatives and similar repositories for PPO
Users that are interested in PPO are comparing it to the libraries listed below.
- Deep Reinforcement Learning DQN on Unity ML Agent ☆11 · Sep 2, 2018 · Updated 7 years ago
- A SpaceX Rocket Lander environment for OpenAI gym using Box2D ☆305 · Jan 19, 2021 · Updated 5 years ago
- Keras Implementation of PPO to solve OpenAI Gym Environments ☆16 · May 15, 2018 · Updated 7 years ago
- Landing a SpaceX Falcon Heavy Rocket in simulation using Reinforcement learning. Reinforcement learning is a technique that lets an agent… ☆17 · Jan 16, 2023 · Updated 3 years ago
- Proximal Policy Optimization implementation with TensorFlow ☆108 · Oct 9, 2018 · Updated 7 years ago
- AI for a Maze Game using Unity ML-Agent. ☆17 · Dec 18, 2018 · Updated 7 years ago
- Learning Continuous Control in Deep Reinforcement Learning ☆14 · Nov 24, 2018 · Updated 7 years ago
- Tank game to experiment with the Unity ML-Agents Toolkit ☆10 · Apr 28, 2020 · Updated 6 years ago
- Tensorflow implementation of SNAIL and RL2 ☆11 · Aug 17, 2019 · Updated 6 years ago
- ☆12 · Jun 9, 2018 · Updated 7 years ago
- Implementations of deep RL papers and random experimentation ☆178 · Apr 7, 2018 · Updated 8 years ago
- Implementation of proximal policy optimization(PPO) with tensorflow ☆35 · Feb 10, 2018 · Updated 8 years ago
- Project under CSF407 - AI ☆13 · Jun 24, 2024 · Updated last year
- Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment ☆107 · Jun 7, 2019 · Updated 6 years ago
- Solving MuJoCo environments with Deep Deterministic Policy Gradients ☆14 · Sep 17, 2018 · Updated 7 years ago
- This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld ☆13 · Jul 13, 2020 · Updated 5 years ago
- TensorFlow implementation of Value Iteration Networks (VIN): Clean, Simple and Modular ☆52 · Mar 27, 2017 · Updated 9 years ago
- A PyTorch implementation of SSINet. ☆16 · Nov 10, 2020 · Updated 5 years ago
- Ant Gather and Ant Maze envs, separated from RLLab ☆11 · Aug 2, 2018 · Updated 7 years ago
- ☆11 · Apr 20, 2021 · Updated 5 years ago
- A utility for creating Heatmap images in Unity. ☆24 · Oct 5, 2013 · Updated 12 years ago
- Utility AI powered bot in Unity3D 🔫 ☆13 · Nov 17, 2017 · Updated 8 years ago
- Simple, extensible implementations of some meta-learning algorithms in Jax ☆11 · Oct 6, 2020 · Updated 5 years ago
- Reinforcement learning with unsupervised auxiliary tasks ☆23 · Jan 10, 2019 · Updated 7 years ago
- Implementation of "Successive Convexification for 6-DoF Mars Rocket Powered Landing with Free-Final-Time" ☆79 · Feb 21, 2019 · Updated 7 years ago
- MuJoCo Models for Personal Robot 2 (PR2) ☆11 · Aug 25, 2018 · Updated 7 years ago
- Ranking Policy Gradient ☆23 · Nov 27, 2019 · Updated 6 years ago
- Code for ICML25 paper "HYGMA: Hypergraph Coordination Networks with Dynamic Grouping for Multi-Agent Reinforcement Learning" ☆23 · Nov 11, 2025 · Updated 5 months ago
- The lite edition of 微信跳一跳(JumpJump) developed by Unity with AI developed by ml-agents. ☆33 · Jan 30, 2018 · Updated 8 years ago
- A Test-Implementation of the IMPALA algorithm (by deepmind 2018) ☆35 · Mar 16, 2018 · Updated 8 years ago
- Test to make an AI walk using Unity ML-Agents plugin. ☆14 · Dec 19, 2017 · Updated 8 years ago
- A trajectory optimization algorithm that doesn't require dynamics derivatives ☆36 · Dec 13, 2016 · Updated 9 years ago
- Clockwork VAEs in JAX/Flax ☆32 · Jul 16, 2021 · Updated 4 years ago
- A C++ Package for Solving Multiple-Phase Optimal Control Problem Using Adaptive Radau Pseudospectral Methods ☆10 · Aug 31, 2020 · Updated 5 years ago
- A puzzle game modeled after LD41 ☆11 · Oct 14, 2019 · Updated 6 years ago
- Code for the paper "Minimum-Delay Adaptation in Non-Stationary Reinforcement Learning via Online High-Confidence Change-Point Detection" ☆11 · Aug 7, 2023 · Updated 2 years ago
- Minimal and Clean Reinforcement Learning Examples in PyTorch ☆41 · Dec 25, 2018 · Updated 7 years ago
- Examples of published reinforcement learning algorithms in recent literature implemented in TensorFlow ☆103 · Aug 3, 2020 · Updated 5 years ago
- Distributed DRL by Ray and TensorFlow Tutorial. ☆10 · Dec 26, 2019 · Updated 6 years ago