Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty
☆56Dec 17, 2018Updated 7 years ago
Alternatives and similar repositories for PPO-clip-and-PPO-penalty-on-Atari-Domain
Users that are interested in PPO-clip-and-PPO-penalty-on-Atari-Domain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A flexible Multi-Agent Reinforcement Learning (MARL) environment for Collective Robotic Construction (CRC) systems☆13Mar 22, 2023Updated 3 years ago
- 新增一个CBF层,并将其结合进actor网络中,得到safe RL框架。后续验证中发现这种做法并没有实质性的用处,所以不再继续这个项目☆12Mar 14, 2023Updated 3 years ago
- Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>☆19Jun 17, 2021Updated 4 years ago
- NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization☆13Apr 10, 2023Updated 3 years ago
- Reinforcement Leanring Algorithms Trained with Unity☆13Apr 26, 2019Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Introduction to surrogate modeling optimization in wireless networks☆10May 10, 2018Updated 8 years ago
- rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable").☆33Dec 8, 2023Updated 2 years ago
- ☆16May 5, 2022Updated 4 years ago
- ☆13Apr 25, 2023Updated 3 years ago
- Active Learning with Partial Feedback, ICLR 2019☆11Apr 27, 2020Updated 6 years ago
- A Test-Implementation of the IMPALA algorithm (by deepmind 2018)☆35Mar 16, 2018Updated 8 years ago
- Reinforcement Learning Algorithms with Unity 3D Environments☆18Jul 15, 2019Updated 6 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Feb 3, 2022Updated 4 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆17Dec 7, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆28Dec 16, 2022Updated 3 years ago
- Experiments with reinforcement learning using Gym, keras-rl and SUMO☆12Jan 22, 2017Updated 9 years ago
- This is a program to solve NER with HMM. The principles and details can refer to my blog: https://blog.csdn.net/weixin_41679411/article/d…☆11Nov 20, 2018Updated 7 years ago
- ☆24Feb 22, 2023Updated 3 years ago
- Agents code for Multi-Agent Connected Autonomous Driving (MACAD) described in the paper presented in the Machine Learning for Autonomous …☆24Mar 6, 2021Updated 5 years ago
- ☆26May 14, 2019Updated 7 years ago
- Repository replicating the design- and behaviour-adaptation algorithm using reinforcement learning algorithm presented in the paper " Dat…☆27Jul 20, 2022Updated 3 years ago
- ☆20Jun 13, 2022Updated 3 years ago
- Evaluation of TD-MPC2.☆21Jan 21, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆26Jan 13, 2021Updated 5 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆362Jun 2, 2020Updated 5 years ago
- ☆30Aug 20, 2021Updated 4 years ago
- Implementation of PPO for CartPole-v1☆10Jan 1, 2019Updated 7 years ago
- This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algor…☆32Jul 27, 2023Updated 2 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Nov 8, 2018Updated 7 years ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆11Jul 15, 2022Updated 3 years ago
- ☆16May 4, 2021Updated 5 years ago
- [NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".☆136Jan 29, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- Sample-Efficient Automated Deep Reinforcement Learning☆34Mar 17, 2021Updated 5 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Nov 10, 2020Updated 5 years ago
- Implementation of Deep Q-learning from Demonstrations using Keras and a Retro Gym environment.☆14Jul 16, 2018Updated 7 years ago
- Adversarial Imitation Learning from Incomplete Demonstrations☆15Apr 2, 2020Updated 6 years ago
- Machine Learning Course Project Skoltech 2018☆109Feb 11, 2019Updated 7 years ago
- A standard bare-bone ROS Gazebo simulator for the Franka Emika Panda robot built using inbuilt Gazebo ROS controllers and RobotHW interfa…☆11May 3, 2021Updated 5 years ago