Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty
☆56Dec 17, 2018Updated 7 years ago
Alternatives and similar repositories for PPO-clip-and-PPO-penalty-on-Atari-Domain
Users that are interested in PPO-clip-and-PPO-penalty-on-Atari-Domain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains scenarios from different source for training and testing autonomous vehicles.☆26Mar 20, 2023Updated 3 years ago
- A flexible Multi-Agent Reinforcement Learning (MARL) environment for Collective Robotic Construction (CRC) systems☆13Mar 22, 2023Updated 3 years ago
- 新增一个CBF层,并将其结合进actor网络中,得到safe RL框架。后续验证中发现这种做法并没有实质性的用处,所以不再继续这个项目☆12Mar 14, 2023Updated 3 years ago
- Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>☆19Jun 17, 2021Updated 5 years ago
- NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization☆13Apr 10, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Reinforcement Leanring Algorithms Trained with Unity☆13Apr 26, 2019Updated 7 years ago
- rlplot is an easy to use and highly encapsulated RL plot library (including basic error bar lineplot and a wrapper to "rliable").☆33Dec 8, 2023Updated 2 years ago
- ☆13Apr 25, 2023Updated 3 years ago
- A Test-Implementation of the IMPALA algorithm (by deepmind 2018)☆35Mar 16, 2018Updated 8 years ago
- ☆17Oct 18, 2022Updated 3 years ago
- The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)☆26Feb 3, 2022Updated 4 years ago
- Model and some path schedule solutions specifically for UAV-sensorTime-sensitive Network. The code is used to evaluate the models' perfor…☆17May 16, 2020Updated 6 years ago
- Experiments with reinforcement learning using Gym, keras-rl and SUMO☆12Jan 22, 2017Updated 9 years ago
- E-MAML, and RL-MAML baseline implemented in Tensorflow v1☆17Dec 7, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆28Dec 16, 2022Updated 3 years ago
- This is a program to solve NER with HMM. The principles and details can refer to my blog: https://blog.csdn.net/weixin_41679411/article/d…☆11Nov 20, 2018Updated 7 years ago
- ☆24Feb 22, 2023Updated 3 years ago
- Running inference on the ZeroSCROLLS benchmark☆22Apr 18, 2024Updated 2 years ago
- Agents code for Multi-Agent Connected Autonomous Driving (MACAD) described in the paper presented in the Machine Learning for Autonomous …☆24Mar 6, 2021Updated 5 years ago
- ☆22Mar 28, 2025Updated last year
- ☆26May 14, 2019Updated 7 years ago
- Repository replicating the design- and behaviour-adaptation algorithm using reinforcement learning algorithm presented in the paper " Dat…☆27Jul 20, 2022Updated 3 years ago
- ☆20Jun 13, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Code for the NeurIPS 2023 Paper: Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Sta…☆30Oct 29, 2023Updated 2 years ago
- ☆26Jan 13, 2021Updated 5 years ago
- The IP-Adapter training scripts and inference for Flux Model, which is implemented based on X-Lab☆17Oct 1, 2024Updated last year
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆363Jun 2, 2020Updated 6 years ago
- ☆30Aug 20, 2021Updated 4 years ago
- Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization☆44Nov 8, 2018Updated 7 years ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆12Jul 15, 2022Updated 3 years ago
- ☆16May 4, 2021Updated 5 years ago
- [NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".☆136Jan 29, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- Sample-Efficient Automated Deep Reinforcement Learning☆34Mar 17, 2021Updated 5 years ago
- Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238☆48Nov 10, 2020Updated 5 years ago
- Adversarial Imitation Learning from Incomplete Demonstrations☆15Apr 2, 2020Updated 6 years ago
- Machine Learning Course Project Skoltech 2018☆109Feb 11, 2019Updated 7 years ago
- A standard bare-bone ROS Gazebo simulator for the Franka Emika Panda robot built using inbuilt Gazebo ROS controllers and RobotHW interfa…☆11May 3, 2021Updated 5 years ago
- Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…☆38Feb 5, 2019Updated 7 years ago