ChengTsang/PPO-clip-and-PPO-penalty-on-Atari-Domain

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ChengTsang/PPO-clip-and-PPO-penalty-on-Atari-Domain)

ChengTsang / PPO-clip-and-PPO-penalty-on-Atari-Domain

Implement PPO-clip and PPO-penalty on Atari, which is the only open source of PPO-penalty

☆56

Alternatives and similar repositories for PPO-clip-and-PPO-penalty-on-Atari-Domain

Users that are interested in PPO-clip-and-PPO-penalty-on-Atari-Domain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

metadriverse / metadrive-scenario
View on GitHub
This repo contains scenarios from different source for training and testing autonomous vehicles.
☆26Mar 20, 2023Updated 3 years ago
Tencent-RoboticsX / CraftEnv
View on GitHub
A flexible Multi-Agent Reinforcement Learning (MARL) environment for Collective Robotic Construction (CRC) systems
☆13Mar 22, 2023Updated 3 years ago
CHH3213 / safeRL-CBF
View on GitHub
新增一个CBF层，并将其结合进actor网络中，得到safe RL框架。后续验证中发现这种做法并没有实质性的用处，所以不再继续这个项目
☆12Mar 14, 2023Updated 3 years ago
apexrl / CoDAIL
View on GitHub
Implementation of CoDAIL in the ICLR 2020 paper <Multi-Agent Interactions Modeling with Correlated Policies>
☆19Jun 17, 2021Updated 5 years ago
zmsn-2077 / CUP-safe-rl
View on GitHub
NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization
☆13Apr 10, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
ewine-project / SUMO_optimization
View on GitHub
Introduction to surrogate modeling optimization in wireless networks
☆10May 10, 2018Updated 8 years ago
PKU-MARL / TRPO-PPO-in-MARL
View on GitHub
☆16May 5, 2022Updated 4 years ago
Kayne0401 / Robust-Decision-Making-Framework
View on GitHub
☆13Apr 25, 2023Updated 3 years ago
peiyunh / alpf
View on GitHub
Active Learning with Partial Feedback, ICLR 2019
☆11Apr 27, 2020Updated 6 years ago
ducandu / RL-Implementation-IMPALA
View on GitHub
A Test-Implementation of the IMPALA algorithm (by deepmind 2018)
☆35Mar 16, 2018Updated 8 years ago
BlueFisher / RL-PPO-with-Unity
View on GitHub
Reinforcement Learning Algorithms with Unity 3D Environments
☆18Jul 15, 2019Updated 6 years ago
RobeSafe-UAH / rl-intersections
View on GitHub
☆17Oct 18, 2022Updated 3 years ago
uoe-agents / derl
View on GitHub
The official repository of Decoupled Reinforcement Learning to Stabilise Intrinsically-Motivated Exploration" (AAMAS 2022)
☆26Feb 3, 2022Updated 4 years ago
bstriner / traffic-rl
View on GitHub
Experiments with reinforcement learning using Gym, keras-rl and SUMO
☆12Jan 22, 2017Updated 9 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
geyang / e-maml
View on GitHub
E-MAML, and RL-MAML baseline implemented in Tensorflow v1
☆17Dec 7, 2019Updated 6 years ago
arjunbhorkar / ReViND
View on GitHub
☆28Dec 16, 2022Updated 3 years ago
tau-nlp / zero_scrolls
View on GitHub
Running inference on the ZeroSCROLLS benchmark
☆22Apr 18, 2024Updated 2 years ago
praveen-palanisamy / macad-agents
View on GitHub
Agents code for Multi-Agent Connected Autonomous Driving (MACAD) described in the paper presented in the Machine Learning for Autonomous …
☆24Mar 6, 2021Updated 5 years ago
ksluck / Coadaptation
View on GitHub
Repository replicating the design- and behaviour-adaptation algorithm using reinforcement learning algorithm presented in the paper " Dat…
☆27Jul 20, 2022Updated 3 years ago
tdmpc2 / tdmpc2-eval
View on GitHub
Evaluation of TD-MPC2.
☆21Jan 21, 2024Updated 2 years ago
abukharin3 / ERNIE
View on GitHub
Code for the NeurIPS 2023 Paper: Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Sta…
☆29Oct 29, 2023Updated 2 years ago
Grottoh / Deep-Active-Inference-for-Partially-Observable-MDPs
View on GitHub
☆26Jan 13, 2021Updated 5 years ago
clf28 / x-flux-ip-adapter
View on GitHub
The IP-Adapter training scripts and inference for Flux Model, which is implemented based on X-Lab
☆17Oct 1, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
pat-coady / trpo
View on GitHub
Trust Region Policy Optimization with TensorFlow and OpenAI Gym
☆363Jun 2, 2020Updated 6 years ago
morning9393 / Optimal-Baseline-for-Multi-agent-Policy-Gradients
View on GitHub
☆30Aug 20, 2021Updated 4 years ago
akjayant / mbppol
View on GitHub
This repository has code for the paper "Model-based Safe Deep Reinforcement Learning via a Constrained Proximal Policy Optimization Algor…
☆32Jul 27, 2023Updated 2 years ago
jlwu002 / BCL
View on GitHub
[ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum
☆12Jul 15, 2022Updated 3 years ago
antonai91 / reinforcement_learning
View on GitHub
☆14May 4, 2021Updated 5 years ago
decisionforce / CoPO
View on GitHub
[NeurIPS 2021] Official implementation of paper "Learning to Simulate Self-driven Particles System with Coordinated Policy Optimization".
☆136Jan 29, 2024Updated 2 years ago
apexrl / EBIL-torch
View on GitHub
Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>
☆12Oct 8, 2021Updated 4 years ago
automl / SEARL
View on GitHub
Sample-Efficient Automated Deep Reinforcement Learning
☆34Mar 17, 2021Updated 5 years ago
YYCAAA / V-MPO_Lunarlander
View on GitHub
Simple implementation of V-MPO proposed in https://arxiv.org/abs/1909.12238
☆48Nov 10, 2020Updated 5 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
AurelianTactics / dqfd-with-keras
View on GitHub
Implementation of Deep Q-learning from Demonstrations using Keras and a Retro Gym environment.
☆14Jul 16, 2018Updated 7 years ago
mingfeisun / agail
View on GitHub
Adversarial Imitation Learning from Incomplete Demonstrations
☆15Apr 2, 2020Updated 6 years ago
pgermain / PAC-Bayesian-Theory-Meets-Bayesian-Inference
View on GitHub
Code to related to my NIPS 2016 paper
☆10Dec 4, 2016Updated 9 years ago
proceduralia / high_replay_ratio_continuous_control
View on GitHub
Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier"
☆28May 22, 2023Updated 3 years ago
Gladys-Zhao / mRNN-mLSTM
View on GitHub
Code for ICML 2020 paper: Do RNN and LSTM have Long Memory?
☆17Jan 6, 2021Updated 5 years ago
martisak / dict2uml
View on GitHub
Python library that prints a dict as PlantUML code.
☆12Dec 8, 2022Updated 3 years ago
osudrl / RSS-2020-learning-memory-based-control
View on GitHub
Code for recreating the results of our RSS 2020 paper, 'Learning Memory-Based Control for Human-Scale Bipedal Locomotion.'
☆10Aug 18, 2022Updated 3 years ago