cxxgtxy / POP3D
Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
☆44Nov 8, 2018Updated 7 years ago
Alternatives and similar repositories for POP3D
Users that are interested in POP3D are comparing it to the libraries listed below
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Neural Response Ranker for Alana, Heriot-Watt University's Alexa Prize Socialbot☆13Nov 21, 2022Updated 3 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆46Jun 22, 2020Updated 5 years ago
- A personal implementation of DPPO (Distributed Proximal Policy Optimization) in TensorFlow☆12Sep 1, 2017Updated 8 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 7 years ago
- Master's semester project at EPFL: implement a depth map fusion algorithm for structured light.☆11Jan 13, 2017Updated 9 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Oct 12, 2017Updated 8 years ago
- 3D learning environment with rigid body simulation for Linux/MacOSX☆14Dec 24, 2021Updated 4 years ago
- Reinforcement Learning Assignment: Easy21☆12Jul 4, 2016Updated 9 years ago
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆31Aug 20, 2018Updated 7 years ago
- Deep reinforcement learning baselines based on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Aug 23, 2018Updated 7 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Dec 23, 2016Updated 9 years ago
- My implementation of Optimality Tightening☆37Apr 26, 2017Updated 8 years ago
- GPT implementation in Flax☆18Jan 8, 2022Updated 4 years ago
- Rex is a JAX-powered framework for sim-to-real robotics.☆52Jun 11, 2025Updated 8 months ago
- ☆72May 24, 2019Updated 6 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆107Nov 17, 2020Updated 5 years ago
- [ICML'18] Scalable Gaussian Processes with Grid-Structured Eigenfunctions☆20Jul 15, 2022Updated 3 years ago
- Sources for OpenCL and CUDA tutorials. http://jlaning.com☆20Jan 9, 2016Updated 10 years ago
- Simplistic Pytorch Implementation of the Dreamer-RL☆20May 7, 2025Updated 9 months ago
- Upper Confidence Tree Planner for ATARI games☆19Mar 9, 2016Updated 9 years ago
- A2C for GVG-AI☆23Nov 7, 2018Updated 7 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces"☆19Oct 26, 2018Updated 7 years ago
- ☆25Jan 2, 2019Updated 7 years ago
- A Realtime Frontend Integratable and Configurable Robot Kinematics Simulator (Only for Academic Use)☆16Apr 22, 2025Updated 9 months ago
- Solutions to the Deep RL Bootcamp labs☆43Oct 15, 2017Updated 8 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆24Feb 15, 2023Updated 3 years ago
- just for fun☆23Sep 10, 2017Updated 8 years ago
- Reinforcement learning in 3D.☆21Mar 29, 2017Updated 8 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Hierarchical Self-Play☆21Dec 5, 2018Updated 7 years ago
- Code for the paper "Phasic Policy Gradient"☆267Apr 2, 2023Updated 2 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago
- OpenAI Gym environment for DART robotics simulator.☆22Apr 17, 2018Updated 7 years ago
- Population Based Training, Figure 2☆25Dec 2, 2017Updated 8 years ago
- ☆26Jul 19, 2019Updated 6 years ago
- Diversity-Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆50Feb 23, 2019Updated 6 years ago
- Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization☆24Jun 24, 2020Updated 5 years ago