Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
☆44Nov 8, 2018Updated 7 years ago
Alternatives and similar repositories for POP3D
Users that are interested in POP3D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆46Jun 22, 2020Updated 5 years ago
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Sep 1, 2017Updated 8 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Aug 23, 2018Updated 7 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Neural Response Ranker for Alana, Heriot-Watt University's Alexa Prize Socialbot☆12Nov 21, 2022Updated 3 years ago
- ☆25Jan 2, 2019Updated 7 years ago
- A Realtime Frontend Integratable and Configurable Robot Kinematics Simulator (Only for Academic Use)☆16Apr 22, 2025Updated 11 months ago
- ☆74May 24, 2019Updated 6 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆27Mar 30, 2023Updated 3 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Oct 12, 2017Updated 8 years ago
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Dec 11, 2020Updated 5 years ago
- Sources for OpenCL and CUDA tutorials. http://jlaning.com☆20Jan 9, 2016Updated 10 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Rex is a JAX-powered framework for sim-to-real robotics.☆54Jun 11, 2025Updated 10 months ago
- This is my implementation of the Optimality Tightening☆37Apr 26, 2017Updated 8 years ago
- Code for the paper "Phasic Policy Gradient"☆268Apr 2, 2023Updated 3 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆21Dec 15, 2016Updated 9 years ago
- Distributed A3C☆34Dec 22, 2017Updated 8 years ago
- Upper Confidence Tree Planner for ATARI games☆19Mar 9, 2016Updated 10 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆107Nov 17, 2020Updated 5 years ago
- 3D learning environment with rigid body simulation for Linux/MacOSX☆14Dec 24, 2021Updated 4 years ago
- ☆18Mar 18, 2026Updated last month
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [ICML'18] Scalable Gaussian Processes with Grid-Structured Eigenfunctions☆20Jul 15, 2022Updated 3 years ago
- Solutions to the Deep RL Bootcamp labs☆42Oct 15, 2017Updated 8 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆31Aug 20, 2018Updated 7 years ago
- Reinforcement Learning Assignment: Easy21☆12Jul 4, 2016Updated 9 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces"☆19Oct 26, 2018Updated 7 years ago
- A C++/Python simulator package for reinforcement learning☆86Jan 10, 2019Updated 7 years ago
- Attention models☆32Nov 30, 2015Updated 10 years ago
- Example of android app written in Qt/Qml which uses MXNet for plant image recognition.☆10Nov 4, 2017Updated 8 years ago
- ☆10Jul 20, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code and pretrained models accompanying the paper "Ensembling geophysical models using Bayesian Neural Networks"☆10Jul 11, 2022Updated 3 years ago
- Tensorflow implementation of the map reading algorithm described in ‘Teaching a Machine to Read Maps with Deep Reinforcement Learning’☆32Nov 14, 2017Updated 8 years ago
- Modular PyTorch implementation of policy gradient methods☆24Nov 15, 2018Updated 7 years ago
- A simple script to recompile arxiv papers into kindle-like format☆29Oct 4, 2023Updated 2 years ago
- Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆49Feb 23, 2019Updated 7 years ago
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Jul 18, 2023Updated 2 years ago