Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
☆44Nov 8, 2018Updated 7 years ago
Alternatives and similar repositories for POP3D
Users that are interested in POP3D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆46Jun 22, 2020Updated 5 years ago
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Sep 1, 2017Updated 8 years ago
- Some C++ libraries I implemented☆15Feb 10, 2018Updated 8 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Aug 23, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Neural Response Ranker for Alana, Heriot-Watt University's Alexa Prize Socialbot☆12Nov 21, 2022Updated 3 years ago
- Master's semester project at EPFL: implement a depth map fusion algorithm for structured light.☆11Jan 13, 2017Updated 9 years ago
- ☆25Jan 2, 2019Updated 7 years ago
- ☆75May 24, 2019Updated 7 years ago
- This repository contains some of the code used in the paper "Training Language Models with Langauge Feedback at Scale"☆26Mar 30, 2023Updated 3 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Oct 12, 2017Updated 8 years ago
- Generalised UDRL☆37May 12, 2022Updated 4 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆29Dec 11, 2020Updated 5 years ago
- Sources for OpenCL and CUDA tutorials. http://jlaning.com☆20Jan 9, 2016Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Rex is a JAX-powered framework for sim-to-real robotics.☆54Jun 11, 2025Updated 11 months ago
- This is my implementation of the Optimality Tightening☆37Apr 26, 2017Updated 9 years ago
- Code for the paper "Phasic Policy Gradient"☆267Apr 2, 2023Updated 3 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆21Dec 15, 2016Updated 9 years ago
- Distributed A3C☆34Dec 22, 2017Updated 8 years ago
- Upper Confidence Tree Planner for ATARI games☆19Mar 9, 2016Updated 10 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆107Nov 17, 2020Updated 5 years ago
- 3D learning environment with rigid body simulation for Linux/MacOSX☆14Dec 24, 2021Updated 4 years ago
- Matlab code for basic gait generator for students☆10Sep 25, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆18Apr 17, 2026Updated last month
- Implementation of a graph edit distance in python☆34May 9, 2019Updated 7 years ago
- [ICML'18] Scalable Gaussian Processes with Grid-Structured Eigenfunctions☆20Jul 15, 2022Updated 3 years ago
- Solutions to the Deep RL Bootcamp labs☆42Oct 15, 2017Updated 8 years ago
- Reinforcement Learning Assignment: Easy21☆12Jul 4, 2016Updated 9 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces"☆19Oct 26, 2018Updated 7 years ago
- A C++/Python simulator package for reinforcement learning☆86Jan 10, 2019Updated 7 years ago
- Attention models☆32Nov 30, 2015Updated 10 years ago
- Example of android app written in Qt/Qml which uses MXNet for plant image recognition.☆10Nov 4, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10Jul 20, 2023Updated 2 years ago
- Code and pretrained models accompanying the paper "Ensembling geophysical models using Bayesian Neural Networks"☆10Jul 11, 2022Updated 3 years ago
- Repository containing python wrappers for NVIDIA Omniverse Isaac-Sim☆29May 12, 2021Updated 5 years ago
- Modular PyTorch implementation of policy gradient methods☆24Nov 15, 2018Updated 7 years ago
- Generate compile_commands.json and run clang-tidy with Bazel☆18Jun 23, 2019Updated 6 years ago
- A simple script to recompile arxiv papers into kindle-like format☆29Oct 4, 2023Updated 2 years ago
- Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆49Feb 23, 2019Updated 7 years ago