Policy Optimization with Penalized Point Probability Distance: an Alternative to Proximal Policy Optimization
☆44Nov 8, 2018Updated 7 years ago
Alternatives and similar repositories for POP3D
Users that are interested in POP3D are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆46Jun 22, 2020Updated 5 years ago
- self implementation of DPPO, Distributed Proximal Policy Optimization, by using tensorflow☆12Sep 1, 2017Updated 8 years ago
- Some C++ libraries I implemented☆15Feb 10, 2018Updated 8 years ago
- Deep reinforcement learning baselines base on OpenAI. More algorithms are included, such as Rainbow: Combining Improvements in Deep Rei…☆35Aug 23, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Dec 23, 2016Updated 9 years ago
- Master's semester project at EPFL: implement a depth map fusion algorithm for structured light.☆11Jan 13, 2017Updated 9 years ago
- ☆25Jan 2, 2019Updated 7 years ago
- ☆73May 24, 2019Updated 6 years ago
- The official github repo for the open online courses: "Dive into LLMs".☆10Mar 15, 2024Updated 2 years ago
- A Tensorflow implementation of Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆32Oct 12, 2017Updated 8 years ago
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- Code for experimenting with state and action abstractions in reinforcement learning.☆30Dec 11, 2020Updated 5 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- Sources for OpenCL and CUDA tutorials. http://jlaning.com☆20Jan 9, 2016Updated 10 years ago
- This is my implementation of the Optimality Tightening☆37Apr 26, 2017Updated 8 years ago
- Code for the paper "Phasic Policy Gradient"☆268Apr 2, 2023Updated 2 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆21Dec 15, 2016Updated 9 years ago
- Distributed A3C☆34Dec 22, 2017Updated 8 years ago
- Upper Confidence Tree Planner for ATARI games☆19Mar 9, 2016Updated 10 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆107Nov 17, 2020Updated 5 years ago
- Matlab code for basic gait generator for students☆10Sep 25, 2020Updated 5 years ago
- ☆18Mar 18, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of a graph edit distance in python☆34May 9, 2019Updated 6 years ago
- DQV-Learning: a novel faster synchronous Deep Reinforcement Learning algorithm☆24Feb 15, 2023Updated 3 years ago
- Solutions to the Deep RL Bootcamp labs☆42Oct 15, 2017Updated 8 years ago
- Code for EMNLP 2021 paper: Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting☆17Nov 30, 2021Updated 4 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆31Aug 20, 2018Updated 7 years ago
- Reinforcement Learning Assignment: Easy21☆12Jul 4, 2016Updated 9 years ago
- Code to reproduce the results of "Curiosity Driven Exploration of Learned Disentangled Goal Spaces"☆19Oct 26, 2018Updated 7 years ago
- A C++/Python simulator package for reinforcement learning☆85Jan 10, 2019Updated 7 years ago
- ☆10Jul 20, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code and pretrained models accompanying the paper "Ensembling geophysical models using Bayesian Neural Networks"☆10Jul 11, 2022Updated 3 years ago
- Repository containing python wrappers for NVIDIA Omniverse Isaac-Sim☆30May 12, 2021Updated 4 years ago
- Tensorflow implementation of the map reading algorithm described in ‘Teaching a Machine to Read Maps with Deep Reinforcement Learning’☆32Nov 14, 2017Updated 8 years ago
- Modular PyTorch implementation of policy gradient methods☆25Nov 15, 2018Updated 7 years ago
- Diversity−Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.☆49Feb 23, 2019Updated 7 years ago
- Finite State Machine Designer☆12Nov 17, 2017Updated 8 years ago
- A gstreamer plugin for image capture from Basler's USB3 cameras. Please read the README before using.☆15Apr 7, 2020Updated 5 years ago