Model-Free-Episodic-Control implementation.
☆17Jun 3, 2019Updated 6 years ago
Alternatives and similar repositories for model-free-episodic-control
Users that are interested in model-free-episodic-control are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆62Sep 5, 2018Updated 7 years ago
- Implementation of Deepmind's Neural Episodic Control☆58May 9, 2018Updated 7 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆54Jul 25, 2016Updated 9 years ago
- Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation☆11Jul 26, 2016Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Reproducing Policy Distillation (DeepMind paper ICLR 2016)☆22Feb 17, 2020Updated 6 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- ☆13Nov 17, 2015Updated 10 years ago
- Pytorch implementation of Human-Level Control through Deep Reinforcement Learning☆11May 31, 2017Updated 8 years ago
- ☆13Apr 3, 2019Updated 6 years ago
- Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability☆204Oct 2, 2020Updated 5 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- Reference implementation of algorithms for reinforcement learning and Markov decision processes.☆12Jan 28, 2021Updated 5 years ago
- A2C for GVG-AI☆23Nov 7, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Feb 14, 2018Updated 8 years ago
- Episodic Control☆22Sep 20, 2022Updated 3 years ago
- ☆22Nov 8, 2021Updated 4 years ago
- FLUIDS is a lightweight driving simulator for benchmarking Deep Reinforcement and Imitation learning algorithms.☆24May 3, 2019Updated 6 years ago
- A re-implementation of the Pommerman environment in C++☆11Oct 6, 2021Updated 4 years ago
- Personal reading list for learning-based long-horizon goal reaching methods☆17Nov 26, 2020Updated 5 years ago
- Deep Reinforcement Learning with Fined Grained Action Repetition☆22Jan 8, 2018Updated 8 years ago
- AlphaGo Zero Reinforcement Learning Sokoban Solver☆11Jun 20, 2018Updated 7 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Code for VIREL: A Variational Inference Framework for Reinforcement Learning☆14Dec 1, 2019Updated 6 years ago
- ☆15Sep 22, 2023Updated 2 years ago
- ☆33Oct 17, 2018Updated 7 years ago
- Variable-order CRFs with structure learning☆17Aug 1, 2024Updated last year
- Co-training for Policy Learning☆13Aug 8, 2019Updated 6 years ago
- The Reinforcement-Learning-Related Papers of ICLR 2019☆47May 28, 2019Updated 6 years ago
- Option Critic with subgoal discovery by spectral decomposition of the Successor Features Matrix or clustering in Successor features space…☆24Nov 29, 2018Updated 7 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)☆10Jul 6, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- The tvseg software is a library and GUI tool image segmentation.☆21May 31, 2015Updated 10 years ago
- Using WoLF (win or learn fast) PHC (policy hill climbing) algorithm to implement stochastic games☆15Jun 14, 2019Updated 6 years ago
- Pytorch Implementation of AAMAS 2021 paper <Energy-Based Imitation Learning>☆11Oct 8, 2021Updated 4 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆54May 15, 2019Updated 6 years ago
- Pytorch implementation of Planar Flow☆17Dec 2, 2019Updated 6 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Code for the paper "Skynet: A Top Deep RL Agent in the Inaugural Pommerman Team Competition"☆37May 9, 2019Updated 6 years ago