Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018
☆62Sep 5, 2018Updated 7 years ago
Alternatives and similar repositories for emdqn
Users that are interested in emdqn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- Implementation of Deepmind's Neural Episodic Control☆58May 9, 2018Updated 7 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆35Mar 6, 2021Updated 5 years ago
- ☆15Jul 1, 2021Updated 4 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning"☆39Jan 22, 2021Updated 5 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Tensorflow Implementation for "Noisy network for exploration"☆31Jul 17, 2017Updated 8 years ago
- Exploration by Random Network Distillation☆15Dec 30, 2018Updated 7 years ago
- Episodic Control☆22Sep 20, 2022Updated 3 years ago
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- [IJCAI 2021] Solving Continuous Control with Episodic Memory☆15Apr 10, 2022Updated 3 years ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆54Jul 25, 2016Updated 9 years ago
- Navigation agent with Bayesian relational memory in the House3D environment☆30Sep 13, 2019Updated 6 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning☆30Sep 30, 2022Updated 3 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- Pytorch implementation of Human-Level Control through Deep Reinforcement Learning☆11May 31, 2017Updated 8 years ago
- Personal reading list for learning-based long-horizon goal reaching methods☆17Nov 26, 2020Updated 5 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆537Nov 22, 2022Updated 3 years ago
- The Winning Solution for the Learning To Run Challenge 2017☆60Jul 4, 2018Updated 7 years ago
- ☆22Oct 4, 2021Updated 4 years ago
- ☆10Aug 17, 2022Updated 3 years ago
- Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021)☆54Jul 7, 2021Updated 4 years ago
- (Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760☆24May 3, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Disagreement-Regularized Imitation Learning☆30May 25, 2021Updated 4 years ago
- Count based exploration with the successor representation for Unity ML's Pyramid☆12Jun 19, 2019Updated 6 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- ☆13Apr 4, 2023Updated 2 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Feb 14, 2018Updated 8 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning☆600Oct 28, 2020Updated 5 years ago
- ☆16Oct 3, 2023Updated 2 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Implementation of Random Expert Distillation☆29May 11, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 9 years ago
- SeqGAN but with more bells and whistles☆24Feb 15, 2018Updated 8 years ago
- ☆22Dec 31, 2019Updated 6 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆129Feb 8, 2022Updated 4 years ago
- Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"☆664Jan 19, 2023Updated 3 years ago
- Task-Focused Few-Shot Object Detection Benchmark☆14Jun 24, 2025Updated 9 months ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"☆12Nov 22, 2022Updated 3 years ago