LinZichuan/emdqn

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LinZichuan/emdqn)

LinZichuan / emdqn

Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018

☆63

Alternatives and similar repositories for emdqn

Users that are interested in emdqn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

astier / model-free-episodic-control
View on GitHub
Model-Free-Episodic-Control implementation.
☆17Jun 3, 2019Updated 7 years ago
EndingCredits / Neural-Episodic-Control
View on GitHub
Implementation of Deepmind's Neural Episodic Control
☆59May 9, 2018Updated 8 years ago
MouseHu / GEM
View on GitHub
☆16Jul 1, 2021Updated 5 years ago
wenh123 / NoisyNet-DQN
View on GitHub
Tensorflow Implementation for "Noisy network for exploration"
☆32Jul 17, 2017Updated 9 years ago
DuaneNielsen / rnd
View on GitHub
Exploration by Random Network Distillation
☆15Dec 30, 2018Updated 7 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Kaixhin / EC
View on GitHub
Episodic Control
☆22Sep 20, 2022Updated 3 years ago
suyoung-lee / Episodic-Backward-Update
View on GitHub
Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.
☆16Sep 24, 2019Updated 6 years ago
schatty / EMAC
View on GitHub
[IJCAI 2021] Solving Continuous Control with Episodic Memory
☆15Apr 10, 2022Updated 4 years ago
sudeepraja / Model-Free-Episodic-Control
View on GitHub
Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460
☆52Jul 25, 2016Updated 10 years ago
jxwuyi / HouseNavAgent
View on GitHub
Navigation agent with Bayesian relational memory in the House3D environment
☆30Sep 13, 2019Updated 6 years ago
haoliuhl / taming-maml
View on GitHub
Taming MAML: efficient unbiased meta-reinforcement learning
☆30Sep 30, 2022Updated 3 years ago
quanvuong / Supervised_Policy_Update
View on GitHub
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17Dec 8, 2022Updated 3 years ago
uber-research / D3G
View on GitHub
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Feb 21, 2020Updated 6 years ago
Mehooz / awesome-long-horizon-goal-reaching
View on GitHub
Personal reading list for learning-based long-horizon goal reaching methods
☆17Nov 26, 2020Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
jannerm / mbpo
View on GitHub
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
☆558Nov 22, 2022Updated 3 years ago
nnaisense / 2017-learning-to-run
View on GitHub
The Winning Solution for the Learning To Run Challenge 2017
☆60Jul 4, 2018Updated 8 years ago
yanlai00 / bridge_data_imitation_learning
View on GitHub
☆22Oct 4, 2021Updated 4 years ago
seungyulhan / disc
View on GitHub
☆10Aug 17, 2022Updated 3 years ago
lucasBertola / Connect-4-Gym-env-Reinforcement-learning
View on GitHub
Connect Four Environment is a project designed for training reinforcement learning models to play the classic Connect4 game. It's compati…
☆18Sep 18, 2023Updated 2 years ago
yosider / merlin
View on GitHub
(Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760
☆25May 3, 2019Updated 7 years ago
Breakend / MultiStepBootstrappingInRL
View on GitHub
Here, we compare Q(\sigma) learning presented by Sutton and Barto in [1] to Tree-Backup, n-step Expected Sarsa, and n-step Sarsa.
☆15Feb 17, 2017Updated 9 years ago
xkianteb / dril
View on GitHub
Disagreement-Regularized Imitation Learning
☆30May 25, 2021Updated 5 years ago
bonniesjli / DQN_SR
View on GitHub
Count based exploration with the successor representation for Unity ML's Pyramid
☆12Jun 19, 2019Updated 7 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
uber-research / Evolvability-ES
View on GitHub
☆14Jun 26, 2019Updated 7 years ago
google-research / episodic-curiosity
View on GitHub
Tensorflow/Keras code and trained models for Episodic Curiosity Through Reachability
☆205Oct 2, 2020Updated 5 years ago
MishaLaskin / curl
View on GitHub
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
☆605Oct 28, 2020Updated 5 years ago
kazizzad / BDQN-MxNet-Gluon
View on GitHub
Efficient Exploration through Bayesian Deep Q-Networks
☆38Feb 14, 2018Updated 8 years ago
RuohanW / RED
View on GitHub
Implementation of Random Expert Distillation
☆29May 11, 2019Updated 7 years ago
illidanlab / rpg
View on GitHub
Ranking Policy Gradient
☆23Nov 27, 2019Updated 6 years ago
jachiam / surprise
View on GitHub
Surprise-based intrinsic motivation for deep reinforcement learning
☆21Mar 6, 2017Updated 9 years ago
nhynes / abc
View on GitHub
SeqGAN but with more bells and whistles
☆24Feb 15, 2018Updated 8 years ago
NiloFreitas / Deep-RL-and-IL
View on GitHub
A unified framework of Deep Reinforcement Learning and Deep Imitation Learning in simulation environments
☆15Nov 11, 2019Updated 6 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
denisyarats / exorl
View on GitHub
ExORL: Exploratory Data for Offline Reinforcement Learning
☆138Feb 8, 2022Updated 4 years ago
cbfinn / maml_rl
View on GitHub
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
☆668Jan 19, 2023Updated 3 years ago
uncharted-technologies / robust-domain-randomization
View on GitHub
Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"
☆12Nov 22, 2022Updated 3 years ago
alexlee-gk / slac
View on GitHub
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
☆154Oct 26, 2020Updated 5 years ago
rll-research / finetune-vs-metarl
View on GitHub
☆14May 31, 2022Updated 4 years ago
gjp1203 / nui_in_madrl
View on GitHub
Negative Update Intervals in Multi-Agent Deep Reinforcement Learning
☆35May 14, 2019Updated 7 years ago
denisyarats / drq
View on GitHub
DrQ: Data regularized Q
☆423Jan 13, 2023Updated 3 years ago