(Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760
☆24 · May 3, 2019 · Updated 7 years ago
Alternatives and similar repositories for merlin
Users who are interested in merlin are comparing it to the libraries listed below.
- Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18. ☆10 · Nov 8, 2018 · Updated 7 years ago
- Navigation agent with Bayesian relational memory in the House3D environment ☆30 · Sep 13, 2019 · Updated 6 years ago
- Implementation of DeepMind's Neural Episodic Control ☆59 · May 9, 2018 · Updated 8 years ago
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments) ☆39 · Feb 13, 2021 · Updated 5 years ago
- ☆14 · Oct 5, 2017 · Updated 8 years ago
- PyTorch implementation of DeepMind's 'Hybrid computing using a neural network with dynamic external memory' (Differentiable Neural Comput… ☆20 · Dec 9, 2017 · Updated 8 years ago
- A collection of code investigating the use of information theory for abstractions in RL ☆16 · Nov 14, 2018 · Updated 7 years ago
- Gym implementation of connector to DeepMind Lab ☆12 · Mar 26, 2019 · Updated 7 years ago
- Code for the paper "D2RL: Deep Dense Architectures for Reinforcement Learning" ☆40 · Jan 22, 2021 · Updated 5 years ago
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale… ☆12 · Aug 24, 2018 · Updated 7 years ago
- MLP-framework (pure numpy) and DDQN-framework for OpenAI's Gym games. +test code for PPO added. +Hindsight Experience Replay (HER) bitfli… ☆19 · May 24, 2018 · Updated 8 years ago
- Symbol Emergence in Robotics tool KIT ☆21 · Nov 15, 2023 · Updated 2 years ago
- ☆10 · Apr 5, 2022 · Updated 4 years ago
- Code for the paper "Residual Policy Learning for Shared Autonomy". ☆17 · Apr 14, 2020 · Updated 6 years ago
- ☆16 · Mar 2, 2019 · Updated 7 years ago
- A simple option critic framework using Q-Learning ☆14 · Feb 7, 2022 · Updated 4 years ago
- Modification of SOMPY repo with robust K-means clustering (bootstrapped SSE elbow method) ☆13 · Apr 6, 2019 · Updated 7 years ago
- Code and additional information for our paper entitled 'Scene Augmentation Methods for Interactive Embodied AI Tasks' ☆10 · Apr 25, 2023 · Updated 3 years ago
- Resources for Auxiliary Tasks and Exploration Enable ObjectNav ☆42 · Oct 22, 2021 · Updated 4 years ago
- python, ccxt, backtrader, dash ☆10 · Apr 20, 2018 · Updated 8 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018 ☆63 · Sep 5, 2018 · Updated 7 years ago
- Implementation of Bidirectional Recurrent Independent Mechanisms (Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural … ☆28 · Nov 11, 2020 · Updated 5 years ago
- Differentiable Neural Computers, Sparse Access Memory and Sparse Differentiable Neural Computers, for PyTorch ☆348 · May 23, 2026 · Updated 3 weeks ago
- Chrome extension to remove the "People also search for" element ☆12 · Apr 16, 2022 · Updated 4 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w… ☆33 · Sep 17, 2018 · Updated 7 years ago
- ☆13 · Dec 12, 2022 · Updated 3 years ago
- Reproducing Random Numbers in Matlab and Python / NumPy ☆11 · Dec 6, 2015 · Updated 10 years ago
- Python implementation of the paper Learning hierarchical relationships for object-goal navigation ☆49 · Dec 8, 2022 · Updated 3 years ago
- Memory Augmented Neural Networks (PyTorch) ☆14 · Sep 2, 2018 · Updated 7 years ago
- An MCP Server for Cosense ☆20 · Dec 22, 2025 · Updated 5 months ago
- A continuous action space version of A3C LSTM in PyTorch plus A3G design ☆259 · Oct 11, 2024 · Updated last year
- Proximal Policy Optimization (PPO) with Intrinsic Curiosity Module (ICM) ☆18 · Apr 15, 2022 · Updated 4 years ago
- Official code for Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells (NeurIPS workshop on Symmetry and Geo… ☆12 · Nov 1, 2022 · Updated 3 years ago
- Show n-hop link destination pages beyond projects ☆10 · Nov 25, 2025 · Updated 6 months ago
- Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong… ☆11 · Jun 18, 2018 · Updated 7 years ago
- Code for Environment Probing Interaction Policies [ICLR 2019] ☆30 · Jun 17, 2019 · Updated 6 years ago
- Code for "Divide-and-Conquer Reinforcement Learning" ☆63 · Jan 8, 2019 · Updated 7 years ago
- TensorFlow code for WACV 2019 paper "Attention Based Natural Language Grounding by Navigating Virtual Environment" - https://arxiv.org/ab… ☆17 · Nov 7, 2018 · Updated 7 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning ☆10 · May 8, 2018 · Updated 8 years ago