(Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760
☆24May 3, 2019Updated 6 years ago
Alternatives and similar repositories for merlin
Users that are interested in merlin are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.☆10Nov 8, 2018Updated 7 years ago
- Navigation agent with Bayesian relational memory in the House3D environment☆30Sep 13, 2019Updated 6 years ago
- Implementation of Deepmind's Neural Episodic Control☆58May 9, 2018Updated 7 years ago
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)☆38Feb 13, 2021Updated 5 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Nov 14, 2018Updated 7 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆16Sep 20, 2016Updated 9 years ago
- Gym implementation of connector to Deepmind lab☆12Mar 26, 2019Updated 6 years ago
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…☆12Aug 24, 2018Updated 7 years ago
- RL framework for embodied agents based on PyTorch☆11Apr 11, 2019Updated 6 years ago
- ☆10Apr 5, 2022Updated 3 years ago
- Symbol Emergence in Robotics tool KIT☆21Nov 15, 2023Updated 2 years ago
- Code for the paper "Residual Policy Learning for Shared Autonomy".☆17Apr 14, 2020Updated 5 years ago
- A simple option critic framework using Q-Learning☆14Feb 7, 2022Updated 4 years ago
- Code and additional information for our paper entitled 'Scene Augmentation Methods for Interactive Embodied AI Tasks'☆10Apr 25, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Resources for Auxiliary Tasks and Exploration Enable ObjectNav☆42Oct 22, 2021Updated 4 years ago
- ☆16Oct 17, 2024Updated last year
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆62Sep 5, 2018Updated 7 years ago
- python, ccxt, backtrader, dash☆10Apr 20, 2018Updated 7 years ago
- MatchAttention: Matching the Relative Positions for High-Resolution Cross-View Matching☆22Nov 13, 2025Updated 4 months ago
- Implementation of Bidirectional Recurrent Independent Mechanisms (Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural …☆28Nov 11, 2020Updated 5 years ago
- Differentiable Neural Computers, Sparse Access Memory and Sparse Differentiable Neural Computers, for Pytorch☆347Mar 16, 2026Updated last week
- Chrome extension to remove the "People also search for" element☆12Apr 16, 2022Updated 3 years ago
- Python implementation of the paper Learning hierarchical relationships for object-goal navigation☆48Dec 8, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Memory Augmented Neural Networks (Pytorch)☆14Sep 2, 2018Updated 7 years ago
- An MCP Server for Cosense☆17Dec 22, 2025Updated 3 months ago
- Lecture: Data Compression in Computational Science and Quantum Computing (計算科学・量子計算における情報圧縮)☆13Jan 18, 2023Updated 3 years ago
- Implementation of SNAIL(A Simple Neural Attentive Meta-Learner) with Gluon☆12Feb 22, 2019Updated 7 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆18Apr 15, 2022Updated 3 years ago
- Official code for Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells (NeurIPS workshop on Symmetry and Geo…☆13Nov 1, 2022Updated 3 years ago
- Show n-hop link destination pages beyond projects☆10Nov 25, 2025Updated 4 months ago
- Code for Environment Probing Interaction Policies [ICLR 2019]☆29Jun 17, 2019Updated 6 years ago
- Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong…☆11Jun 18, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code for "Divide-and-Conquer Reinforcement Learning"☆63Jan 8, 2019Updated 7 years ago
- Tensorflow code for WACV 2019 paper "Attention Based Natural Language Grounding by Navigating Virtual Environment" - https://arxiv.org/ab…☆17Nov 7, 2018Updated 7 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆10May 8, 2018Updated 7 years ago
- This is a self-contained memory module for the Dynamic Kanerva Machine, as reported in the NIPS 2018 paper: Learning Attractor Dynamics f…☆44Jan 24, 2019Updated 7 years ago
- Code and data accompanying "Learning Deployable Navigation at Kilometer Scale from a Single Traversal"☆11Jun 15, 2018Updated 7 years ago
- Dataset for Bilingual VLN☆11Dec 5, 2020Updated 5 years ago
- PyTorch Implementation of Generative Query Network☆138Dec 13, 2018Updated 7 years ago