(Personal experiment) Unsupervised Predictive Memory in a Goal-Directed Agent https://arxiv.org/abs/1803.10760
☆24May 3, 2019Updated 7 years ago
Alternatives and similar repositories for merlin
Users that are interested in merlin are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Exploring by Minimizing Uncertainty of Q values (EMU-Q) as presented in "Bayesian RL for Goal-Only Rewards" at CoRL'18.☆10Nov 8, 2018Updated 7 years ago
- Navigation agent with Bayesian relational memory in the House3D environment☆30Sep 13, 2019Updated 6 years ago
- PyTorch implementation of SAC-Q Reinforcement Learning Algorithm (tested on OpenAI Gym environments)☆38Feb 13, 2021Updated 5 years ago
- ☆14Oct 5, 2017Updated 8 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆16Nov 14, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Gym implementation of connector to Deepmind lab☆12Mar 26, 2019Updated 7 years ago
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale…☆12Aug 24, 2018Updated 7 years ago
- RL framework for embodied agents based on PyTorch☆11Apr 11, 2019Updated 7 years ago
- Symbol Emergence in Robotics tool KIT☆21Nov 15, 2023Updated 2 years ago
- Code for the paper "Residual Policy Learning for Shared Autonomy".☆17Apr 14, 2020Updated 6 years ago
- ☆16Mar 2, 2019Updated 7 years ago
- A simple option critic framework using Q-Learning☆14Feb 7, 2022Updated 4 years ago
- Resources for Auxiliary Tasks and Exploration Enable ObjectNav☆42Oct 22, 2021Updated 4 years ago
- ☆16Oct 17, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- python, ccxt, backtrader, dash☆10Apr 20, 2018Updated 8 years ago
- Code for paper "Episodic Memory Deep Q-Networks" (https://arxiv.org/abs/1805.07603), IJCAI 2018☆63Sep 5, 2018Updated 7 years ago
- Implementation of Bidirectional Recurrent Independent Mechanisms (Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural …☆28Nov 11, 2020Updated 5 years ago
- Differentiable Neural Computers, Sparse Access Memory and Sparse Differentiable Neural Computers, for Pytorch☆348Mar 16, 2026Updated last month
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆32Sep 17, 2018Updated 7 years ago
- ☆13Dec 12, 2022Updated 3 years ago
- Reproducing Random Numbers in Matlab and Python / NumPy☆11Dec 6, 2015Updated 10 years ago
- An MCP Server for Cosense☆19Dec 22, 2025Updated 4 months ago
- Lecture: Data Compression in Computational Science and Quantum Computing (計算科学・量子計算における情報圧縮)☆13Jan 18, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)☆18Apr 15, 2022Updated 4 years ago
- Official code for Conformal Isometry of Lie Group Representation in Recurrent Network of Grid Cells (NeurIPS workshop on Symmetry and Geo…☆12Nov 1, 2022Updated 3 years ago
- Show n-hop link destination pages beyond projects☆10Nov 25, 2025Updated 5 months ago
- Implementation of the Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning by Tianmin Shu, Caiming Xiong…☆11Jun 18, 2018Updated 7 years ago
- Code for Environment Probing Interaction Policies [ICLR 2019]☆29Jun 17, 2019Updated 6 years ago
- Code for "Divide-and-Conquer Reinforcement Learning"☆63Jan 8, 2019Updated 7 years ago
- Tensorflow code for WACV 2019 paper "Attention Based Natural Language Grounding by Navigating Virtual Environment" - https://arxiv.org/ab…☆17Nov 7, 2018Updated 7 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆10May 8, 2018Updated 7 years ago
- This is a self-contained memory module for the Dynamic Kanerva Machine, as reported in the NIPS 2018 paper: Learning Attractor Dynamics f…☆44Jan 24, 2019Updated 7 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Code and data accompanying "Learning Deployable Navigation at Kilometer Scale from a Single Traversal"☆11Jun 15, 2018Updated 7 years ago
- Dataset for Bilingual VLN☆11Dec 5, 2020Updated 5 years ago
- PyTorch Implementation of Generative Query Network☆138Dec 13, 2018Updated 7 years ago
- Taming MAML: efficient unbiased meta-reinforcement learning☆30Sep 30, 2022Updated 3 years ago
- Code for the EMNLP 2022 Findings short paper "SAT: Improving Semi-Supervised Text Classification with Simple Instance-Adaptive Self-Train…☆12Feb 25, 2023Updated 3 years ago
- [CoRL 2020] Learning 3D Dynamic Scene Representations for Robot Manipulation☆58Apr 11, 2023Updated 3 years ago
- PyTorch implementation of Advantage Actor-Critic (A2C)☆47Nov 25, 2017Updated 8 years ago