Learning a hard attention model by policy gradient, with rewards based on active inference.
☆22 · Sep 9, 2017 · Updated 8 years ago
Alternatives and similar repositories for hard-attention
Users that are interested in hard-attention are comparing it to the libraries listed below.
- Code for a generative controller for the AI Gym cartpole task ☆15 · Feb 22, 2017 · Updated 9 years ago
- Active inference implementation of dynamic multi-armed bandits ☆20 · Jun 25, 2025 · Updated 10 months ago
- Probabilistic inference for models of behaviour ☆13 · Mar 5, 2026 · Updated 2 months ago
- General framework for Bayesian inversion of continuous hierarchical models ☆10 · Sep 20, 2021 · Updated 4 years ago
- ☆14 · Oct 7, 2022 · Updated 3 years ago
- Implement BinaryNet of CNN with chainer ☆11 · May 5, 2016 · Updated 10 years ago
- Repo for code for the NIPS paper entitled "An Architecture for Deep, Hierarchical Generative Models" ☆14 · Oct 27, 2016 · Updated 9 years ago
- Model-Free Episodic Control ☆14 · Jan 12, 2017 · Updated 9 years ago
- ☆16 · Mar 10, 2018 · Updated 8 years ago
- PyTorch implementations of Reinforcement Learning algorithms in less than 200 lines ☆10 · Apr 3, 2020 · Updated 6 years ago
- Design good curriculums for deep reinforcement learning ☆14 · May 18, 2016 · Updated 9 years ago
- Explore and Control with Adversarial Surprise ☆10 · Jul 20, 2021 · Updated 4 years ago
- Code for Continual Reinforcement Learning with Multi-Timescale Replay ☆24 · Apr 16, 2020 · Updated 6 years ago
- Deterministic Policy Gradient using torch7 ☆43 · Jun 2, 2016 · Updated 9 years ago
- PyOblige is Python wrapper for OBLIGE - random level generator for Doom ☆11 · Jul 2, 2018 · Updated 7 years ago
- Deep Variational Information Bottleneck (DVIB) in PyTorch. ☆10 · Apr 25, 2020 · Updated 6 years ago
- Implementation of VALOR (Variational Option Discovery Algorithms) ☆10 · Jun 28, 2019 · Updated 6 years ago
- ☆20 · Apr 27, 2016 · Updated 10 years ago
- some RL algorithms ☆19 · Dec 9, 2016 · Updated 9 years ago
- Pytorch Code for Semi-supervised Learning on MNIST Data Set ☆12 · Mar 11, 2017 · Updated 9 years ago
- A docker container that lets you run AirSim without building it. ☆14 · Sep 20, 2017 · Updated 8 years ago
- ☆20 · Sep 8, 2023 · Updated 2 years ago
- PyTorch implementation of different Deep RL algorithms for the LunarLander-v2 environment in OpenAI Gym ☆11 · May 20, 2018 · Updated 7 years ago
- Reinforcement learning models in ViZDoom environment ☆130 · Mar 9, 2022 · Updated 4 years ago
- Using a shared file to exchange data between Unity and Python ☆13 · Oct 30, 2021 · Updated 4 years ago
- An opensource implementation of kanerva coding for use in reinforcement learning research ☆11 · Mar 28, 2026 · Updated last month
- This repo contains a set of notebooks to reproduce reinforcement learning algorithms. ☆16 · Nov 21, 2022 · Updated 3 years ago
- ☆10 · Nov 23, 2020 · Updated 5 years ago
- Cloud-and-Learning compatible Automated vehicle Platform (Mirrored from Gitlab, please post issues to the Gitlab link) ☆11 · Apr 17, 2022 · Updated 4 years ago
- Torch implementation of Sequence to Sequence Learning with Neural Networks ☆24 · Oct 14, 2015 · Updated 10 years ago
- Code for Attentive Recurrent Comparators ☆56 · Mar 3, 2017 · Updated 9 years ago
- Reinforcement Learning through Active Inference ☆83 · Apr 27, 2020 · Updated 6 years ago
- Highway networks implemented in PyTorch. ☆54 · Apr 5, 2017 · Updated 9 years ago
- Demo of Unity3D CUDA texture interop issue ☆17 · Jun 20, 2017 · Updated 8 years ago
- ☆16 · Aug 12, 2023 · Updated 2 years ago
- A Tree-LSTM-based dependency tree sentiment labeler ☆15 · May 9, 2019 · Updated 7 years ago
- ☆10 · May 5, 2017 · Updated 9 years ago
- Implementation of Robust Adversarial Reinforcement Learning ☆14 · Nov 27, 2017 · Updated 8 years ago
- ☆13 · Apr 2, 2025 · Updated last year