using recurrent networks(LSTM) to solve POMDPs
☆35Oct 10, 2018Updated 7 years ago
Alternatives and similar repositories for pytorch-a2clstm-DRQN
Users that are interested in pytorch-a2clstm-DRQN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Solving POMDP using Recurrent networks☆93Jun 9, 2020Updated 5 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Jul 12, 2017Updated 8 years ago
- ☆20Feb 8, 2023Updated 3 years ago
- ☆42Jul 18, 2019Updated 6 years ago
- My own implementation of Reinforcement Learning algorithms using Tensorflow 2.0☆30Jan 22, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Multi-agent active perception with prediction rewards☆11Nov 13, 2020Updated 5 years ago
- Deep Recurrent Attention Reinforcement Learning in Atari☆83Jul 19, 2018Updated 7 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆179Dec 8, 2022Updated 3 years ago
- ☆14Feb 19, 2020Updated 6 years ago
- VIP cheatsheets for Stanford's CS 229 Machine Learning☆10May 20, 2020Updated 6 years ago
- Gym-like extensions for POMDP☆56Feb 28, 2021Updated 5 years ago
- Python package for Dec-POMDP files in the .dpomdp format☆11Oct 28, 2022Updated 3 years ago
- Some example code for the "Introduction to Bayesian Reinforcement Learning" presentations☆29Feb 15, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Jul 22, 2021Updated 4 years ago
- Tools for using the Kinect One (Kinect v2) in ROS with OpenCV4. Tested on ROS Noetic☆31Apr 13, 2023Updated 3 years ago
- Calculation of the entropy of the batch of images (whole image or patches)☆10Oct 15, 2021Updated 4 years ago
- This is the repository for the Master of Science thesis titled "GAN-based Matrix Factorization for Recommender Systems".☆10Aug 10, 2020Updated 5 years ago
- RL Algorithms for Visual Continuous Control☆36May 31, 2023Updated 2 years ago
- Hierarchical and Stable Multiagent Reinforcement Learning for Cooperative Navigation Control☆14May 5, 2022Updated 4 years ago
- Wavelet transform based scheme for edge detection☆17Aug 10, 2020Updated 5 years ago
- This repository provides the python implementation for the paper "Decentralized Multi-Agent Formation Control via Deep Reinforcement Lear…☆20Jan 19, 2022Updated 4 years ago
- TensorFlow implementation of Deep Reinforcement Learning papers☆28Dec 31, 2016Updated 9 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Direct Gibbs sampling for DPMM using python.☆17Jun 2, 2017Updated 8 years ago
- ☆16Jun 1, 2023Updated 2 years ago
- Non-linear policy graph improvement - planning for Dec-POMDPs☆16Mar 3, 2021Updated 5 years ago
- ☆12Mar 24, 2025Updated last year
- ☆29Jul 14, 2025Updated 10 months ago
- I added selfplay functionality to openai gyms☆10Jan 16, 2021Updated 5 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- ☆20Sep 14, 2019Updated 6 years ago
- A boilerplate (dbs, envs, teleop, models, web-apps) for robotic learning experiments & a Pytorch Implementation of "Learning Latent Plans…☆11Oct 23, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆26Sep 16, 2025Updated 8 months ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)☆22Aug 4, 2022Updated 3 years ago
- This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calc…☆12Feb 14, 2023Updated 3 years ago
- An environment based on JSBSIM aimed at one-to-one close air combat.☆14May 15, 2023Updated 3 years ago
- A visual-servo implementation based on IBVS by RealMan Robotics.☆21Oct 14, 2024Updated last year
- This is the companion code for the paper Noisy-Input Entropy Search for Efficient Robust Bayesian Optimization by Lukas P. Fröhlich et al…☆11Nov 10, 2020Updated 5 years ago
- Code for "Optimizing Quantum Variational Circuits with Deep Reinforcement Learning"☆20May 10, 2024Updated 2 years ago