Solving POMDP using Recurrent networks
☆93Jun 9, 2020Updated 5 years ago
Alternatives and similar repositories for Recurrent-Deep-Q-Learning
Users that are interested in Recurrent-Deep-Q-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆42Jul 18, 2019Updated 6 years ago
- using recurrent networks(LSTM) to solve POMDPs☆35Oct 10, 2018Updated 7 years ago
- Pathfinding Using Reinforcement Learning☆12May 21, 2019Updated 6 years ago
- Fully Decentralized Multi-Agent Reinforcement Learning with Networked Agents☆12Jan 14, 2022Updated 4 years ago
- Code for the NeurIPS 2021 paper "Deep Bandits Show-Off: Simple and Efficient Exploration with Deep Networkst"☆14Sep 12, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Exploring bayesian strategies for approximating optimal actions in POMDPs☆14Jun 27, 2019Updated 6 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆179Dec 8, 2022Updated 3 years ago
- Predicting Walmart's Quarterly Earnings - Pytorch LSTM Example☆10Nov 3, 2021Updated 4 years ago
- This is pytorch implmentation project of Bootsrapped DQN☆13Dec 6, 2020Updated 5 years ago
- Implementation of 6 DQN extension methods using Pytorch. (RAINBOW)☆16Dec 7, 2020Updated 5 years ago
- POMDPs in Python.☆256Mar 11, 2026Updated last month
- Gym-like extensions for POMDP☆56Feb 28, 2021Updated 5 years ago
- Just another DAgger algorithm implementation☆14Apr 10, 2017Updated 9 years ago
- DistFlow Safe Reinforcement Learning Algorithm for Voltage Magnitude Regulation in Distribution Networks☆13Jul 9, 2025Updated 9 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Multi-agent active perception with prediction rewards☆11Nov 13, 2020Updated 5 years ago
- ☆26Apr 26, 2024Updated 2 years ago
- Code for the ICML 2020 publication "Information Particle Filter Tree: An Online Algorithm for POMDPs with Belief-Based Rewards on Continu…☆14Jul 3, 2020Updated 5 years ago
- JAX implementations of various deep reinforcement learning algorithms.☆25Feb 2, 2025Updated last year
- Experiments with reinforcement learning and recurrent neural networks☆115Oct 27, 2023Updated 2 years ago
- Code for "Towards Optimal Correlational Object Search" | ICRA 2022☆21Jul 10, 2024Updated last year
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 7 years ago
- Based on the Kinect depth camera and MediaPipe, real-time reasoning of the joint activity of each part of the human body, and output a de…☆14May 10, 2023Updated 2 years ago
- OpenAI gym-based algorithm for the grid world problem☆28Oct 20, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆14Feb 19, 2020Updated 6 years ago
- Python implementation of tabular asynchronous actor critic☆11May 3, 2016Updated 10 years ago
- ☆14Dec 4, 2018Updated 7 years ago
- Simple Cartpole example writed with pytorch.☆170Oct 29, 2019Updated 6 years ago
- Reimplementation of D4RT☆42Dec 26, 2025Updated 4 months ago
- POMDP formulation of a pedestrian avoidance problem for autonomous driving☆50Apr 3, 2020Updated 6 years ago
- Python package for Dec-POMDP files in the .dpomdp format☆11Oct 28, 2022Updated 3 years ago
- A pytorch tutorial for DRL(Deep Reinforcement Learning)☆225Apr 24, 2023Updated 3 years ago
- PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)☆55Dec 8, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)☆20Oct 25, 2021Updated 4 years ago
- code for paper "Two-Critic Deep Reinforcement Learning for Inverter-based Volt-Var Control in Active Distribution Networks"☆19Apr 10, 2024Updated 2 years ago
- Tutorials on how to use EAGERx☆16Aug 14, 2025Updated 8 months ago
- Codes for Hilbert space reduced-rank GP regression☆16Jul 30, 2019Updated 6 years ago
- Rich literature review and discussion on the implementation of "Hierarchical Decision-Making for Autonomous Driving"☆60Aug 21, 2018Updated 7 years ago
- Value-Decomposition Networks For Cooperative Multi-Agent Learning☆25Apr 14, 2021Updated 5 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,898May 29, 2022Updated 3 years ago