using recurrent networks(LSTM) to solve POMDPs
☆35Oct 10, 2018Updated 7 years ago
Alternatives and similar repositories for pytorch-a2clstm-DRQN
Users that are interested in pytorch-a2clstm-DRQN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pathfinding Using Reinforcement Learning☆12May 21, 2019Updated 7 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Jul 12, 2017Updated 8 years ago
- ☆20Feb 8, 2023Updated 3 years ago
- Multi-agent active perception with prediction rewards☆12Nov 13, 2020Updated 5 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆14Feb 19, 2020Updated 6 years ago
- VIP cheatsheets for Stanford's CS 229 Machine Learning☆10May 20, 2020Updated 6 years ago
- Gym-like extensions for POMDP☆56Feb 28, 2021Updated 5 years ago
- Python package for Dec-POMDP files in the .dpomdp format☆11Oct 28, 2022Updated 3 years ago
- Reimplementation of D4RT☆48Dec 26, 2025Updated 5 months ago
- Some example code for the "Introduction to Bayesian Reinforcement Learning" presentations☆29Feb 15, 2019Updated 7 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Jul 22, 2021Updated 4 years ago
- For managing 2P imaging datasets from preprocessing to activity trace extraction☆10Apr 12, 2019Updated 7 years ago
- ☆13Jun 1, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Calculation of the entropy of the batch of images (whole image or patches)☆10Oct 15, 2021Updated 4 years ago
- This is the repository for the Master of Science thesis titled "GAN-based Matrix Factorization for Recommender Systems".☆10Aug 10, 2020Updated 5 years ago
- RL Algorithms for Visual Continuous Control☆36May 31, 2023Updated 3 years ago
- ☆10Apr 23, 2020Updated 6 years ago
- Hierarchical and Stable Multiagent Reinforcement Learning for Cooperative Navigation Control☆14May 5, 2022Updated 4 years ago
- Wavelet transform based scheme for edge detection☆17Aug 10, 2020Updated 5 years ago
- Direct Gibbs sampling for DPMM using python.☆17Jun 2, 2017Updated 9 years ago
- ☆16Jun 1, 2023Updated 3 years ago
- ☆12Mar 24, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- I added selfplay functionality to openai gyms☆10Jan 16, 2021Updated 5 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"☆12Nov 22, 2022Updated 3 years ago
- A boilerplate (dbs, envs, teleop, models, web-apps) for robotic learning experiments & a Pytorch Implementation of "Learning Latent Plans…☆11Oct 23, 2020Updated 5 years ago
- Pytorch code for "Learning Belief Representations for Imitation Learning in POMDPs" (UAI 2019)☆22Aug 4, 2022Updated 3 years ago
- A visual-servo implementation based on IBVS by RealMan Robotics.☆21Oct 14, 2024Updated last year
- Data cleanse, clustering with Vector Quantization and Adaptive Resonance Theory☆10Dec 10, 2017Updated 8 years ago
- This is the companion code for the paper Noisy-Input Entropy Search for Efficient Robust Bayesian Optimization by Lukas P. Fröhlich et al…☆10Nov 10, 2020Updated 5 years ago
- Code for "Optimizing Quantum Variational Circuits with Deep Reinforcement Learning"☆20May 10, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Jan 9, 2020Updated 6 years ago
- ☆11Nov 24, 2023Updated 2 years ago
- reproducible dev+test+production environments for java+javascript+clojure(script)☆13Feb 2, 2021Updated 5 years ago
- The code of SpikingSSMs: Learning Long Sequences with Sparse and Parallel Spiking State Space Models☆23Mar 25, 2026Updated 2 months ago
- Codes for Spiking Neural Networks with Improved Inherent Recurrence Dynamics for Sequential Learning☆11May 5, 2022Updated 4 years ago
- Code for abstracting, evaluating, and visualizing Markov Decision Processes.☆10Jan 12, 2017Updated 9 years ago
- Official code for: Observe Then Act: Asynchronous Active Vision-Action Model for Robotic Manipulation (OTA)☆20Dec 30, 2024Updated last year