using recurrent networks(LSTM) to solve POMDPs
☆35Oct 10, 2018Updated 7 years ago
Alternatives and similar repositories for pytorch-a2clstm-DRQN
Users that are interested in pytorch-a2clstm-DRQN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Pathfinding Using Reinforcement Learning☆12May 21, 2019Updated 6 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Jul 12, 2017Updated 8 years ago
- ☆20Feb 8, 2023Updated 3 years ago
- ☆42Jul 18, 2019Updated 6 years ago
- Multi-agent active perception with prediction rewards☆11Nov 13, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆179Dec 8, 2022Updated 3 years ago
- Based on the Kinect depth camera and MediaPipe, real-time reasoning of the joint activity of each part of the human body, and output a de…☆14May 10, 2023Updated 2 years ago
- ☆14Feb 19, 2020Updated 6 years ago
- VIP cheatsheets for Stanford's CS 229 Machine Learning☆10May 20, 2020Updated 5 years ago
- Gym-like extensions for POMDP☆56Feb 28, 2021Updated 5 years ago
- ☆10Sep 13, 2025Updated 7 months ago
- Python package for Dec-POMDP files in the .dpomdp format☆11Oct 28, 2022Updated 3 years ago
- Some example code for the "Introduction to Bayesian Reinforcement Learning" presentations☆29Feb 15, 2019Updated 7 years ago
- Auto-tune the Entropy Temperature of Soft Actor-Critic via Metagradient - 7th ICML AutoML workshop 2020☆33Jul 22, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- For managing 2P imaging datasets from preprocessing to activity trace extraction☆10Apr 12, 2019Updated 7 years ago
- Calculation of the entropy of the batch of images (whole image or patches)☆10Oct 15, 2021Updated 4 years ago
- This is the repository for the Master of Science thesis titled "GAN-based Matrix Factorization for Recommender Systems".☆10Aug 10, 2020Updated 5 years ago
- Band-limited Training and Inference for Convolutional Neural Networks☆20Nov 21, 2022Updated 3 years ago
- RL Algorithms for Visual Continuous Control☆36May 31, 2023Updated 2 years ago
- Code associated with the paper 'Nonlinear Model Predictive Control Based on Constraint-Aware Particle Filtering/Smoothing' by I. Askari☆14Mar 23, 2021Updated 5 years ago
- TensorFlow implementation of Deep Reinforcement Learning papers☆28Dec 31, 2016Updated 9 years ago
- Direct Gibbs sampling for DPMM using python.☆17Jun 2, 2017Updated 8 years ago
- ☆15Jun 1, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Non-linear policy graph improvement - planning for Dec-POMDPs☆16Mar 3, 2021Updated 5 years ago
- I added selfplay functionality to openai gyms☆10Jan 16, 2021Updated 5 years ago
- Code associated with our paper "Robust Domain Randomization for Reinforcement Learning"☆12Nov 22, 2022Updated 3 years ago
- ☆20Sep 14, 2019Updated 6 years ago
- A tool to implement Genomic Prediction Experiments using Deep Learning☆17Mar 24, 2023Updated 3 years ago
- A boilerplate (dbs, envs, teleop, models, web-apps) for robotic learning experiments & a Pytorch Implementation of "Learning Latent Plans…☆11Oct 23, 2020Updated 5 years ago
- This repo is the official implementation of the ICLR'23 paper "Towards Robustness Certification Against Universal Perturbations." We calc…☆12Feb 14, 2023Updated 3 years ago
- An environment based on JSBSIM aimed at one-to-one close air combat.☆14May 15, 2023Updated 2 years ago
- A visual-servo implementation based on IBVS by RealMan Robotics.☆19Oct 14, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Data cleanse, clustering with Vector Quantization and Adaptive Resonance Theory☆10Dec 10, 2017Updated 8 years ago
- Code for "Optimizing Quantum Variational Circuits with Deep Reinforcement Learning"☆20May 10, 2024Updated last year
- Deep reinforcement learning for UAV in Gazebo simulation environment☆146Aug 2, 2018Updated 7 years ago
- Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning☆22Jan 9, 2020Updated 6 years ago
- Intel Atom D2550 Embedded Motherboard☆13Dec 26, 2018Updated 7 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆20Dec 17, 2023Updated 2 years ago
- ☆41Nov 16, 2022Updated 3 years ago