Deep recurrent Q learning on CartPole-v1 environment
☆95Jan 15, 2024Updated 2 years ago
Alternatives and similar repositories for DRQN-Pytorch-CartPole-v1
Users that are interested in DRQN-Pytorch-CartPole-v1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆42Jul 18, 2019Updated 6 years ago
- ☆13Jun 1, 2020Updated 5 years ago
- Self-Supervised Attention-Aware Reinforcement Learning☆18May 20, 2022Updated 4 years ago
- Implementing DQNClipped and DQNReg Algorithms☆10Mar 2, 2021Updated 5 years ago
- Code of Paper "Cooperative Sensing and Uploading for Quality-Cost Tradeoff of Digital Twins in VEC", IEEE TCE, 2024.☆12Jul 10, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Solving POMDP using Recurrent networks☆93Jun 9, 2020Updated 5 years ago
- Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM) on Pyramid env, Unity ML☆20Dec 17, 2023Updated 2 years ago
- Minimal implementation of multi-agent reinforcement learning algorithms☆59Aug 30, 2021Updated 4 years ago
- A clean and robust implementation of Prioritized DQN and Prioritized Double DQN☆23Jun 8, 2024Updated last year
- ☆10Sep 21, 2020Updated 5 years ago
- ☆18Nov 10, 2023Updated 2 years ago
- Deep recurrent Q Learning using Tensorflow, openai/gym and openai/retro☆179Dec 8, 2022Updated 3 years ago
- Implementation code for GraphMIX: Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning☆36Feb 13, 2021Updated 5 years ago
- OpenAI gym-based algorithm for the grid world problem☆28Oct 20, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Training a car to drive in the CarRacing-v0 Gym Environment using imitation learning.☆21Oct 18, 2020Updated 5 years ago
- ☆39Dec 8, 2022Updated 3 years ago
- This repo is the official implementation of "Mask-based Latent Reconstruction for Reinforcement Learning" (NeurIPS 2022).☆30Jul 6, 2023Updated 2 years ago
- Tuning the PI controller parameters by using a contextual bandit approach☆15Jan 13, 2022Updated 4 years ago
- Code for paper: Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory co…☆24Mar 1, 2025Updated last year
- RLlib超参数详解(中文)☆18Jan 24, 2022Updated 4 years ago
- Reinforcement learning library for PyTorch.☆11Jun 15, 2018Updated 7 years ago
- PyTorch implementation of the discrete Soft-Actor-Critic algorithm.☆58Oct 1, 2021Updated 4 years ago
- Pybullet conversion of the OpenAI Gym fetch_reach environment. Uses a franka emika panda robot.☆12Dec 21, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18Apr 20, 2025Updated last year
- Gym env for TurtleBot3 robot☆19Aug 13, 2019Updated 6 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆347Apr 26, 2026Updated last month
- ☆20Oct 12, 2025Updated 7 months ago
- Works about Cucker-Smale model and its extensions. =Keywords: ODE, Runge-Kutta methods, SDE, Euler-Maruyama method, NumPy, Matplotlib☆12Feb 14, 2024Updated 2 years ago
- This is the code implementation of the paper "Financial Trading as a Game: A Deep Reinforcement Learning Approach".☆91Mar 15, 2021Updated 5 years ago
- Deep Reinforcement Learning Algorithms Implementation in PyTorch☆27Feb 11, 2025Updated last year
- ☆32Feb 5, 2024Updated 2 years ago
- Quantum Multi-agent Reinforcement Learning (QMARL)☆43May 8, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆13Jul 1, 2018Updated 7 years ago
- Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019☆807May 29, 2022Updated 4 years ago
- A Pytorch Implementation of Multi Agent Soft Actor Critic☆44Jan 29, 2019Updated 7 years ago
- A convex-set-based approach to manipulator trajectory planning☆15Apr 24, 2025Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT…☆1,342Mar 13, 2025Updated last year
- Examples for cgroup socket ingress/egress BPF filters with systemd☆14Jul 24, 2020Updated 5 years ago
- Model Predictive Controller for a quadcopter model using online learning with recursive Gaussian process regression in ROS-Gazebo☆27Apr 21, 2024Updated 2 years ago