Deep Q Learning via Pytorch
β86Jan 9, 2018Updated 8 years ago
Alternatives and similar repositories for dqn-pytorch
Users that are interested in dqn-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- pytorch, noisy_distributional_double_dueling_PER_RNN_CNN...CartPole-v1 , Acrobot-v1, MountainCar-v0β14Mar 19, 2018Updated 8 years ago
- πΉοΈ Flappy Bird hack using Deep Reinforcement Learning with Double Q-learningβ18Oct 9, 2021Updated 4 years ago
- Deep Q-Learning Network in pytorch (not actively maintained)β428Nov 1, 2017Updated 8 years ago
- ε¦δΉ DRL CNN -> DQN -> LSTMβ13Oct 7, 2018Updated 7 years ago
- Implementation of Deep/Double Deep/Dueling Deep Q networks for playing Atari games using Keras and OpenAI gymβ40Sep 23, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A PyTorch implementation of Rainbow DQN agentβ170Apr 23, 2018Updated 8 years ago
- β13Dec 6, 2018Updated 7 years ago
- Solutions for different Reinforcement Learning environmentsβ26Aug 2, 2024Updated last year
- π² Stanford CS234 : Reinforcement Learningβ13Jan 14, 2019Updated 7 years ago
- We implement MADDPG in a congestion env, and compare with several control groups to highlight the performance of MADDPGβ11Jul 14, 2021Updated 4 years ago
- krazy grid worldβ25Mar 2, 2020Updated 6 years ago
- Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advantaβ¦β194Sep 19, 2024Updated last year
- DQN to play Atari Pongβ113Jan 15, 2019Updated 7 years ago
- β44Dec 4, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer β’ AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Differentiable neural computersβ27Nov 16, 2016Updated 9 years ago
- Simple implementation of the model presented in Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic β¦β16Jan 22, 2019Updated 7 years ago
- LEACH routing protocol in WSNβ14Mar 1, 2020Updated 6 years ago
- Mobile Ad Hoc (MANET) simulation and analysis using OMNET++.β11Mar 3, 2020Updated 6 years ago
- β12Jul 4, 2022Updated 3 years ago
- Value & Policy Iteration for the frozenlake environment of OpenAIβ15May 14, 2019Updated 6 years ago
- SeqGAN but with more bells and whistlesβ24Feb 15, 2018Updated 8 years ago
- DQN based RL agent for Mountain Carβ12Sep 7, 2016Updated 9 years ago
- PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.β614Nov 11, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Optimized dqn for caffeβ11Dec 18, 2015Updated 10 years ago
- Deep Reinforcement Learning with pytorch & visdomβ805Jul 16, 2020Updated 5 years ago
- Simple, small, fully-connected Python version of NeoRLβ11Jan 29, 2016Updated 10 years ago
- MineRL 2021 Intro track baselinesβ13Jul 30, 2021Updated 4 years ago
- Reinforcement Learning Tutorial on Super Marioβ90Nov 13, 2017Updated 8 years ago
- Exercises for the semi-supervised summer school https://semisupervised-learning.compute.dtu.dk.β11Aug 11, 2016Updated 9 years ago
- Dataset Bias correction (Python)β20Jan 7, 2018Updated 8 years ago
- A routing algorithm based on QLearningβ17Dec 24, 2020Updated 5 years ago
- For managing 2P imaging datasets from preprocessing to activity trace extractionβ10Apr 12, 2019Updated 7 years ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- OpenAI Gym Environments for the Application of Reinforcement Learning in the Simulation of Wireless Networked Feedback Control Loopsβ15Feb 5, 2021Updated 5 years ago
- A fighter fly out trajectory time series data mining demo, I use agnes and k-means to clustering the flyout data samples into left, straiβ¦β13Aug 12, 2017Updated 8 years ago
- Published by Packtβ11Jan 18, 2021Updated 5 years ago
- Actor-critic with experience replayβ258Oct 9, 2022Updated 3 years ago
- A Dynatrace OneAgent extension for gathering NVIDIA GPU metrics using NVIDIA Management Library (NVML)β10May 3, 2020Updated 6 years ago
- automatically create cortical flatmaps from FreeSurfer surfacesβ28Apr 20, 2026Updated 2 weeks ago
- β10Apr 13, 2023Updated 3 years ago