A PyTorch implementation of Human-Level Control through Deep Reinforcement Learning
☆24Jun 6, 2017Updated 9 years ago
Alternatives and similar repositories for DQN-pytorch
Users that are interested in DQN-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 白话强化学习与PyTorch的学习笔记☆36Apr 21, 2020Updated 6 years ago
- Template for building 2D grid worlds with OpenAI Gym and Pycolab☆14Jun 12, 2019Updated 7 years ago
- DQN with pytorch with on Breakout and SpaceInvaders☆27Aug 13, 2019Updated 6 years ago
- Eigendecomposition-free Training of Deep Networks with Zero Eigenvalue-based Losses (ECCV 2018)☆16Aug 14, 2019Updated 6 years ago
- DQN to play Atari Pong☆112Jan 15, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Code for NeurIPS 2021 paper "Curriculum Offline Imitation Learning"☆18Oct 21, 2022Updated 3 years ago
- A very simple 1D Kalman Filter in MATLAB (for teaching)☆14Jan 3, 2017Updated 9 years ago
- Learning to reinforcement learn and treating sepsis on the side☆15Dec 9, 2017Updated 8 years ago
- Image Stiching for Panoramic Images☆10May 15, 2013Updated 13 years ago
- ☆25Dec 18, 2025Updated 6 months ago
- 引用整理https://blog.csdn.net/yellow_red_people/article/details/80465510 一文中PyTorch平台,利用DQN模型玩Flappy Bird游戏,是一个再励学习(强化学习)实验例子 。☆53Feb 10, 2019Updated 7 years ago
- 🕹️ Flappy Bird hack using Deep Reinforcement Learning with Double Q-learning☆19Oct 9, 2021Updated 4 years ago
- tracking person's behaviors such as: walking, sit down and fall down ..☆17Aug 3, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Open AI Gym Environment For MIMIC Dataset Sepsis Patient☆24Dec 8, 2022Updated 3 years ago
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated last year
- ☆15Updated this week
- ☆11Aug 17, 2018Updated 7 years ago
- Code for the paper "Importance Weighted Transfer of Samples in Reinforcement Learning" (ICML 2018).☆16May 29, 2018Updated 8 years ago
- Plato is a system for viewport adaptation based bitrate adaptive VR video streaming.☆15May 1, 2018Updated 8 years ago
- PyTorch implementation of Vanilla PG, TNPG, TRPO, PPO on Mujoco environment☆15Jul 1, 2018Updated 7 years ago
- Recursively Branched Deconvolutional Network: DCNN architecture for "Generalized Deep Image to Image Regression." CVPR2017 (Spotlight).☆21May 3, 2017Updated 9 years ago
- ☆10Nov 6, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is the repository for paper "Improving Sepsis Treatment Strategies using Deep Reinforcement Learning and Mixture-of-Experts"☆26Jul 6, 2018Updated 7 years ago
- Task dependent skill transformation is challenging due to the ignorance of the relationships between primitive skills. In this project, w…☆14Jun 4, 2020Updated 6 years ago
- 北京交通大学计算机科学与技术学院系统与网络实验室☆25Updated this week
- A research project exploring fine-tuning BERT-style models for text generation☆41Nov 30, 2025Updated 6 months ago
- high availability ros master☆17Nov 1, 2019Updated 6 years ago
- image caption with semantic attention☆11Apr 1, 2017Updated 9 years ago
- Fine Grained Visual Categorization☆11Jun 16, 2018Updated 8 years ago
- ☆10Feb 22, 2023Updated 3 years ago
- A minimal and interpretable Brian2 based DYNAP neuromorphic processor simulator for educational purposes.☆12Jun 23, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Arduino sketch to write to Nano 33 BLE Sense memory using the NVMC☆11Apr 15, 2020Updated 6 years ago
- Implementation of IODINE model☆10Jun 7, 2019Updated 7 years ago
- BuildStockQuery is a python library for querying datasets generated by ResStock™ and ComStock™.☆15Jun 5, 2026Updated last week
- Detect cat faces in images using CNNs with regression☆17Aug 16, 2017Updated 8 years ago
- Probing the limitations of multimodal language models for chemistry and materials research☆24Feb 1, 2026Updated 4 months ago
- Implementation of SNAIL(A Simple Neural Attentive Meta-Learner) with Gluon☆12Feb 22, 2019Updated 7 years ago
- Library for creating smooth cubic splines☆10Oct 15, 2020Updated 5 years ago