📖 Paper: Human-level control through deep reinforcement learning 🕹️
☆57May 9, 2024Updated last year
Alternatives and similar repositories for Human-level-control-through-deep-reinforcement-learning
Users that are interested in Human-level-control-through-deep-reinforcement-learning are comparing it to the libraries listed below
Sorting:
- IROS 2018 Software Tutorial on XBotControl☆10Oct 16, 2019Updated 6 years ago
- ☆13Sep 12, 2022Updated 3 years ago
- ☆20May 22, 2022Updated 3 years ago
- Deep Q Networks☆96Oct 18, 2018Updated 7 years ago
- Different implementations of Bayesian neural networks for uncertainty estimation. The uncertainty estimation is utilized for efficient ex…☆10Nov 29, 2020Updated 5 years ago
- XBotControl framework: XBotCore + OpenSoT + CartesI/O☆32Nov 20, 2023Updated 2 years ago
- Yet Another Reinforcement Learning Tutorial☆72Feb 17, 2023Updated 3 years ago
- PyTorch implementation of DeepMind's "Human-level control through deep reinforcement learning"☆19Apr 6, 2020Updated 5 years ago
- ☆14Jun 11, 2024Updated last year
- DQN with pytorch with on Breakout and SpaceInvaders☆27Aug 13, 2019Updated 6 years ago
- Learning Backtracking Models, ICLR'19☆10Feb 2, 2018Updated 8 years ago
- Episodic Control☆22Sep 20, 2022Updated 3 years ago
- Code repository for the research project "You Play Ball, I Play Ball: Bayesian Multi-Agent Reinforcement Learning for Slime Volleyball", …☆17Nov 15, 2020Updated 5 years ago
- ☆15Feb 18, 2020Updated 6 years ago
- Play Atari(Breakout) Game by DRL - DQN, Noisy DQN and A3C☆15May 30, 2020Updated 5 years ago
- Analyzes and adjusts the volume of MP3 files☆12Apr 7, 2019Updated 6 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- PyTorch Implementation of Ape-X (Distributed prioritized experience replay) architecture with DQN learner☆28Sep 5, 2020Updated 5 years ago
- Training a vision-based agent with the Deep Q Learning Network (DQN) in Atari's Breakout environment, implementation in Tensorflow.☆18Dec 12, 2018Updated 7 years ago
- Minimal example to apply Decision Transformer in Atari Pong☆15Feb 1, 2025Updated last year
- Collections of powerful RL architectures with brief introductions.☆13Nov 20, 2020Updated 5 years ago
- GPTPy: Your kind Python guide, powered by AI to fix errors and explain code☆14Mar 12, 2023Updated 3 years ago
- The original Deepmind Atari 2600 DQN code☆32Mar 24, 2023Updated 2 years ago
- Official python implementation of ASGRL in ICML 2022 paper: Leveraging Approximate Symbolic Models for Reinforcement Learning via Skill D…☆20Oct 5, 2022Updated 3 years ago
- Focal loss implemention by PyTorch☆11Dec 16, 2018Updated 7 years ago
- PhysioNet 2019 Challenge: Early Prediction of Sepsis from Clinical Data☆12May 19, 2019Updated 6 years ago
- Thompson Sampling for Bandits using UCB policy☆10Jul 29, 2017Updated 8 years ago
- For the paper "Prediction-Based Reachability for Collision Avoidance in Autonomous Driving"☆22Dec 16, 2020Updated 5 years ago
- Latest version of GaitSym including fully GUI design☆12Dec 4, 2025Updated 3 months ago
- 5th place solution for ACM MM2021 Robust Logo Detection Grand Challenge☆13Dec 25, 2022Updated 3 years ago
- Safe Model-Based RL HVAC Control Using Epistemic Uncertainty Estimation.☆11Feb 25, 2025Updated last year
- ☆10Apr 5, 2024Updated last year
- Source code for our paper "AdapterShadow: Adapting Segment Anything Model for Shadow Detection"☆16Apr 19, 2025Updated 11 months ago
- The code corresponding to the paper "Improving Sample Efficiency of Deep Reinforcement Learning for Bipedal Walking".☆23Aug 8, 2022Updated 3 years ago
- 一些用于互联网算法岗面试复习用的常见手撕代码合集:排序算法、最短路算法、二叉树遍历算法、sql语句、nms算法、IOU算法、多头注意力MHA等☆21Mar 18, 2025Updated last year
- Code for "Learning Canonical Representations for Scene Graph to Image Generation", Herzig & Bar et al., ECCV2020☆30Nov 22, 2022Updated 3 years ago
- ☆16Nov 4, 2021Updated 4 years ago
- This repository contains code for the paper "Learning Decision Trees as Amortized Structure Inference"☆16Mar 25, 2025Updated 11 months ago
- Implementation for "Surrogate Losses for Online Learning of Stepsizes in Stochastic Non-Convex Optimization"☆10Aug 3, 2022Updated 3 years ago