The original code from the DeepMind article + my tweaks
☆19Sep 3, 2015Updated 10 years ago
Alternatives and similar repositories for DeepMind-Atari-Deep-Q-Learner
Users that are interested in DeepMind-Atari-Deep-Q-Learner are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the Monte-Carlo CTW AIXI approximation as described by Joel Veness et al.☆12Jan 14, 2017Updated 9 years ago
- An implementation of Highway Networks in Caffe☆96Sep 20, 2015Updated 10 years ago
- ☆46May 25, 2022Updated 3 years ago
- MC-AIXI-CTW by Marcus Hutter and his students (in particular Daniel Visentin)☆52May 29, 2011Updated 14 years ago
- Simple deep Q-learning agent.☆702Mar 17, 2019Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This maintains a reading list for multi-agent reinforcement learning☆16Jun 8, 2017Updated 8 years ago
- My own comments and modifications to word2vec by Mikolov et al.☆16Jan 31, 2016Updated 10 years ago
- ☆151Aug 9, 2016Updated 9 years ago
- Reasonably-okay-performing implementation of a GAN and an adversarial autoencoder on MNIST.☆30Jan 1, 2016Updated 10 years ago
- Algorithmic Intelligence Quotient☆39Jan 2, 2022Updated 4 years ago
- Reinforcement learning with a convolutional neural network.☆35Apr 13, 2015Updated 11 years ago
- ideally, this will become a pure Haskell library for Linear Integer/Mixed Programming☆16Nov 12, 2018Updated 7 years ago
- Fast time library☆20Jul 1, 2025Updated 9 months ago
- ☆11Jul 8, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Simple, small, fully-connected Python version of NeoRL☆11Jan 29, 2016Updated 10 years ago
- ☆13Nov 17, 2015Updated 10 years ago
- A Python Interface for the Arcade Learning Environment (Shared Object)☆129Oct 30, 2020Updated 5 years ago
- Unfinished. Deep Q Learning in Tensorflow for ATARI.☆84Feb 12, 2016Updated 10 years ago
- A Spiking Multi-Layer Perceptron☆33Sep 5, 2017Updated 8 years ago
- Exercises for the semi-supervised summer school https://semisupervised-learning.compute.dtu.dk.☆11Aug 11, 2016Updated 9 years ago
- An implementation of Deep Q-Network using Caffe☆208Nov 8, 2016Updated 9 years ago
- ☆10Feb 20, 2020Updated 6 years ago
- Introduction tutorials to deep learning with Theano and OpenDeep☆51Dec 5, 2015Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- The implementation of "The Kanerva Machine" with Pytorch and Pyro☆12Jun 14, 2018Updated 7 years ago
- ☆12Oct 15, 2024Updated last year
- Differentiable MPC in Chainer, developed as part of PFN summer internship 2019.☆15Aug 23, 2022Updated 3 years ago
- yet another toy OCaml interpreter in Haskell☆12Jul 5, 2020Updated 5 years ago
- ☆68May 23, 2016Updated 9 years ago
- Learning to share: simultaneous parameter tying and sparsification in deep learning☆13Aug 21, 2018Updated 7 years ago
- http://a-terada.github.com/lamp/☆14Jul 14, 2023Updated 2 years ago
- Low-rank Highway Networks☆13Mar 11, 2016Updated 10 years ago
- Implementation of Variational Intrinsic Control in tensorflow☆11Apr 5, 2017Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for a generative controller for the AI Gym cartpole task☆15Feb 22, 2017Updated 9 years ago
- ☆49Aug 1, 2016Updated 9 years ago
- Python3 reimplementation of Wissner-Gross & Freer, 2013☆15Dec 18, 2025Updated 4 months ago
- Deterministic Policy Gradient using torch7☆43Jun 2, 2016Updated 9 years ago
- Gated Recurrent Unit with Low-rank matrix factorization☆34Mar 11, 2016Updated 10 years ago
- Implimentation of the Model Free Episodic Control paper by Deep Mind : http://arxiv.org/abs/1606.04460☆52Jul 25, 2016Updated 9 years ago
- Bertsekas auction algorithm for asymmetric matrices with positive real coefficients (from 0 to 1; eg. MHT data association) - Multithread…☆11Sep 18, 2020Updated 5 years ago