Deep Double Q-Learning implementation introduced by Hasselt et al in this paper: https://arxiv.org/abs/1509.06461. It's interfacing with openAI Gym. WIP.
☆30Jan 1, 2017Updated 9 years ago
Alternatives and similar repositories for DDQN
Users that are interested in DDQN are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Deep Q-Networks in tensorflow☆10Apr 4, 2017Updated 8 years ago
- Monte Carlo Conterfactual Regret Minimization for imperfect information games☆13Mar 29, 2019Updated 7 years ago
- Another Implementation of "Context Encoders: Feature Learning by Inpainting"☆11Sep 28, 2017Updated 8 years ago
- Implementaion of Generic L-layer Neural Network from Scratch☆12May 14, 2018Updated 7 years ago
- Implementation of DeDOL algorithm - Deep Reinforcement Learning based algorithm for Green Security Games with Real Time Information☆16Nov 7, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆11Oct 29, 2019Updated 6 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Jul 3, 2018Updated 7 years ago
- A makeshift python program which relies on nltk and Stanford Core NLP models to expand common contractions in the english language.☆10Nov 8, 2017Updated 8 years ago
- ☆10Aug 17, 2018Updated 7 years ago
- Anomaly Detection Discriminative GAN (ADD-GAN)☆14Oct 9, 2017Updated 8 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- Trading Stock with Deep Reinforcement Learning☆24Aug 20, 2018Updated 7 years ago
- Comprehensive Implementation of Proximal Policy Optimization☆12Aug 3, 2021Updated 4 years ago
- [Deprecated] A simple Julia interface to the Stanford CoreNLP toolkit.☆18Feb 8, 2020Updated 6 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Files for London PyData London, 2015☆15Jun 18, 2015Updated 10 years ago
- Simple implementation of regret matching algorithm for RPS nash equilibrium computation via self-play☆26Sep 25, 2018Updated 7 years ago
- Pagerank in Julia. An experiment in pagerank on graphs in the order of billions of edges. Currently tested with over half a billion edges…☆12Aug 14, 2013Updated 12 years ago
- The code for NeurIPS 2020 paper: Adversarial Crowdsourcing Through Robust Rank-One Matrix Completion.☆10Oct 26, 2020Updated 5 years ago
- 完整的新词发现&词库构建例子☆20Mar 12, 2017Updated 9 years ago
- Super Mario Bros. (NES) gameplay dataset for machine learning.☆12Jul 22, 2025Updated 8 months ago
- Weighted Training for Cross-Task Learning☆15Feb 12, 2023Updated 3 years ago
- arXiv submission related tool repository☆15Updated this week
- Generate semi-realistic maps with Node.js and present them with leaflet☆11Dec 11, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆24Dec 13, 2018Updated 7 years ago
- Fast wavelet transforms on the sphere☆13Dec 20, 2016Updated 9 years ago
- Reading Group @ DMG☆11Nov 15, 2018Updated 7 years ago
- A Python-Markdown extension to ignore html comments opened by three dashes.☆10Aug 3, 2022Updated 3 years ago
- Computer Vision, 1st Project : Shape from Shading☆12Feb 24, 2014Updated 12 years ago
- ☆25Jun 5, 2015Updated 10 years ago
- Turn Wagtail pages into lifelike speech using Amazon Polly.☆12Jul 14, 2025Updated 8 months ago
- Using k-means clustering for unsupervised CNN deep learning.☆11Oct 26, 2017Updated 8 years ago
- A package for benchmarking code and packages☆18Mar 20, 2016Updated 10 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Unsupervised Tracklet Person Re-Identification☆10Apr 29, 2019Updated 6 years ago
- An eigenfaces demo to go along with a recent blog post: http://mikedusenberry.com/on-eigenfaces/☆17Feb 3, 2015Updated 11 years ago
- A Julia wrapper for Fast Library for Approximate Nearest Neighbors (FLANN)☆18Apr 8, 2024Updated last year
- A scalable implementation of the multifrontal method for symmetric and Hermitian systems (with intrafrontal pivoting)☆19Jun 27, 2016Updated 9 years ago
- reproduce some RL or Multi-Agent models☆35May 22, 2019Updated 6 years ago
- ☆10May 21, 2024Updated last year
- Nyan cat + Matplotlib☆12Sep 17, 2016Updated 9 years ago