Scaling All-Goals Updates in Reinforcement Learning Using Convolutional Neural Networks
☆40Feb 5, 2020Updated 6 years ago
Alternatives and similar repositories for qmap
Users that are interested in qmap are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the Playground environment from the paper Language as a Cognitive Tool to Imagine Goals inCuriosity-Driven Exploration.☆11Mar 5, 2021Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆95Jul 27, 2022Updated 3 years ago
- Solving reinforcement learning tasks which require language and vision☆33Apr 4, 2023Updated 3 years ago
- Template-DQN and DRRN agent implementations☆22Jun 12, 2023Updated 2 years ago
- Malmö challenge☆18May 22, 2017Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A2C for GVG-AI☆22Nov 7, 2018Updated 7 years ago
- ☆14Jun 26, 2019Updated 6 years ago
- Repository for Malmo Challenge☆28Jul 9, 2017Updated 8 years ago
- Training Sonic with RLlib☆62Apr 2, 2023Updated 3 years ago
- Hands-On Reinforcement Learning with TensorFlow & TRFL☆14Jan 18, 2021Updated 5 years ago
- ☆18Jul 13, 2022Updated 3 years ago
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- This repository contains the source code of the EMNLP 2020 paper Interactive Fiction Game Playing as Multi-Paragraph Reading Comprehensio…☆20Oct 8, 2020Updated 5 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆268Oct 24, 2019Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 5 years ago
- Code for Go-Explore: a New Approach for Hard-Exploration Problems☆580Dec 8, 2022Updated 3 years ago
- Reinforcement Learning papers on exploration methods.☆19Jun 27, 2021Updated 4 years ago
- Source code for OpenAI Retro Contest for Sonic the Hedgehog☆31Aug 20, 2018Updated 7 years ago
- ICML 2018 Self-Imitation Learning☆276Apr 18, 2020Updated 6 years ago
- Attempt at reinforcement learning with curiosity for Sonic the Hedgehog games. Number 149 on OpenAI retro contest leaderboard, but more w…☆32Sep 17, 2018Updated 7 years ago
- Transfer Learning via Unsupervised Task Discovery for Visual Question Answering☆19Apr 8, 2019Updated 7 years ago
- AlphaGo Zero Reinforcement Learning Sokoban Solver☆11Jun 20, 2018Updated 7 years ago
- ☆15Sep 22, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆10Apr 2, 2018Updated 8 years ago
- My attempt to run DQN on Jetson TX1, which learns how to play Nintendo Famicom Mini games through reinforcement learning directly.☆15Mar 23, 2018Updated 8 years ago
- WMG agent☆34Oct 3, 2023Updated 2 years ago
- ☆151Dec 9, 2024Updated last year
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 7 years ago
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)☆10Jul 6, 2023Updated 2 years ago
- Deep Reinforcement Learning Agent☆19Dec 9, 2015Updated 10 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Jul 12, 2017Updated 8 years ago
- Modified version of the LeagueSandbox project which relies on a Redis server to accept actions and send observations. Intended for reinfo…☆12Feb 23, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Deep reinforcement learning in ViZDoom (using Tensorflow)☆19Jan 25, 2018Updated 8 years ago
- Upper Confidence Tree Planner for ATARI games☆19Mar 9, 2016Updated 10 years ago
- CFG-GAN: Composite functional gradient learning of generative adversarial models☆15Jul 9, 2020Updated 5 years ago
- AI learning from visual input using ViZDoom environment.☆12Jul 24, 2016Updated 9 years ago
- implementation of Wasserstein Natural Policy Gradients and Wasserstein Natural Evolution Strategies☆13Mar 9, 2021Updated 5 years ago
- Add attention layer to LSTM/word2vec model for sentiment analysis using tensorflow☆26Sep 30, 2017Updated 8 years ago
- Code for the paper "Learning to Act by Predicting the Future", Alexey Dosovitskiy and Vladlen Koltun, ICLR 2017☆151Sep 2, 2024Updated last year