Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"
☆34 · Nov 22, 2018 · Updated 7 years ago
Alternatives and similar repositories for atari-demo
Users who are interested in atari-demo are comparing it to the libraries listed below.
- Chef cookbooks for managing a Ceph cluster ☆12 · Apr 2, 2023 · Updated 3 years ago
- Code for the paper "World of Bits: An Open-Domain Platform for Web-Based Agents" ☆32 · Nov 22, 2018 · Updated 7 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration" ☆207 · Nov 22, 2018 · Updated 7 years ago
- Bringup for fetch & freight ☆17 · Jul 3, 2016 · Updated 9 years ago
- Fluentd output plugin that sends events to Amazon Kinesis Streams and Amazon Kinesis Firehose. ☆13 · Apr 2, 2023 · Updated 3 years ago
- a python3 compatible pyconfigatron ☆10 · Oct 17, 2016 · Updated 9 years ago
- Wikipedia navigation environment for OpenAI Gym ☆41 · Apr 2, 2023 · Updated 3 years ago
- Training Sonic with RLlib ☆62 · Apr 2, 2023 · Updated 3 years ago
- Code for the paper "Improving GANs Using Optimal Transport" ☆75 · Nov 22, 2018 · Updated 7 years ago
- OpenAI Retro Contest ☆66 · Apr 2, 2023 · Updated 3 years ago
- Code for the paper "Understanding RL Vision" ☆51 · Apr 2, 2023 · Updated 3 years ago
- Websockify is a WebSocket to TCP proxy/bridge. This allows a browser to connect to any application/server/service. Implementations in Py… ☆29 · Nov 7, 2016 · Updated 9 years ago
- Service for quickly aliasing and redirecting to long URLs ☆25 · Apr 26, 2023 · Updated 3 years ago
- The Prometheus monitoring system and time series database. ☆37 · Apr 2, 2023 · Updated 3 years ago
- ☆28 · Nov 22, 2018 · Updated 7 years ago
- Submissions for AI and Efficiency SOTA's ☆58 · Jun 1, 2020 · Updated 6 years ago
- An interactive TLS-capable intercepting HTTP proxy for penetration testers and software developers ☆17 · Nov 24, 2016 · Updated 9 years ago
- ViZDoom Python wrapper ☆76 · Apr 2, 2023 · Updated 3 years ago
- ☆119 · Jul 9, 2020 · Updated 5 years ago
- Code for the Neural GPU model originally described in "Neural GPUs Learn Algorithms" ☆145 · Nov 22, 2018 · Updated 7 years ago
- Code for the paper "Quantifying Transfer in Reinforcement Learning" ☆408 · Oct 7, 2023 · Updated 2 years ago
- Code for the paper "Continuous Adaptation via Meta-Learning in Nonstationary and Competitive Environments" ☆309 · Apr 13, 2023 · Updated 3 years ago
- Code for the paper "Evolved Policy Gradients" ☆254 · Nov 22, 2018 · Updated 7 years ago
- Publicly releasable baselines for the Retro contest ☆130 · Nov 22, 2018 · Updated 7 years ago
- Exploration by Random Network Distillation ☆15 · Dec 30, 2018 · Updated 7 years ago
- Python wrappers for Pachi. Contains a modified version of the bleeding-edge Pachi source code. ☆42 · Apr 2, 2023 · Updated 3 years ago
- The state-of-art deep rl algorithms for Montezuma's revenge ☆28 · Oct 28, 2018 · Updated 7 years ago
- A collection of infrastructure and tools for research in neural network interpretability. ☆37 · Jan 25, 2019 · Updated 7 years ago
- Code for the paper "TD or not TD: Analyzing the Role of Temporal Differencing in Deep Reinforcement Learning", Artemij Amiranashvili, Ale… ☆12 · Aug 24, 2018 · Updated 7 years ago
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning" ☆182 · Apr 2, 2023 · Updated 3 years ago
- A minimal implementation of Go-Explore without domain knowledge ☆15 · Apr 26, 2021 · Updated 5 years ago
- Exploration based Reinforcement Learning. (Montezuma Revenge) ☆14 · Jul 23, 2018 · Updated 7 years ago
- Example code for Weight Normalization, from "Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Netw… ☆365 · Nov 22, 2018 · Updated 7 years ago
- Implementation for ICML 2019 paper, EMI: Exploration with Mutual Information. ☆37 · Dec 7, 2020 · Updated 5 years ago
- OpenAI Gym environment for DART robotics simulator. ☆22 · Apr 17, 2018 · Updated 8 years ago
- Code for the paper, "Distribution Augmentation for Generative Modeling", ICML 2020. ☆132 · Apr 24, 2023 · Updated 3 years ago
- Code for the paper "Exploration by Random Network Distillation" ☆933 · Oct 1, 2020 · Updated 5 years ago
- All those dead ducks must have been accumulating over the years. The dog is back with terrible terrible powers of resurrection. ☆11 · Aug 23, 2020 · Updated 5 years ago
- PPO Dash: Improving Generalization in Deep Reinforcement Learning ☆16 · Jul 17, 2019 · Updated 6 years ago