The state-of-art deep rl algorithms for Montezuma's revenge
☆28Oct 28, 2018Updated 7 years ago
Alternatives and similar repositories for rl-montezuma
Users that are interested in rl-montezuma are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- paper <<Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation>> python implementation☆10Mar 27, 2018Updated 8 years ago
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆207Nov 22, 2018Updated 7 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆95Jul 27, 2022Updated 3 years ago
- Reusable, Easy-to-use Uncertainty module package built with Tensorflow, Keras☆14Dec 31, 2018Updated 7 years ago
- ☆40Jul 29, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆10Apr 21, 2017Updated 9 years ago
- Minimal version of DeepMind AlphaZero☆85Dec 11, 2020Updated 5 years ago
- IPyHOP is a Re-entrant Iterative GTPyHOP written in Python 3. PyHOP is an acronym for Python Hierarchical Ordered Planner.☆12Aug 12, 2022Updated 3 years ago
- Exploring the use of options in creating small worlds for faster learning in RL Domains☆16Jan 23, 2012Updated 14 years ago
- Repository for studying distributional rl☆30Feb 2, 2025Updated last year
- weekly reinforcement learning paper reviews☆33Jan 8, 2018Updated 8 years ago
- TensorFlow implementation of Deep Reinforcement Learning papers☆28Dec 31, 2016Updated 9 years ago
- ☆11Sep 1, 2017Updated 8 years ago
- Cornell House Agent Learning Environment☆47Jun 22, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for the blog post "Learning Montezuma’s Revenge from a Single Demonstration"☆34Nov 22, 2018Updated 7 years ago
- Course Website for "AI618: Generative Model and Unsupervised Learning"☆37May 23, 2023Updated 3 years ago
- ☆119Jul 9, 2020Updated 5 years ago
- ☆15Feb 25, 2020Updated 6 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)☆371Aug 1, 2019Updated 6 years ago
- ☆11Oct 3, 2022Updated 3 years ago
- Map-Elites based on Evolution Strategies☆33Feb 11, 2022Updated 4 years ago
- This is the official codebase for paper: Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Acti…☆53Apr 9, 2026Updated last month
- Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstractions and Intrinsic Motivation☆88Mar 5, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Cochlear.ai submission for dcase2018 task2☆15Sep 14, 2018Updated 7 years ago
- Keeping track of RL experiments☆166Dec 17, 2022Updated 3 years ago
- Neural model of hierarchical reinforcement learning☆16Sep 14, 2017Updated 8 years ago
- World Models applied to the Open AI Sonic Retro Contest☆78Jun 30, 2018Updated 7 years ago
- ☆10Mar 11, 2024Updated 2 years ago
- Reinforcement Leanring for Tetris☆19Oct 24, 2016Updated 9 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 3 years ago
- dqn autoplay mario bros☆21Jul 24, 2017Updated 8 years ago
- Official python implementation of ASGRL in ICML 2022 paper: Leveraging Approximate Symbolic Models for Reinforcement Learning via Skill D…☆20Oct 5, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Convolutional neural networks for sound classification☆20Dec 30, 2017Updated 8 years ago
- ☆14Mar 9, 2020Updated 6 years ago
- [ICML 2023] Official code for "DevFormer: A Symmetric Transformer for Context-Aware Device Placement"☆22Dec 7, 2024Updated last year
- Random network distillation on Montezuma's Revenge and Super Mario Bros.☆55May 12, 2025Updated last year
- Official implementation of Neural Episodic Control with State Abstraction☆13Aug 3, 2023Updated 2 years ago
- Code for the Reset-free Trial and Error learning paper (RTE) experiments☆10Jan 3, 2018Updated 8 years ago
- Implementations of autoencoders (VAE, AAE, and others)☆11Oct 1, 2018Updated 7 years ago