🍄Reinforcement Learning: Super Mario Bros with dueling dqn🍄
☆147May 20, 2025Updated 10 months ago
Alternatives and similar repositories for Super-Mario-RL
Users that are interested in Super-Mario-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch Implementation of DQN and training Super Mario Bros☆25Nov 16, 2025Updated 4 months ago
- A Deep Q Network used for running experiments on reinforcement learning agents targeted at learning Super Mario Bros (NES)☆11Oct 12, 2017Updated 8 years ago
- A modular implementation for Proximal Policy Optimization in Tensorflow 2 using Eagerly Execution for the Super Mario Bros enviroment.☆21Nov 6, 2019Updated 6 years ago
- Simple implementations of multi-agent evolutionary strategies using pytorch.☆17Jan 15, 2022Updated 4 years ago
- Repo for tracking my progress in the Data Structure and Algorithms specialization course☆19Apr 8, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- An OpenAI Gym interface to Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on The NES☆847Aug 1, 2023Updated 2 years ago
- ☆12Aug 24, 2023Updated 2 years ago
- Proximal Policy Optimization (PPO) algorithm for Super Mario Bros☆1,273Jul 24, 2021Updated 4 years ago
- A simple option critic framework using Q-Learning☆14Feb 7, 2022Updated 4 years ago
- Interactive tutorial to build a learning Mario, for first-time RL learners☆247Jan 27, 2023Updated 3 years ago
- This is pytorch implmentation project of Bootsrapped DQN☆13Dec 6, 2020Updated 5 years ago
- 本项目旨在探索强化学习技术在经典游戏《超级玛丽》中的应用,通过训练一个智能代理来自主导航并完成游戏关卡。我们采用了深度Q网络(DQN)和双深度Q网络(DDQN)等先进的强化学习算法,结合神经网络,使得代理能够学习如何在游戏世界中生存并获得高分。 项目特点 强化学习实践:本…☆18Updated this week
- Negative Update Intervals in Multi-Agent Deep Reinforcement Learning☆35May 14, 2019Updated 6 years ago
- Implementation of Relational Deep Reinforcement Learning☆25Jan 31, 2020Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Contextual Bandits Action Elimination DQN☆21Jun 25, 2018Updated 7 years ago
- Integrate Apache RocketMQ with A2A☆29Feb 28, 2026Updated last month
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆56Mar 12, 2025Updated last year
- Translating HTN planning problems to PDDL☆21Jul 7, 2021Updated 4 years ago
- DQN model used to train and beat Super Mario Bros. for the NES using PyTorch☆36Nov 22, 2022Updated 3 years ago
- [ICML 2022] Robust Deep Reinforcement Learning through Bootstrapped Opportunistic Curriculum☆11Jul 15, 2022Updated 3 years ago
- Approximate convex decomposition(ACD)☆10Sep 9, 2023Updated 2 years ago
- Tensorflow implementation of DQN to control cart-pole from OpenAI gym environment☆14Sep 24, 2017Updated 8 years ago
- ☆27Jul 9, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- ☆17Jan 24, 2021Updated 5 years ago
- a script help you to auto play bard music.☆15Sep 5, 2023Updated 2 years ago
- ☆12Jun 30, 2022Updated 3 years ago
- code for learning trajectory dependencies for human motion prediction☆11Mar 2, 2022Updated 4 years ago
- IPyHOP is a Re-entrant Iterative GTPyHOP written in Python 3. PyHOP is an acronym for Python Hierarchical Ordered Planner.☆11Aug 12, 2022Updated 3 years ago
- Standardization Project for mjai Format Specification☆12Aug 28, 2024Updated last year
- ☆11Oct 29, 2024Updated last year
- Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning"☆11Apr 8, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A novel approach for Remaining Useful Life (RUL) prediction, combining meta-learning, knowledge discovery, and Physics-Informed Neural Ne…☆21Apr 21, 2025Updated 11 months ago
- Visualize machine learning models with Netron in VSCode☆17Nov 23, 2025Updated 4 months ago
- UAV trajectory design (DQN)☆22Oct 5, 2021Updated 4 years ago
- Reinforcement Learning PPO Super Mario Bros Agent☆13Dec 11, 2022Updated 3 years ago
- Reinforcement learning tutorials☆403Mar 25, 2023Updated 3 years ago
- VIDIMU-TOOLS is a code repository related to the public dataset "VIDIMU. multimodal video and IMU kinematic dataset on daily life activit…☆10Jun 2, 2024Updated last year
- unofficial implementation of https://arxiv.org/pdf/2301.08871v1.pdf on pytorch☆15Apr 20, 2023Updated 2 years ago