Hjananggch / gym_super_mario
本项目旨在探索强化学习技术在经典游戏《超级玛丽》中的应用,通过训练一个智能代理来自主导航并完成游戏关卡。我们采用了深度Q网络(DQN)和双深度Q网络(DDQN)等先进的强化学习算法,结合神经网络,使得代理能够学习如何在游戏世界中生存并获得高分。 项目特点 强化学习实践:本项目是强化学习理论与实践的结合,展示了如何将强化学习算法应用于实际问题中。 深度学习集成:通过集成深度学习模型,我们的智能代理能够处理复杂的游戏环境并做出决策。 环境优化:我们对游戏环境进行了优化,包括状态预处理和奖励设计,以提高学习效率和代理性能。 可视化工具:项目包含了训练过程的可视化工具,帮助开发者和研究人员理解代理的学习进度和行为策略。
☆8Updated 7 months ago
Alternatives and similar repositories for gym_super_mario:
Users that are interested in gym_super_mario are comparing it to the libraries listed below
- 一个简洁易用3D场景创建和控制工具。基于ThreeJS。纯Python接口。它适用于科研、多智能体强化学习领域的3D演示、娱乐等应用。☆42Updated last year
- 基于gym的pytorch深度强化学习(DRL)(PPO,PPG,DQN,SAC,DDPG,TD3等算法)☆104Updated last month
- Using deep reinforcement learning to play Snake game(贪吃蛇).☆80Updated 3 years ago
- Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping☆12Updated 4 years ago
- 基于Pytorch实现的PPO强化学习模型,支持训练各种游戏,如超级马里奥,雪人兄弟,魂斗罗等等。☆23Updated 4 years ago
- Play atari Tennis game by dqn☆74Updated 2 years ago
- notes☆29Updated 2 years ago
- ☆40Updated last week
- rl-papers☆47Updated 2 years ago
- mcc_second_guandan☆79Updated 2 years ago
- 强化学习玩超级马里奥☆72Updated 3 years ago
- The Code for Paper “Relay Hindsight Experience Replay: Self-Guided Continual Reinforcement Learning for Sequential Object Manipulation Ta…☆150Updated 10 months ago
- 2048 environment for Reinforcement Learning and DQN algorithm☆40Updated 2 years ago
- Example code for the Gym documentation☆71Updated last year
- ☆165Updated last year
- Multiagent Reinforcement Learning Research Project☆203Updated 6 months ago
- 强化学习算法库,包含了目前主流的强化学习算法(Value based and Policy based)的代码,代码都经过调试并可以运行☆79Updated last year
- RLToolkit is a flexible and high-efficient reinforcement learning framework. Include implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG,…☆17Updated last year
- ☆65Updated last year
- ☆19Updated 2 years ago
- This is the official implementation of Multi-Agent PPO.☆105Updated 2 years ago
- A Reinforcement Learning Project using PPO + LSTM☆73Updated last year
- ☆102Updated 2 months ago
- ☆96Updated 3 years ago
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆121Updated 2 weeks ago
- The implement of the policy gradient RL algorithm with pytorch☆38Updated 4 years ago
- ☆36Updated 10 months ago
- ☆21Updated 2 years ago
- PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms☆20Updated last month
- We extend pymarl2 to pymarl3, equipping the MARL algorithms with permutation invariance and permutation equivariance properties. The enh…☆155Updated last year