DQN model used to train and beat Super Mario Bros. for the NES using PyTorch
☆37Nov 22, 2022Updated 3 years ago
Alternatives and similar repositories for super-mario-bros-dqn
Users that are interested in super-mario-bros-dqn are comparing it to the libraries listed below.
- Playing Super Mario with reinforcement learning☆82May 8, 2022Updated 3 years ago
- dqn autoplay mario bros☆21Jul 24, 2017Updated 8 years ago
- Code for moziai reinforcement learning and behavior trees☆10Mar 18, 2020Updated 6 years ago
- PyTorch Implementation of DQN and training Super Mario Bros☆25Nov 16, 2025Updated 5 months ago
- This is a mathematical model built in MATLAB for studying the effect of missile attacks on an aircraft carrier.☆13Jul 15, 2019Updated 6 years ago
- A classic reinforcement learning course taught at UCL (University College London) by David Silver, lead of the AlphaGo project☆24Feb 1, 2019Updated 7 years ago
- Package (ROS 1 & ROS 2) for human keypoints identification, 3D reconstruction, tracking, and filtering in collaborative robotics.☆18Nov 20, 2025Updated 5 months ago
- Experiments of the three PPO-Algorithms (PPO, clipped PPO, PPO with KL-penalty) proposed by John Schulman et al. on the 'Cartpole-v1' env…☆13Nov 14, 2021Updated 4 years ago
- Transfer Learning in Reinforcement Learning using Stable-Baseline3 | Transfer Reinforcement Learning for Differing Action Spaces via Q-Ne…☆22Feb 27, 2022Updated 4 years ago
- ☆19Aug 30, 2024Updated last year
- Super Mario Bros training with Ray RLlib DQN algorithm☆25May 22, 2021Updated 4 years ago
- An implementation of (Double/Dueling) Deep-Q Learning to play Super Mario Bros.☆74Apr 3, 2021Updated 5 years ago
- Test server code for Phi-2 model. support OpenAI API spec☆18Dec 15, 2023Updated 2 years ago
- Repository (preliminary codes) for DSTC10 SIMMC track.☆19Dec 9, 2022Updated 3 years ago
- Weapon-target assignment problem: a dynamic programming algorithm☆26Aug 27, 2021Updated 4 years ago
- Twisting Lids Off with Two Hands [CoRL 2024]☆41Mar 16, 2025Updated last year
- A voice chatbot built with Meta Llama 3 and Ollama Python Library☆16May 25, 2024Updated last year
- Trajectory optimization of hypersonic reentry vehicle☆19Jul 3, 2023Updated 2 years ago
- Neo4j Cybersecurity Demo☆19Mar 16, 2022Updated 4 years ago
- Official code for "Continual Prompt Tuning for Dialog State Tracking" (ACL 2022).☆27Mar 14, 2023Updated 3 years ago
- Apply LSTM neural network and reinforcement learning to trading Forex on mt5☆24Jul 4, 2022Updated 3 years ago
- ☆48Mar 27, 2026Updated last month
- Using deep reinforcement learning to play Snake game. The used algorithm is PPO for discrete! It has the brilliant performance in the fi…☆34Nov 3, 2025Updated 5 months ago
- Information and Materials for the Deep Learning Course☆31Jun 16, 2022Updated 3 years ago
- A game assistant for Don't Tap the White Tile (别踩白块儿), based on OpenCV and multithreading☆18Apr 21, 2021Updated 5 years ago
- ☆14Mar 24, 2021Updated 5 years ago
- Machine learning on knowledge graphs for context-aware security monitoring (data and model)☆18Mar 11, 2022Updated 4 years ago
- Qimen, named after the art of Qimen Dunjia (奇门遁甲), is a tool for extracting various kinds of entities.☆29Jan 12, 2020Updated 6 years ago
- A fork of Dana S. Nau's A Hierarchical Ordered Planner for Python, Pyhop☆64Nov 22, 2021Updated 4 years ago
- Implementation of ECO-DQN as reported in "Exploratory Combinatorial Optimization with Reinforcement Learning".☆81Oct 23, 2020Updated 5 years ago
- Interface definitions for the Compute@Edge platform in witx.☆15Feb 11, 2022Updated 4 years ago
- An AI analysis system focused on the Chinese commodity futures market☆57Nov 30, 2025Updated 5 months ago
- Code implementation for NeurIPS 2019 submission 'Reinforcement Learning for Integer Programming: Learning to Cut'☆41Jul 31, 2019Updated 6 years ago
- PoC of Swift for Compute@Edge☆12Feb 3, 2022Updated 4 years ago
- HAProxy combined with confd for HTTP load balancing with SSL offloading☆10Feb 5, 2017Updated 9 years ago
- ☆13Sep 11, 2024Updated last year
- An OpenAI Gym interface to Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on The NES☆851Aug 1, 2023Updated 2 years ago
- A repository for a Deep Q-Learning approach to intrusion detection for networks cyber-attacks.☆10Sep 3, 2021Updated 4 years ago
- ☆27Updated this week