Implement the DQN and DDQN algorithms on Atari games, such as BreakoutNoFrameskip-v4, PongNoFrameskip-v4, and BoxingNoFrameskip-v4.
☆15Jun 30, 2020Updated 5 years ago
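For readers comparing the two algorithms named in the description, the difference between DQN and DDQN is in how the bootstrap target is computed. The sketch below is illustrative only and is not taken from the repository: the function names `dqn_target` and `ddqn_target` and their arguments are hypothetical, and plain Python lists stand in for the Q-value tensors a PyTorch implementation would use.

```python
from typing import Sequence


def dqn_target(reward: float, done: bool, gamma: float,
               q_target_next: Sequence[float]) -> float:
    """DQN target: the target network both selects and evaluates the next action."""
    if done:
        # No bootstrapping from a terminal state.
        return reward
    return reward + gamma * max(q_target_next)


def ddqn_target(reward: float, done: bool, gamma: float,
                q_online_next: Sequence[float],
                q_target_next: Sequence[float]) -> float:
    """Double DQN target: the online network selects, the target network evaluates."""
    if done:
        return reward
    # Greedy action according to the online network's Q-values.
    best_action = max(range(len(q_online_next)), key=lambda a: q_online_next[a])
    # Value of that action according to the target network.
    return reward + gamma * q_target_next[best_action]


if __name__ == "__main__":
    q_online = [0.0, 1.0, 5.0]   # hypothetical online-network Q-values for s'
    q_target = [1.0, 4.0, 2.0]   # hypothetical target-network Q-values for s'
    print(dqn_target(1.0, False, 0.5, q_target))
    print(ddqn_target(1.0, False, 0.5, q_online, q_target))
```

Decoupling action selection from action evaluation is what lets Double DQN reduce the overestimation bias of the plain `max` in the DQN target.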
Alternatives and similar repositories for DQN-pytorch-Atari
Users that are interested in DQN-pytorch-Atari are comparing it to the libraries listed below.
- Implement the PPO algorithm on MuJoCo environments, such as Ant-v2, Humanoid-v2, Hopper-v2, and HalfCheetah-v2.☆58Jun 30, 2020Updated 5 years ago
- Play the Atari game Breakout with DRL: DQN, Noisy DQN, and A3C☆15May 30, 2020Updated 6 years ago
- Use DQN (PyTorch) to play Pong☆12May 30, 2021Updated 5 years ago
- ☆17Oct 12, 2023Updated 2 years ago
- ☆10May 15, 2020Updated 6 years ago
- A simple human interface for human-in-the-loop machine learning research, which allows: 1. annotate images on a webpage, 2. collect human feed…☆14May 24, 2024Updated 2 years ago
- A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].☆16Oct 21, 2023Updated 2 years ago
- An experimental implementation of Burrows' Delta in Python 3☆22Oct 1, 2021Updated 4 years ago
- Materials for "Prompting is not a substitute for probability measurements in large language models" (EMNLP 2023)☆24Oct 24, 2023Updated 2 years ago
- PyTorch implementation of Vanilla PG, TNPG, TRPO, and PPO on MuJoCo environments☆15Jul 1, 2018Updated 7 years ago
- A simple baseline for mountain-car @ gym☆12Jan 15, 2020Updated 6 years ago
- A SpringBoot-Dubbo demo!☆16Aug 26, 2024Updated last year
- Implementation of Russo and Van Roy's work on Information-Directed Sampling (2017)☆21Jan 18, 2019Updated 7 years ago
- DQN with PyTorch on Breakout and SpaceInvaders☆27Aug 13, 2019Updated 6 years ago
- Solving the OpenAI Gym (MountainCarContinuous-v0) with DDPG☆21Jan 23, 2023Updated 3 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Jan 28, 2019Updated 7 years ago
- Codebase of NeurIPS 2022 paper ''Planning for Sample Efficient Imitation Learning''☆41Oct 25, 2022Updated 3 years ago
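- A powerful MCP (Model Context Protocol) development framework: a modular tool framework for SSE integration. It lets developers easily create and extend custom tools, supports JWT authentication, and interacts with models through the MCP protocol.☆17May 15, 2025Updated last year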
- Minimal example to apply Decision Transformer in Atari Pong☆15Feb 1, 2025Updated last year
- Train and test Smile-Detector machine learning models / Train on and detect smiling faces with machine learning models☆32Feb 26, 2019Updated 7 years ago
- https://sites.google.com/site/multidimensionaltagger☆38Dec 6, 2023Updated 2 years ago
- ARMv6 binary build for Trojan-GFW (run trojan on a Raspberry Pi)☆17Mar 10, 2020Updated 6 years ago
- General-Purpose Reinforcement Learning☆18Oct 31, 2021Updated 4 years ago
- Fully automated transfer of files from jBox to the new-generation Jiaotong University cloud drive (交大云盘)☆38Mar 29, 2025Updated last year
- MydockFinder is software that closely mimics Mac OS. It is not a Hackintosh system and does not require reinstalling the operating system: just run the program and complete the personalization settings to make the system look like Mac OS. Once it is set to launch at startup, you can experience Mac OS animation effects at any time.☆21Nov 5, 2023Updated 2 years ago
- Interface definitions for the Compute@Edge platform in witx.☆15Feb 11, 2022Updated 4 years ago
- Incremental Convex Hull Algorithm and SAT Collision Detection for 3D Objects.☆19Jun 19, 2021Updated 5 years ago
- 🎳 Environments for Reinforcement Learning☆65Feb 5, 2026Updated 4 months ago
- PoC of Swift for Compute@Edge☆12Feb 3, 2022Updated 4 years ago
- HAProxy combined with confd for HTTP load balancing with SSL offloading☆10Feb 5, 2017Updated 9 years ago
- 🎾 Multi-Agent Proximal Policy Optimization approach to a competitive reinforcement learning problem☆22Sep 25, 2022Updated 3 years ago
- Implementation of SAC and TD3 based on various RNNs and Transformers.☆31Sep 28, 2024Updated last year
- PyTorch code to train and evaluate Procgen tasks☆25Nov 1, 2020Updated 5 years ago
- This project sets up the Google Research Football Environment as an OpenAI Gym environment for coding purposes.☆20Jul 4, 2019Updated 6 years ago
- A repository for a Deep Q-Learning approach to intrusion detection for network cyber-attacks.☆10Sep 3, 2021Updated 4 years ago
- ☆54Nov 10, 2022Updated 3 years ago
- A fork of the Linux kernel for NVMEoF target driver using PCI P2P capabilities for full I/O path offloading.☆15Jun 20, 2021Updated 4 years ago
- An OpenAI gym multi-agent environment implementing the Commons Game proposed in "A multi-agent reinforcement learning model of common-poo…☆22Jun 20, 2020Updated 5 years ago
- ☆11Jan 11, 2022Updated 4 years ago