Implementation of the DQN and DDQN algorithms on Atari games such as BreakoutNoFrameskip-v4, PongNoFrameskip-v4, and BoxingNoFrameskip-v4.
☆15Jun 30, 2020Updated 5 years ago
Alternatives and similar repositories for DQN-pytorch-Atari
Users who are interested in DQN-pytorch-Atari are comparing it to the libraries listed below.
- Use DQN (PyTorch) to play Pong☆12May 30, 2021Updated 4 years ago
- [AAAI 2026] Few-step Flow for 3D Generation via Marginal-Data Transport Distillation☆50Jan 9, 2026Updated 2 months ago
- ☆17Oct 12, 2023Updated 2 years ago
- A simple human interface for human-in-the-loop machine learning research, which allows: 1. annotate images on a webpage, 2. collect human feed…☆14May 24, 2024Updated last year
- Materials for "Prompting is not a substitute for probability measurements in large language models" (EMNLP 2023)☆24Oct 24, 2023Updated 2 years ago
- Official code repository of '3DreamBooth: High-Fidelity 3D Subject-Driven Video Generation Model'☆47Mar 20, 2026Updated last week
- A simple baseline for mountain-car @ gym☆11Jan 15, 2020Updated 6 years ago
- ☆12Apr 27, 2018Updated 7 years ago
- A SpringBoot-Dubbo demo!☆15Aug 26, 2024Updated last year
- Deep Q-Learning (DQN) implementation for Atari pong.☆85Nov 22, 2022Updated 3 years ago
- ☆67Oct 18, 2025Updated 5 months ago
- DQN with PyTorch on Breakout and SpaceInvaders☆27Aug 13, 2019Updated 6 years ago
- ☆15May 24, 2023Updated 2 years ago
- A psycholinguistic modeling toolkit☆31Mar 5, 2026Updated 3 weeks ago
- Udacity Deep Reinforcement Learning Nanodegree Program☆11Jul 12, 2019Updated 6 years ago
- Playing Mountain-Car without reward engineering, by combining DQN and Random Network Distillation (RND)☆41Jan 28, 2019Updated 7 years ago
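- A powerful MCP (Model Context Protocol) development framework: a modular tool framework for SSE integration. It lets developers easily create and extend custom tools, supports JWT authentication, and interacts with models through the MCP protocol.☆17May 15, 2025Updated 10 months ago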
- Source and solution codes for Professional CUDA C Programming book.☆15Aug 20, 2020Updated 5 years ago
- ☆12Feb 23, 2023Updated 3 years ago
- Minimal example to apply Decision Transformer in Atari Pong☆15Feb 1, 2025Updated last year
- Timing prediction dataset download and instructions.☆18Jun 7, 2023Updated 2 years ago
- General-Purpose Reinforcement Learning☆18Oct 31, 2021Updated 4 years ago
- [NeurIPS 2024 Spotlight] Scalable and Effective Arithmetic Tree Generation for Adder and Multiplier Designs☆15Feb 22, 2026Updated last month
- This is a concise PyTorch implementation of Rainbow DQN, including Double Q-learning, Dueling network, Noisy network, PER and n-steps Q-l…☆37Jul 23, 2022Updated 3 years ago
- Official implementation of NeurIPS'24 paper "Reinforcement Learning Policy as Macro Regulator Rather than Macro Placer".☆20Aug 13, 2025Updated 7 months ago
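- MydockFinder is software that closely emulates Mac OS. It is not a Hackintosh system and does not require reinstalling the operating system: run the program and complete the personalization settings to make the system look like Mac OS. Once it is set to launch at startup, the Mac OS animation effects are available at any time.☆19Nov 5, 2023Updated 2 years ago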
- Mirror of the Si2 LEF/DEF parser (v5.8)☆19Oct 8, 2021Updated 4 years ago
- Interface definitions for the Compute@Edge platform in witx.☆15Feb 11, 2022Updated 4 years ago
- ☆17Mar 4, 2024Updated 2 years ago
- Incremental Convex Hull Algorithm and SAT Collision Detection for 3D Objects.☆19Jun 19, 2021Updated 4 years ago
- ☆103Oct 17, 2025Updated 5 months ago
- 🎳 Environments for Reinforcement Learning☆63Feb 5, 2026Updated last month
- PoC of Swift for Compute@Edge☆12Feb 3, 2022Updated 4 years ago
- HAProxy combined with confd for HTTP load balancing with SSL offloading☆10Feb 5, 2017Updated 9 years ago
- Implementation of SAC and TD3 based on various RNN and Transformer.☆30Sep 28, 2024Updated last year
- PyTorch code to train and evaluate Procgen tasks☆25Nov 1, 2020Updated 5 years ago
- This project shall be based on setting up of Google Football Research Environment as an OpenAI gym for code purposes.☆20Jul 4, 2019Updated 6 years ago
- ☆53Nov 10, 2022Updated 3 years ago
- A repository for a Deep Q-Learning approach to intrusion detection for networks cyber-attacks.☆10Sep 3, 2021Updated 4 years ago