强化学习经典算法(offline\online learning, q-learning, DQN)的实现在平衡杆游戏和几个Atari 游戏 (CartPole\Pong\Boxing\MsPacman)
☆34Aug 8, 2018Updated 7 years ago
Alternatives and similar repositories for Reinforcment-Leanring-algorithm
Users that are interested in Reinforcment-Leanring-algorithm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 本项目是同济大学人工智能课程的第二次大作业——五子棋问题,内含工程文件与报告。必须要说明的是,我上传这次作业的主要目的是抛砖引玉,以期学弟学妹在做作业的过程中少走弯路,报告内容也仅供参考,切勿全局抄袭,否则后果自负。如果认为这个工程有帮助的话,希望各位能给我点一个star,…☆14Jul 16, 2020Updated 5 years ago
- 同济大学2023-2024软件工程课程资料与笔记☆31Jan 14, 2024Updated 2 years ago
- Project explores collaboration capabilities of VDN and IQL agents on a custom MARL Food Collector environment☆11Apr 6, 2022Updated 4 years ago
- [MIG2021] Deep Reinforcement Learning with Particle Filtering Policy Network for Physics-Based Character Control☆18Feb 25, 2022Updated 4 years ago
- ☆14Dec 4, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MATLAB codes for cognitive radio which we used through the year☆11Jul 18, 2018Updated 7 years ago
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024☆11Jun 6, 2024Updated 2 years ago
- 同济大学操作系统课程小学期课设:基于Rust的多任务操作系统的设计和实现。仅供学习参考。An Operating System Designed and Implemented in Rust lang.☆11Feb 20, 2024Updated 2 years ago
- ☆14Jul 4, 2022Updated 3 years ago
- 操作系统进程管理项目之电梯调度,写的比较简单☆16May 24, 2021Updated 5 years ago
- 人工智能大作业,剪枝算法五子棋☆13Nov 23, 2020Updated 5 years ago
- A deep reinforcement learning based approach is used to allocate downlink power for multi-cell wireless system.☆23Feb 21, 2020Updated 6 years ago
- 同济大学2022-2023第二学期计算机视觉课程作业☆14Jun 27, 2023Updated 3 years ago
- ☆14Jun 19, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is a part of MATLAB implementation of the paper "Machine Learning Techniques for Cooperative Spectrum Sensing in Cognitive Radio Net…☆24Oct 1, 2020Updated 5 years ago
- MATLAB files of modulation classification in cognitive radios☆25Jul 12, 2016Updated 9 years ago
- 强化学习炒股,走向人生巅峰(或倾家荡产)☆56Mar 8, 2022Updated 4 years ago
- This is our Final Year Project in Bachelors. We try to avoid congestion on two levels i.e Intersection level and Infrastructure to Vehicl…☆10Oct 12, 2020Updated 5 years ago
- 2022秋-同济大学软件学院-分布式系统课程项目☆10Jun 29, 2023Updated 3 years ago
- Reinforcement Learning from Hierarchical Critics☆14Jul 30, 2020Updated 5 years ago
- Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020☆16Jun 22, 2022Updated 4 years ago
- 兰州大学在线OJ判题平台项目【代码沙箱】☆19Jul 19, 2024Updated last year
- ☆12Jan 3, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding☆37Jul 22, 2025Updated 11 months ago
- Catch game example is translated by TensorFlow☆16May 8, 2017Updated 9 years ago
- 同济大学软件学院《计算机系统结构》复习笔记☆12Jun 19, 2025Updated last year
- 学习强化学习过程中的笔记和代码☆12Jul 27, 2020Updated 5 years ago
- 点云配准入门知识☆10Nov 14, 2019Updated 6 years ago
- ☆14Aug 26, 2018Updated 7 years ago
- 电梯调度,操作系统课程作业☆18Jun 26, 2018Updated 8 years ago
- Keras implementation of DQN for the MsPacman-v0 OpenAI Gym environment.☆36Dec 8, 2022Updated 3 years ago
- Learn Microservices with Spring Boot (2nd edition) - Chapter 6☆23Feb 13, 2026Updated 4 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A new way to pygame.☆10Oct 25, 2015Updated 10 years ago
- 同济大学软件学院2020-2021学年下 软件测试作业☆19Jul 1, 2021Updated 4 years ago
- Reinforcement Leanring Algorithms Trained with Unity☆13Apr 26, 2019Updated 7 years ago
- [AAAI 2022] PyTorch implementation of Sim2Real Object-Centric Keypoint Detection and Description☆17Jul 3, 2022Updated 3 years ago
- ☆96May 13, 2026Updated last month
- RTKLIB 手册解读与源码解析,涵盖工具使用、算法解析和工程优化,助力开发者与研究者深入探索 GNSS 算法。☆51Jul 17, 2025Updated 11 months ago
- Code for training policies based on paper Coordinated Multi-Agent Imitation Learning☆26Aug 7, 2017Updated 8 years ago