强化学习经典算法(offline\online learning, q-learning, DQN)的实现在平衡杆游戏和几个Atari 游戏 (CartPole\Pong\Boxing\MsPacman)
☆33Aug 8, 2018Updated 7 years ago
Alternatives and similar repositories for Reinforcment-Leanring-algorithm
Users that are interested in Reinforcment-Leanring-algorithm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 本项目是同济大学人工智能课程的第二次大作业——五子棋问题,内含工程文件与报告。必须要说明的是,我上传这次作业的主要目的是抛砖引玉,以期学弟学妹在做作业的过程中少走弯路,报告内容也仅供参考,切勿全局抄袭,否则后果自负。如果认为这个工程有帮助的话,希望各位能给我点一个star,…☆14Jul 16, 2020Updated 5 years ago
- Gobang MCTS :蒙特卡洛搜索树使用C++实现五子棋AI算法 ——同济大学☆11Nov 15, 2023Updated 2 years ago
- ☆12Feb 20, 2021Updated 5 years ago
- Project explores collaboration capabilities of VDN and IQL agents on a custom MARL Food Collector environment☆11Apr 6, 2022Updated 4 years ago
- [MIG2021] Deep Reinforcement Learning with Particle Filtering Policy Network for Physics-Based Character Control☆18Feb 25, 2022Updated 4 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆14Dec 4, 2018Updated 7 years ago
- MATLAB codes for cognitive radio which we used through the year☆11Jul 18, 2018Updated 7 years ago
- 同济大学软件学院数据结构课程作业,含10个实验,期末论文,深度学习加分项☆11Dec 9, 2022Updated 3 years ago
- 同济大学操作系统课程小学期课设:基于Rust的多任务操作系统的设计和实现。仅供学习参考。An Operating System Designed and Implemented in Rust lang.☆12Feb 20, 2024Updated 2 years ago
- 2023 同济大学 计算机网络 课程☆16Jan 9, 2024Updated 2 years ago
- ☆14Jul 4, 2022Updated 3 years ago
- 操作系统进程管理项目之电梯调度,写的比较简单☆16May 24, 2021Updated 4 years ago
- [一个聊天软件Demo] a chat software powered by libevent/mysql and qt☆10Sep 10, 2021Updated 4 years ago
- ☆11Apr 26, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- UE4☆17Jul 19, 2021Updated 4 years ago
- A deep reinforcement learning based approach is used to allocate downlink power for multi-cell wireless system.☆23Feb 21, 2020Updated 6 years ago
- ☆14Jun 19, 2024Updated last year
- A High-Quality Diabetic Retinopathy Pixel-Level Annotation Dataset☆17Dec 9, 2025Updated 5 months ago
- intelligent-virtual agent authoring toolkit with interoperability of rocketbox , cc4 and didimo characters☆30Apr 21, 2026Updated last month
- 2022秋-同济大学软件学院-分布式系统课程项目☆11Jun 29, 2023Updated 2 years ago
- Joint spectrum and power allocation for cognitive capacity harvested network with using DQN learning method☆24Sep 15, 2019Updated 6 years ago
- Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020☆16Jun 22, 2022Updated 3 years ago
- [NAACL 2025🔥] MEDA: Dynamic KV Cache Allocation for Efficient Multimodal Long-Context Inference☆20Jun 19, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- SCV is a distributed cluster GPU sniffer. SCV是一个分布式GPU嗅探器☆20Feb 25, 2023Updated 3 years ago
- Place to experiment with using facial detection on HoloLens in research mode☆12Jun 19, 2018Updated 7 years ago
- 兰州大学在线OJ判题平台项目【代码沙箱】☆19Jul 19, 2024Updated last year
- Energy Detection Algorithm for Cognitive Radio☆30Apr 10, 2015Updated 11 years ago
- 操作系统第三次课程项目,一个简单的文件系统☆12Jun 24, 2021Updated 4 years ago
- This software is a module for 3D Slicer to perform the accuracy test of a tracking system as described in the ASTM standard F2554.☆15Jan 28, 2026Updated 3 months ago
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
- PyTorch Implementation of Visual GAIL in Atari Games☆14Dec 7, 2022Updated 3 years ago
- 同济大学软件学院《计算机系统结构》复习笔记☆12Jun 19, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 点云配准入门知识☆10Nov 14, 2019Updated 6 years ago
- SLAM simulation in Unity 3D☆18May 9, 2016Updated 10 years ago
- Mixed Reality Hololens 2 application, Fracture Surgery Assistant, (Mixed Reality Lab, ETH A.Y. 2019/2020)☆11Apr 21, 2024Updated 2 years ago
- ☆34May 25, 2020Updated 5 years ago
- ☆34Jan 17, 2025Updated last year
- 电梯调度,操作系统课程作业☆18Jun 26, 2018Updated 7 years ago
- Keras implementation of DQN for the MsPacman-v0 OpenAI Gym environment.☆36Dec 8, 2022Updated 3 years ago