Implementations of classic reinforcement learning algorithms (offline/online learning, Q-learning, DQN) on the CartPole pole-balancing game and several Atari games (CartPole/Pong/Boxing/MsPacman)
☆33Aug 8, 2018Updated 7 years ago
Alternatives and similar repositories for Reinforcment-Leanring-algorithm
Users that are interested in Reinforcment-Leanring-algorithm are comparing it to the libraries listed below.
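- This project is the second major assignment of Tongji University's Artificial Intelligence course, the Gomoku (five-in-a-row) problem, and includes the project files and report. I must note that my main purpose in uploading this assignment is to offer a starting point so that junior students can avoid detours while doing the assignment. The report is for reference only; do not copy it wholesale, or you bear the consequences. If you find this project helpful, I hope you will give it a star,…☆14Jul 16, 2020Updated 5 years ago
- Gobang MCTS: a Gomoku AI algorithm implemented in C++ using Monte Carlo tree search (Tongji University)☆12Nov 15, 2023Updated 2 years ago
- Tongji University 2023-2024 Software Engineering course materials and notes☆31Jan 14, 2024Updated 2 years ago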
- Project explores collaboration capabilities of VDN and IQL agents on a custom MARL Food Collector environment☆11Apr 6, 2022Updated 4 years ago
- ☆14Dec 4, 2018Updated 7 years ago
- Simulation Study of Double Threshold Energy Detection Method for Cognitive Radios☆14Aug 11, 2018Updated 7 years ago
- MATLAB codes for cognitive radio which we used through the year☆11Jul 18, 2018Updated 7 years ago
- Neural Time Series Analysis☆14Nov 21, 2022Updated 3 years ago
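- Assignments and notes for the 2024 Data Analysis and Data Mining elective (network track) at Tongji University's School of Software Engineering 🌸~☆21Jun 20, 2024Updated last year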
- Official Code Repo for the paper "Learning to Play Atari in a World of Tokens" accepted at ICML, 2024☆11Jun 6, 2024Updated last year
- ☆14Aug 5, 2020Updated 5 years ago
- 2023 Tongji University Computer Networks course☆16Jan 9, 2024Updated 2 years ago
- ☆14Jul 4, 2022Updated 3 years ago
- This contains joint channel and power allocation scheme for a full duplex cognitive radio network underlying a cellular network☆25Oct 20, 2017Updated 8 years ago
- A modular implementation of Proximal Policy Optimization in TensorFlow 2 using eager execution for the Super Mario Bros environment.☆21Nov 6, 2019Updated 6 years ago
- Elevator scheduling, an operating systems process management project; written fairly simply☆16May 24, 2021Updated 4 years ago
- A deep reinforcement learning based approach is used to allocate downlink power for multi-cell wireless system.☆25Feb 21, 2020Updated 6 years ago
- ☆14Jun 19, 2024Updated last year
- MATLAB files of modulation classification in cognitive radios☆24Jul 12, 2016Updated 9 years ago
- Stock trading with reinforcement learning: reach the peak of life (or go bankrupt)☆56Mar 8, 2022Updated 4 years ago
- This is our final-year bachelor's project. We try to avoid congestion on two levels, i.e. Intersection level and Infrastructure to Vehicl…☆10Oct 12, 2020Updated 5 years ago
- Implementation for "ROLL: Visual Self-Supervised Reinforcement Learning with Object Reasoning", CoRL 2020☆16Jun 22, 2022Updated 3 years ago
- Lanzhou University online judge (OJ) platform project [code sandbox]☆19Jul 19, 2024Updated last year
- Energy Detection Algorithm for Cognitive Radio☆30Apr 10, 2015Updated 11 years ago
- ☆12Jan 3, 2022Updated 4 years ago
- This software is a module for 3D Slicer to perform the accuracy test of a tracking system as described in the ASTM standard F2554.☆15Jan 28, 2026Updated 2 months ago
- ☆19May 22, 2021Updated 4 years ago
- Code and Experiments for L4DC 2021 Paper: "Learning Visually Guided Latent Actions for Assistive Teleoperation"☆13May 4, 2021Updated 4 years ago
- Constrained episodic reinforcement learning in concave-convex and knapsack settings☆11Oct 3, 2023Updated 2 years ago
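- PyTorch implementation of R2D2 (Recurrent Replay Distributed DQN)☆12Nov 14, 2019Updated 6 years ago
- Image recognition, classification, and deduplication software based on Caffe, developed with MFC. A copyright registration has been filed for the software, but it is free for personal research use; for commercial use, contact us: muyouhang@gmail.com☆14Dec 29, 2016Updated 9 years ago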
- Deep reinforcement learning in autonomous driving☆12Aug 25, 2021Updated 4 years ago
- A set of algorithms and environments to train SafeRL agents, written in TensorFlow2 and OpenAI Gym.☆12Jul 26, 2022Updated 3 years ago
- Notes and code from learning reinforcement learning☆12Jul 27, 2020Updated 5 years ago
- Introductory knowledge on point cloud registration☆10Nov 14, 2019Updated 6 years ago
- Code for ICRA 2018 paper - Interactive Robot Knowledge Patching using Augmented Reality☆14Aug 22, 2018Updated 7 years ago
- SLAM simulation in Unity 3D☆19May 9, 2016Updated 9 years ago
- ☆14Aug 26, 2018Updated 7 years ago
- Mixed Reality Hololens 2 application, Fracture Surgery Assistant, (Mixed Reality Lab, ETH A.Y. 2019/2020)☆11Apr 21, 2024Updated last year