本课程主要介绍强化学习的基础知识,其目标是帮助同学们快速、顺利地进入强化学习及其应用领域的研究工作。课程主要内容包含有限马尔可夫决策过程,动态规划,无模型预测与控制(SASA,Q-Learning),价值函数逼近(DQN),策略梯度方法(REINFORCE),执行者/评论者方法(AC,TRPO,PPO),连续动作空间的确定性策略(DDPG)。
☆18Oct 17, 2022Updated 3 years ago
Alternatives and similar repositories for A05_rl
Users that are interested in A05_rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 利用遗传算法做基于客流需求的列车时刻表的优化☆15Apr 25, 2021Updated 4 years ago
- ☆10Jun 13, 2023Updated 2 years ago
- ☆10Jul 13, 2019Updated 6 years ago
- ☆41Oct 29, 2025Updated 4 months ago
- 画出列车运行图,给出列车运行的最佳调度☆14Mar 9, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Transport video using ROS image_transport to eliminate latency.☆20May 11, 2023Updated 2 years ago
- ☆46Mar 12, 2026Updated 2 weeks ago
- Some notes about reinforce learning, self-driving cars and leetcode☆21Mar 26, 2022Updated 4 years ago
- Implementation of the TD3 algorithm written in Pytorch☆12Dec 8, 2022Updated 3 years ago
- Markdown 语法文档 整理与修缮☆13Jun 25, 2019Updated 6 years ago
- ☆11Apr 16, 2023Updated 2 years ago
- A Benchmark and Evaluation Suite for Zero-shot Singing Voice Synthesis☆24Feb 11, 2026Updated last month
- 基于 MoveIt2 的手眼标定(Hand-Eye Calibration)软件☆43Aug 16, 2025Updated 7 months ago
- 基于 迁移学习的离心泵滚动轴承故障自动识别方法研究☆20May 29, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆22Jan 8, 2020Updated 6 years ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 2 months ago
- 这是高华的部分。列车运行图综合运用系统☆20Dec 8, 2022Updated 3 years ago
- A simple C++ Multi-file VSCode project template based on Makefile.☆16Oct 26, 2021Updated 4 years ago
- Standardized compatibility layer for operating systems and peripheral devices written in C++.☆38Updated this week
- 基于强化学习的炼钢动态调度求解技术和软件实现☆23Apr 26, 2020Updated 5 years ago
- A Keras-based and TensorFlow-backend NLP Models Toolkit.☆12Jul 7, 2022Updated 3 years ago
- Reproduce the paper Distributed Representations of Sentences and Documents in tensorflow☆14Apr 8, 2017Updated 8 years ago
- 电巢实训文件☆27Mar 28, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Attentional Neural Network that translates text to phones.☆11Jan 25, 2018Updated 8 years ago
- Domain Adaptive Neural Networks with DJP-MMD☆20Sep 22, 2021Updated 4 years ago
- A seq2seq with attention dialogue/MT model implemented by TensorFlow.☆11Jul 17, 2018Updated 7 years ago
- A Reinforcement Learning Friendly Simulator for Mobile Robot☆25Apr 27, 2025Updated 11 months ago
- 中文文本的向量表示方法(Sentence-BERT, CoSENT)的PyTorch简单实现,可以用于文本相似度计算。☆10Mar 27, 2022Updated 4 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 4 months ago
- Markov Chain Monte Carlo MCMC methods are implemented in various languages (including R, Python, Julia, Matlab)☆30Jun 20, 2023Updated 2 years ago
- ☆10May 6, 2020Updated 5 years ago
- ACL Paper Lists(machine translation)☆13Mar 23, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆24Jul 30, 2022Updated 3 years ago
- ☆23Mar 26, 2025Updated last year
- Vietnamese Punctuation Prediction using Pretrained Language Models☆14May 8, 2022Updated 3 years ago
- 我维护的 uBlacklist 订阅列表☆21Feb 23, 2026Updated last month
- GIT useful commands☆12Feb 24, 2017Updated 9 years ago
- multi task learning for multi-classification using keras☆13Feb 10, 2020Updated 6 years ago
- 智能型艾瑟雅机器人(IntelligentItheaBot):一个终末三问(末日时在做什么?有没有空?可以来拯 救吗?)抽卡游戏qq机器人☆19Jul 11, 2022Updated 3 years ago