CV-xueba / A05_rlView external linksLinks
本课程主要介绍强化学习的基础知识,其目标是帮助同学们快速、顺利地进入强化学习及其应用领域的研究工作。课程主要内容包含有限马尔可夫决策过程,动态规划,无模型预测与控制(SASA,Q-Learning),价值函数逼近(DQN),策略梯度方法(REINFORCE),执行者/评论者方法(AC,TRPO,PPO),连续动作空间的确定性策略(DDPG)。
☆18Oct 17, 2022Updated 3 years ago
Alternatives and similar repositories for A05_rl
Users that are interested in A05_rl are comparing it to the libraries listed below
Sorting:
- 利用遗传算法做基于客流需求的列车时刻表的优化☆15Apr 25, 2021Updated 4 years ago
- ☆43Dec 1, 2025Updated 2 months ago
- ☆10May 6, 2020Updated 5 years ago
- Reproduce the paper Distributed Representations of Sentences and Documents in tensorflow☆14Apr 8, 2017Updated 8 years ago
- ☆34Oct 29, 2025Updated 3 months ago
- 中文文本的向量表示方法(Sentence-BERT, CoSENT)的PyTorch简单实现,可以用于文本相似度计算。☆10Mar 27, 2022Updated 3 years ago
- 画出列车运行图,给出列车运行的最佳调度☆14Mar 9, 2020Updated 5 years ago
- A seq2seq with attention dialogue/MT model implemented by TensorFlow.☆11Jul 17, 2018Updated 7 years ago
- multi task learning for multi-classification using keras☆13Feb 10, 2020Updated 6 years ago
- ☆11Apr 16, 2023Updated 2 years ago
- A Keras-based and TensorFlow-backend NLP Models Toolkit.☆12Jul 7, 2022Updated 3 years ago
- My some projects during i learning ml☆13Jul 31, 2020Updated 5 years ago
- Markdown 语法文档 整理与修缮☆13Jun 25, 2019Updated 6 years ago
- Vietnamese Punctuation Prediction using Pretrained Language Models☆14May 8, 2022Updated 3 years ago
- Covert Keras models to Pytorch☆12Dec 22, 2018Updated 7 years ago
- ☆10Jul 13, 2019Updated 6 years ago
- Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"☆11Mar 31, 2022Updated 3 years ago
- Attentional Neural Network that translates text to phones.☆11Jan 25, 2018Updated 8 years ago
- GIT useful commands☆12Feb 24, 2017Updated 8 years ago
- Comparaison of adversarial training algorithms (FreeLB, FreeAT and K-PGD) on natural language tasks☆13Feb 14, 2020Updated 6 years ago
- ACL Paper Lists(machine translation)☆13Mar 23, 2022Updated 3 years ago
- Transport video using ROS image_transport to eliminate latency.☆20May 11, 2023Updated 2 years ago
- Implementation of the TD3 algorithm written in Pytorch☆12Dec 8, 2022Updated 3 years ago
- ☆14Jul 20, 2020Updated 5 years ago
- SimCSE☆15Oct 1, 2022Updated 3 years ago
- Official code for "Habibi: Laying the Open-Source Foundation of Unified-Dialectal Arabic Speech Synthesis"☆56Feb 3, 2026Updated last week
- bert-flat 简化版 添加了很多注释☆15Nov 25, 2021Updated 4 years ago
- ☆10Jun 13, 2023Updated 2 years ago
- Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"☆12Mar 31, 2022Updated 3 years ago
- A quick and dirty script to call LLaMA.cpp in Python. Supports streaming and interactive mode.☆13Apr 17, 2023Updated 2 years ago
- Learn Grafana 10.x, published by Packt Publishing☆20Feb 5, 2026Updated last week
- Next.js starter project☆17Jul 26, 2023Updated 2 years ago
- Implementation of paper accepted by EMNLP 2018 using Pytorch named "A Self-Attentive Model with Gate Mechanism for Spoken Language Unders…☆17Dec 11, 2018Updated 7 years ago
- A simple C++ Multi-file VSCode project template based on Makefile.☆16Oct 26, 2021Updated 4 years ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated last month
- This repository is a started repo for that uses NextJS, Supabase for Database & Paddle HQ for Payments.☆19Feb 28, 2023Updated 2 years ago
- Some notes about reinforce learning, self-driving cars and leetcode☆21Mar 26, 2022Updated 3 years ago
- 我维护的 uBlacklist 订阅列表☆20Dec 19, 2025Updated last month
- 智能型艾瑟雅机器人(IntelligentItheaBot):一个终末三问(末日时在做什么?有没有空?可以来拯救吗?)抽卡游戏qq机器人☆19Jul 11, 2022Updated 3 years ago