CV-xueba/A05_rl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CV-xueba/A05_rl)

CV-xueba / A05_rl

本课程主要介绍强化学习的基础知识，其目标是帮助同学们快速、顺利地进入强化学习及其应用领域的研究工作。课程主要内容包含有限马尔可夫决策过程，动态规划，无模型预测与控制(SASA,Q-Learning)，价值函数逼近(DQN)，策略梯度方法(REINFORCE)，执行者/评论者方法（AC,TRPO,PPO)，连续动作空间的确定性策略(DDPG)。

☆18

Alternatives and similar repositories for A05_rl

Users that are interested in A05_rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

DDzzxiaohongdou / GA_BUS_TIME
View on GitHub
利用遗传算法做基于客流需求的列车时刻表的优化
☆15Apr 25, 2021Updated 5 years ago
Shaocr / Train-Schedule
View on GitHub
画出列车运行图，给出列车运行的最佳调度
☆15Mar 9, 2020Updated 6 years ago
pfb2008999 / deep-learning-fault-diagnosis
View on GitHub
☆10Jul 13, 2019Updated 7 years ago
xxblxs / chatpdf-demo
View on GitHub
☆10Jun 13, 2023Updated 3 years ago
zhouie / markdown-doc
View on GitHub
Markdown 语法文档整理与修缮
☆13Jun 25, 2019Updated 7 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
djbyrne / TD3
View on GitHub
Implementation of the TD3 algorithm written in Pytorch
☆12Dec 8, 2022Updated 3 years ago
SunHaoOne / RLsilde
View on GitHub
Some notes about reinforce learning, self-driving cars and leetcode
☆22Mar 26, 2022Updated 4 years ago
gaohua-lingzhongyu / TrainDiagramDataUseSystem
View on GitHub
这是高华的部分。列车运行图综合运用系统
☆21Dec 8, 2022Updated 3 years ago
alliens / -
View on GitHub
基于迁移学习的离心泵滚动轴承故障自动识别方法研究
☆20May 29, 2020Updated 6 years ago
diagnosisda / dxda
View on GitHub
☆22Jan 8, 2020Updated 6 years ago
Digi-Metal / Reinforce-learning-based-algorithm-for-dynamic-scheduling-problem-in-steelmaking-workshop
View on GitHub
基于强化学习的炼钢动态调度求解技术和软件实现
☆25Apr 26, 2020Updated 6 years ago
HighCWu / rwkv-paddle
View on GitHub
☆11Apr 16, 2023Updated 3 years ago
landian60 / OpenClaw-LiveAsset
View on GitHub
☆16Apr 8, 2026Updated 3 months ago
AreteQin / compressed-video-transport
View on GitHub
Transport video using ROS image_transport to eliminate latency.
☆20May 11, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Locietta / vscode-makefile-template
View on GitHub
A simple C++ Multi-file VSCode project template based on Makefile.
☆16Oct 26, 2021Updated 4 years ago
4AI / langml
View on GitHub
A Keras-based and TensorFlow-backend NLP Models Toolkit.
☆12Jul 7, 2022Updated 4 years ago
menajosep / tensorflow-doc2vec
View on GitHub
Reproduce the paper Distributed Representations of Sentences and Documents in tensorflow
☆14Apr 8, 2017Updated 9 years ago
chamwen / DaNN_DJP
View on GitHub
Domain Adaptive Neural Networks with DJP-MMD
☆20Sep 22, 2021Updated 4 years ago
liu115 / QuickSplat
View on GitHub
QuickSplat: Fast 3D Surface Reconstruction via Learned Gaussian Initialization
☆25Nov 11, 2025Updated 8 months ago
johncf / text2phones
View on GitHub
Attentional Neural Network that translates text to phones.
☆11Jan 25, 2018Updated 8 years ago
ictnlp / GMA
View on GitHub
Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"
☆11Mar 31, 2022Updated 4 years ago
jinxulin / chinese-text2vec
View on GitHub
中文文本的向量表示方法（Sentence-BERT, CoSENT）的PyTorch简单实现，可以用于文本相似度计算。
☆10Mar 27, 2022Updated 4 years ago
26hzhang / SequenceToSequence
View on GitHub
A seq2seq with attention dialogue/MT model implemented by TensorFlow.
☆11Jul 17, 2018Updated 8 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
Fight-hawk / TextCNN-keras
View on GitHub
☆10May 6, 2020Updated 6 years ago
MTSchool / MT-paper-list-of-ACL
View on GitHub
ACL Paper Lists(machine translation)
☆13Mar 23, 2022Updated 4 years ago
heraclex12 / vietpunc
View on GitHub
Vietnamese Punctuation Prediction using Pretrained Language Models
☆14May 8, 2022Updated 4 years ago
bhushangawde / Fault-Diagnosis-of-Roller-Bearings
View on GitHub
☆24Jul 30, 2022Updated 3 years ago
Steventian-wen / Robot-control
View on GitHub
控制算法
☆25May 21, 2025Updated last year
xgnd / IntelligentIthea
View on GitHub
智能型艾瑟雅机器人（IntelligentItheaBot）：一个终末三问（末日时在做什么？有没有空？可以来拯救吗？）抽卡游戏qq机器人
☆19Jul 11, 2022Updated 4 years ago
zang-langyan / Markov-Chain-Monte-Carlo-MCMC
View on GitHub
Markov Chain Monte Carlo MCMC methods are implemented in various languages (including R, Python, Julia, Matlab)
☆29Jun 20, 2023Updated 3 years ago
CuiShaohua / MultiTaskLearning
View on GitHub
multi task learning for multi-classification using keras
☆13Feb 10, 2020Updated 6 years ago
koking0 / MLLM-Algorithm-Application-Finetuning
View on GitHub
☆24Mar 26, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
clearlab-sustech / MujocoTutorial
View on GitHub
☆31Apr 5, 2025Updated last year
seanzhang-zhichen / simcse-pytorch
View on GitHub
SimCSE
☆15Oct 1, 2022Updated 3 years ago
XinJingHao / Sparrow-V1
View on GitHub
A Reinforcement Learning Friendly Simulator for Mobile Robot
☆30Apr 27, 2025Updated last year
dhruvramani / keras2pytorch
View on GitHub
Covert Keras models to Pytorch
☆12Dec 22, 2018Updated 7 years ago
PierreGtch / AMAL-project
View on GitHub
Comparaison of adversarial training algorithms (FreeLB, FreeAT and K-PGD) on natural language tasks
☆12Feb 14, 2020Updated 6 years ago
Hongyang-Du / awesome-3d-datasets
View on GitHub
[CVPRW'26] A collection and survey of 3d dataset
☆34Jun 4, 2026Updated last month
ictnlp / Dual-Path
View on GitHub
Code for ACL 2022 main conference paper "Modeling Dual Read/Write Paths for Simultaneous Machine Translation"
☆12Mar 31, 2022Updated 4 years ago