chenyitian/reinforcement-learning-an-introduction

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chenyitian/reinforcement-learning-an-introduction)

chenyitian / reinforcement-learning-an-introduction

《Reinforcement Learning: An Introduction》（第二版）中文翻译

☆57

Alternatives and similar repositories for reinforcement-learning-an-introduction

Users that are interested in reinforcement-learning-an-introduction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ethanluoyc / transporter-pytorch
View on GitHub
Transporter implementation in PyTorch
☆20Jul 24, 2019Updated 6 years ago
Jittor / TrafficSignDetection
View on GitHub
☆15Mar 6, 2021Updated 5 years ago
qiwihui / spinningup
View on GitHub
OpenAI团队的深度强化学习教程中文版
☆36May 16, 2020Updated 6 years ago
stanford-iprl-lab / Concept2Robot
View on GitHub
simulations used in "Concept2Robot: Learning Manipulation Concepts from Instructions and Human Demonstrations"
☆28Jan 1, 2023Updated 3 years ago
zhangdddong / beautifulNLP
View on GitHub
美丽东自然语言处理百宝箱~命名实体识别，文本分类，语言模型，文本摘要。
☆10Nov 28, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
daohu527 / dig-into-simulator
View on GitHub
Simulator notes
☆12Jun 7, 2020Updated 6 years ago
SwordHG / LLM-PEFT-
View on GitHub
本项目旨在分享大模型相关技术原理以及实战经验。
☆12Sep 6, 2023Updated 2 years ago
Hongtao-Lin / NER
View on GitHub
Named entity recognition system using multi-stage CRF and statistical rules
☆11Oct 3, 2016Updated 9 years ago
skystrife / clickstream-hmm
View on GitHub
Code from "Modeling MOOC Student Behavior with Two-Layer Hidden Markov Models"
☆15Jun 2, 2018Updated 8 years ago
TemporalKGTeam / TANGO
View on GitHub
☆12Jul 8, 2022Updated 4 years ago
albertnadal / GJKCollisionDetection
View on GitHub
C++ implementation of the GJK algorithm for convex polygon collision detection.
☆11Aug 22, 2019Updated 6 years ago
core-power / 2021baidu-TOP27
View on GitHub
☆15Jun 8, 2021Updated 5 years ago
DeathWish5 / rCore_tutorial_tests
View on GitHub
rCore_tutorial_tests
☆11Aug 8, 2021Updated 4 years ago
MateuszKubuszok / GameTheoryTools
View on GitHub
Tools for calculating some Nash equilibria with thesis document
☆13Jul 24, 2014Updated 11 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
qiwihui / reinforcement-learning-an-introduction-chinese
View on GitHub
《Reinforcement Learning: An Introduction》（第二版）中文翻译
☆681Apr 9, 2022Updated 4 years ago
zawnpn / RL_RunFast
View on GitHub
一款基于DQN算法的牌类游戏AI框架 / An AI framework for card games based on DQN algorithm
☆13Jul 25, 2024Updated last year
Keycatowo / rough-set
View on GitHub
Rough Set Python Package is a Python library that provides a set of tools to calculate rough sets and obtain reduct rules.
☆15Mar 30, 2024Updated 2 years ago
luo-yuanfu / is-despot
View on GitHub
importance sampling for online planning under uncertainty
☆13Oct 27, 2019Updated 6 years ago
studentdeng / ActivityGame
View on GitHub
an example of how to use state machine to build a game
☆17Nov 6, 2014Updated 11 years ago
yxBeginner / RL-and-Robot
View on GitHub
Asynchronous Off-Policy Deep Reinforcement Learning For Wheeled Robot Path Planning
☆45Jun 14, 2019Updated 7 years ago
AI-FE / ai-friendly-clean-business-component-template
View on GitHub
AI友好的整洁业务组件架构模版
☆13Jan 26, 2025Updated last year
CircuitCoder / mill
View on GitHub
RV32I by cats
☆15Sep 4, 2023Updated 2 years ago
zhengsizuo / Deep-Learning-Note
View on GitHub
Some notes and code test about Deep Learning
☆15Jul 12, 2020Updated 5 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
sodalone / paper-reading-skill
View on GitHub
☆103Apr 20, 2026Updated 2 months ago
talebolano / example_of_reinforcement_lreaning_by_pytorch
View on GitHub
一些利用pytorch编程实现的强化学习例子
☆35Apr 7, 2019Updated 7 years ago
lyeemax / motion_planning_occ
View on GitHub
☆11Jul 21, 2020Updated 5 years ago
cseryp / stopwords
View on GitHub
1208 Chinese stopwords
☆14Feb 5, 2017Updated 9 years ago
MarshallEriksen-Neura / Deeting
View on GitHub
Listen. Remember. Evolve. 开源桌面端ai引擎
☆60Jun 27, 2026Updated last week
xhwdlgy5201 / SuperThermal-thermal-matching
View on GitHub
An unofficial PyTorch implementation of SuperThermal: Matching Thermal as Visible Through Thermal Feature Exploration
☆18Jun 9, 2023Updated 3 years ago
yotick / BGDC-TIP2022-pansharpening
View on GitHub
TIP2022-BGDC-A Unified Pansharpening Model Based on Band-Adaptive Gradient and Detail Correction
☆13Mar 7, 2025Updated last year
zoulala / CCKS_QA
View on GitHub
REF//biendata.com/competition/CCKS2018_3/make-submission/
☆17Aug 12, 2018Updated 7 years ago
FreedomIntelligence / Awesome-Rubrics
View on GitHub
A curated list of resources (surveys, papers, benchmarks, and opensource projects) on Rubrics
☆96Jul 2, 2026Updated last week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
FlorianPusse / OpenDS-CTS
View on GitHub
☆13Mar 26, 2019Updated 7 years ago
gogojjh / pointcloud_image_converter
View on GitHub
☆22Sep 25, 2024Updated last year
ytwboxing / cartpole_casadi_cplusplus
View on GitHub
使用casadi的C++接口写的shooting/collocation轨迹优化示例代码
☆60Nov 18, 2021Updated 4 years ago
ChenYong1993 / LRSDN
View on GitHub
☆11Oct 9, 2024Updated last year
iieir-km / KALE
View on GitHub
first commit
☆16Jun 10, 2020Updated 6 years ago
luisfelipewb / RL4WasteCapture
View on GitHub
A Deep Reinforcement Learning Strategy and Framework for Floating Waste Capture
☆13Mar 13, 2025Updated last year
kylewray / nova
View on GitHub
CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.
☆18Jun 18, 2021Updated 5 years ago