斯坦福 cs234 强化学习中文讲义
☆209Jan 2, 2021Updated 5 years ago
Alternatives and similar repositories for stanford-cs234-notes-zh
Users that are interested in stanford-cs234-notes-zh are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 我的强化学习笔记和学习材料 still updating ... ...☆372Sep 27, 2025Updated 8 months ago
- 中文整理的强化学习资料(Reinforcement Learning)☆2,170Apr 30, 2020Updated 6 years ago
- 斯坦福 CS224n 自然语言处理中文笔记☆349Sep 17, 2020Updated 5 years ago
- Gradient descent algorithms for LQG control☆14Feb 20, 2022Updated 4 years ago
- 这是一个学习强化学习基础原理的仓库,主要包括了《深入浅出强化学习原理入门》书中一些例子和课后作业的代码☆271Dec 4, 2018Updated 7 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"☆12Nov 24, 2021Updated 4 years ago
- Deep Reinforcement Learning Lab, a platform designed to make DRL technology and fun for everyone☆2,561Apr 11, 2022Updated 4 years ago
- Intro to Reinforcement Learning (强化学习纲要)☆3,582Jul 25, 2020Updated 5 years ago
- ☆27Apr 22, 2024Updated 2 years ago
- Theory of Reinforcement Learning☆18Apr 20, 2021Updated 5 years ago
- ☆11Sep 17, 2020Updated 5 years ago
- ☆12May 14, 2024Updated 2 years ago
- This repository contains PyTorch implementations of deep reinforcement learning algorithms and environments for Robotics and Controls. T…☆19Mar 20, 2022Updated 4 years ago
- 《Reinforcement Learning: An Introduction》(第二版)中文翻译☆673Apr 9, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Open MMLab Detection Toolbox with PyTorch☆12Jun 11, 2019Updated 6 years ago
- 强化学习中文教程(蘑菇书🍄),在线阅读地址:https://datawhalechina.github.io/easy-rl/☆14,217Dec 30, 2025Updated 4 months ago
- Some basic examples of playing with RL☆1,273Feb 18, 2026Updated 3 months ago
- Modelling bus-on-demand using SUMO and TraCI.☆19Apr 30, 2014Updated 12 years ago
- Revisiting Peng's Q(lambda) for Modern Reinforcement Learning☆15Jul 23, 2021Updated 4 years ago
- 应用强化学习在复杂的交通环境下自动学习最佳驾驶策略的方案,在测试环境下准确率达到100%。☆22Feb 26, 2017Updated 9 years ago
- [译] UCB DS100 数据科学的原理与技巧☆117Jan 2, 2021Updated 5 years ago
- A longitudinal dataset for academic literature, including papers, metadata, and citation graphs, Also available on 🤗 HuggingFace and Kag…☆17Sep 6, 2025Updated 8 months ago
- AI项目(强化学习、深度学习、计算机视觉、推荐系统、自然语言处理、机器导航、医学影像处理)☆92Aug 8, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [译] 笨办法学 Linux 中文版☆16Dec 24, 2020Updated 5 years ago
- Python Library for Dynamic Movement Primitives with Reinforcement Learning☆14Jun 21, 2022Updated 3 years ago
- [译] ApacheCN 计算机系统译文集☆23Jul 7, 2022Updated 3 years ago
- [译] Think Python 中文第二版☆76Dec 24, 2020Updated 5 years ago
- 强化学习训练斗地主 / doudizhu AI using reinforcement learning.☆20Sep 19, 2019Updated 6 years ago
- ☆30Nov 15, 2023Updated 2 years ago
- A translation of Reinforcement Learning: An Introduction☆114Aug 20, 2018Updated 7 years ago
- 深度学习、强化学习、模仿学习与机器人☆479Oct 31, 2020Updated 5 years ago
- homework for CS234 2017☆150May 4, 2018Updated 8 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 《Natural Language Processing with PyTorch》中文翻译☆718Jan 4, 2021Updated 5 years ago
- ArduSub Ground Control Station.☆10Mar 9, 2020Updated 6 years ago
- A Chinese learning note with python codes for Pattern Recognition and Machine Learning.☆30Aug 25, 2018Updated 7 years ago
- Low-Order modelling of Floating offshore wind Turbines/Farms for grid integration research☆20Aug 9, 2025Updated 9 months ago
- ☆12Sep 17, 2020Updated 5 years ago
- ☆16Feb 13, 2021Updated 5 years ago
- 本项目以一个可视化配置的、以AgentRL为核心的强化学习框架,实现30分钟上手AgentRL 编程。后续增加AgentRL和本地Agent、MCP、A2A相关特性。☆80Jul 9, 2025Updated 10 months ago