《Reinforcement Learning: An Introduction》(第二版)中文翻译
☆54Jul 25, 2019Updated 6 years ago
Alternatives and similar repositories for reinforcement-learning-an-introduction
Users that are interested in reinforcement-learning-an-introduction are comparing it to the libraries listed below
Sorting:
- 强化学习资料☆23Sep 5, 2019Updated 6 years ago
- PyTorch implementation of "Learning from Students: Online Contrastive Distillation Network for General Continual Learning" (IJCAI 2022)☆11Dec 29, 2022Updated 3 years ago
- ECHO is a semi-supervised framework for classifying evolving data streams based on our previous approach SAND. The most expensive module …☆12Dec 25, 2017Updated 8 years ago
- Doccano annotation server together with a Spacy backend☆11Apr 5, 2023Updated 2 years ago
- PowerDEVS is an integrated tool for hybrid systems modeling and simulation based on the DEVS formalism.☆12Mar 20, 2021Updated 5 years ago
- simulations used in "Concept2Robot: Learning Manipulation Concepts from Instructions and Human Demonstrations"☆28Jan 1, 2023Updated 3 years ago
- ☆17Jan 1, 2019Updated 7 years ago
- Simulator notes☆12Jun 7, 2020Updated 5 years ago
- 本项目旨在分享大模型相关技术原理以及实战经验。☆12Sep 6, 2023Updated 2 years ago
- A Simple LateX Template☆14Dec 1, 2021Updated 4 years ago
- Named entity recognition system using multi-stage CRF and statistical rules☆12Oct 3, 2016Updated 9 years ago
- implement of ICDE 2019 paper: Robust High Dimensional Stream Classification with Novel Class Detection☆15Aug 15, 2020Updated 5 years ago
- PyTorch implementation of the ExStream method from our ICRA-2019 paper "Memory Efficient Experience Replay for Streaming Learning"☆22Nov 26, 2019Updated 6 years ago
- 《Reinforcement Learning: An Introduction》(第二版)中文翻译☆656Apr 9, 2022Updated 3 years ago
- 基于pdfium的pdf/ofd双引擎解析渲染引擎☆14Oct 15, 2024Updated last year
- 一款基于DQN算法的牌类游戏AI框架 / An AI framework for card games based on DQN algorithm☆13Jul 25, 2024Updated last year
- 基于golang go语言(beego框架)下的ONLYOFFICE Document Server二次开发。 主要功能为文档的上传、预览、覆盖、回调等功能。☆10Oct 20, 2023Updated 2 years ago
- Rough Set Python Package is a Python library that provides a set of tools to calculate rough sets and obtain reduct rules.☆15Mar 30, 2024Updated last year
- 自用LaTex模板,主要用来编写数学解答及书写论文☆10Jun 3, 2024Updated last year
- 国内外大厂系统架构案例,系统架构面试题与从零到一的实践☆18Updated this week
- importance sampling for online planning under uncertainty☆13Oct 27, 2019Updated 6 years ago
- An unofficial PyTorch implementation of SuperThermal: Matching Thermal as Visible Through Thermal Feature Exploration☆16Jun 9, 2023Updated 2 years ago
- Estimates fatigue loads in wind turbines from SCADA data based on supervised learning.☆10Sep 11, 2018Updated 7 years ago
- 在监控画质下实现对校园自行车的重识别,包含REID模型识别,向量数据库检索,UI展示☆11Feb 13, 2024Updated 2 years ago
- The code and data for the paper "Lost-in-the-Middle in Long-Text Generation: Synthetic Dataset, Evaluation Framework, and Mitigation"☆13Oct 8, 2025Updated 5 months ago
- Some notes and code test about Deep Learning☆15Jul 12, 2020Updated 5 years ago
- Simple ROS2 node for creating segmentation masks based on pixel colors☆10Dec 19, 2023Updated 2 years ago
- 一些利用pytorch编程实现的强化学习例子☆36Apr 7, 2019Updated 6 years ago
- 1208 Chinese stopwords☆14Feb 5, 2017Updated 9 years ago
- REF//biendata.com/competition/CCKS2018_3/make-submission/☆17Aug 12, 2018Updated 7 years ago
- Deep Q-Network (DQN) and DDPG to address the problem of stall around the wing sail of an autonomous sailing robot☆11Sep 18, 2018Updated 7 years ago
- collect robot technology books☆15Mar 21, 2023Updated 3 years ago
- 使用casadi的C++接口写的shooting/collocation轨迹优化示例代码☆59Nov 18, 2021Updated 4 years ago
- ☆11Oct 9, 2024Updated last year
- 开源扫雷网是专业玩家建设的扫雷排名网站。在这里,你可以上传扫雷录像参与全球排名;也希望有开发能力的雷友可以发挥专业能力,为网站贡献代码、增加功能。Open minesweeper website is a community-built ranking website fo…☆11Updated this week
- first commit☆16Jun 10, 2020Updated 5 years ago
- The source code for paper:Two-level Consistency Metric for Infrared and Visible Image Fusion☆11Jun 30, 2023Updated 2 years ago
- Empowering RAG with a versatile model-driven data interface for all-purpose applications!☆17Sep 10, 2024Updated last year
- A trajectory planning using ilqr☆56Dec 6, 2024Updated last year