《强化学习中的数学原理》笔记-个人 学习的思考和补充
☆96Jun 11, 2026Updated last week
Alternatives and similar repositories for Mathematical-Foundations-of-Reinforcement-Learning-Notes
Users that are interested in Mathematical-Foundations-of-Reinforcement-Learning-Notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python notebook implementation of hyperparameter tuning of LSTM deep learning model using Genetic algorithm☆23Aug 18, 2021Updated 4 years ago
- Official implementation of NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments (ICCV'25).☆82Dec 26, 2025Updated 5 months ago
- [CVPR 2025] Lifelong Knowledge Editing for Vision Language Models with Low-Rank Mixture-of-Experts☆23Jun 22, 2025Updated 11 months ago
- A lightweight MCP server that encapsulates the Exa Pool API as a toolkit for AI assistants to call.☆59Mar 11, 2026Updated 3 months ago
- Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation☆11Jul 26, 2016Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Scientific Programming IRL: An intro to scientific programming concepts by scientists, for scientists☆12May 9, 2023Updated 3 years ago
- 嘉立创EDA&EasyEDA插件,提供实用的本地工具☆17Apr 17, 2026Updated 2 months ago
- Unlocks articles on theinitium.com.☆11Jul 29, 2023Updated 2 years ago
- Program developed for fly bebop drone, it integrate de Myo armband and Bebop on python.☆12May 17, 2026Updated last month
- pip install poocr☆17Apr 6, 2025Updated last year
- 继续!继续!继续!☆90Apr 27, 2026Updated last month
- Debugging PyO3 with Visual Studio Code☆16Apr 5, 2025Updated last year
- MinRL provides clean, minimal implementations of fundamental reinforcement learning algorithms in a customizable GridWorld environment. T…☆124May 15, 2025Updated last year
- auto deploy neovim like chxuan/vimplus☆12Apr 22, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Constrainted optimization algorithms in python including linear conjugate gradient, log barrier, primal-dual interior-point methods☆12Jul 7, 2019Updated 6 years ago
- Create documentation with social cards☆18May 28, 2024Updated 2 years ago
- A C++ iLQR library that allows you to solve iLQR optimization problem on any robot as long as you provide an URDF file describing the kin…☆15Aug 7, 2024Updated last year
- 用requests等库封装的东莞理工学院相关系统的爬虫脚本库☆14Apr 12, 2023Updated 3 years ago
- A mkdocs plugin to generage summary with the help of AI.☆10Dec 27, 2024Updated last year
- Variational Autoencoder (VAE) PyTorch Tutorial from Scratch☆10Nov 22, 2023Updated 2 years ago
- Mono repository automation toolkit☆29Jun 9, 2026Updated last week
- Mkdocs plugin which displays links in a more elegant way. Links will automatically be populated with an image, description, fav icon, and…☆11May 4, 2025Updated last year
- 个人网站☆11Jun 6, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 一款综合了 ivy-hugo-theme 和 cupper-hugo-theme 特点的简洁轻量化的响应式 Hugo 博客主题。☆12Sep 4, 2025Updated 9 months ago
- A MkDocs plugin that uses heti to improve typesetting☆12Sep 30, 2023Updated 2 years ago
- 基于蒙特卡洛树搜索算法实现多机器人区域覆盖路径规划,并将覆盖结果可视化☆13Jun 6, 2022Updated 4 years ago
- ☆16May 9, 2020Updated 6 years ago
- ☆17Sep 2, 2017Updated 8 years ago
- Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation" (ICLR2026)☆149Mar 3, 2026Updated 3 months ago
- A simple implentment of Maximum Classifier Discrepancy for Unsupervised Domain Adaptation with pytorch☆14Jun 1, 2020Updated 6 years ago
- Minimalist macOS OCR tool. Open-source, privacy-first, and built with SwiftUI.☆48Mar 2, 2026Updated 3 months ago
- 破解CAJViewer带有效期的文档,支持破解科学文库、标准全文数据库下载的文档。无损破解,保留文字和目录,解除有效期限制。☆14Jan 2, 2022Updated 4 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.☆18Jun 18, 2021Updated 5 years ago
- ⚡️😎Mac's Ultimate Clipboard: Built for Speed, Born Native. 为 Mac 打造的高性能原生剪贴板。☆138Updated this week
- Code for "SUGAR: Subgraph Neural Network with Reinforcement Pooling and Self-Supervised Mutual Information Mechanism"☆62May 7, 2021Updated 5 years ago
- Generating minimum snap trajectories between waypoints for quadrotors☆18Sep 21, 2024Updated last year
- Official implementation of "ResAD: Normalized Residual Trajectory Modeling for End-to-End Autonomous Driving"☆69Oct 9, 2025Updated 8 months ago
- Analyse your YouTube watch history using Data Science, ML and NLP.☆14Jun 16, 2025Updated last year
- DB-BERT tunes database systems for optimal performance, using tuning hints mined from text.☆62Aug 19, 2023Updated 2 years ago