《强化学习中的数学原 理》笔记-个人学习的思考和补充
☆94Apr 22, 2026Updated 2 weeks ago
Alternatives and similar repositories for Mathematical-Foundations-of-Reinforcement-Learning-Notes
Users that are interested in Mathematical-Foundations-of-Reinforcement-Learning-Notes are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments (ICCV'25).☆76Dec 26, 2025Updated 4 months ago
- A lightweight MCP server that encapsulates the Exa Pool API as a toolkit for AI assistants to call.☆59Mar 11, 2026Updated last month
- Markovian State and Action Abstractions for MDPs via Hierarchical MCTS within a POMDP Formulation☆11Jul 26, 2016Updated 9 years ago
- Scientific Programming IRL: An intro to scientific programming concepts by scientists, for scientists☆12May 9, 2023Updated 3 years ago
- 嘉立创EDA&EasyEDA插件,提供实用的本地工具☆17Apr 17, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- bilibil视频鸡语言.exe视频的源代码上传github喽,大家快来下载☆15Jan 25, 2024Updated 2 years ago
- pip install poocr☆17Apr 6, 2025Updated last year
- Debugging PyO3 with Visual Studio Code☆16Apr 5, 2025Updated last year
- 继续!继续!继续!☆88Apr 27, 2026Updated last week
- auto deploy neovim like chxuan/vimplus☆12Apr 22, 2025Updated last year
- Constrainted optimization algorithms in python including linear conjugate gradient, log barrier, primal-dual interior-point methods☆12Jul 7, 2019Updated 6 years ago
- A C++ iLQR library that allows you to solve iLQR optimization problem on any robot as long as you provide an URDF file describing the kin…☆14Aug 7, 2024Updated last year
- 用requests等库封装的东莞理工学院相关系统的爬虫脚本库☆13Apr 12, 2023Updated 3 years ago
- 🌟 My personal website, build width Nextjs☆11Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A mkdocs plugin to generage summary with the help of AI.☆11Dec 27, 2024Updated last year
- Mkdocs plugin which displays links in a more elegant way. Links will automatically be populated with an image, description, fav icon, and…☆12May 4, 2025Updated last year
- 校园网(锐捷)自动登录☆13Jun 8, 2019Updated 6 years ago
- 一款综合了 ivy-hugo-theme 和 cupper-hugo-theme 特点的简洁轻量化的响应式 Hugo 博客主题。☆13Sep 4, 2025Updated 8 months ago
- 基于蒙特卡洛树搜索算法实现多机器人区域覆盖路径规划,并将覆盖结果可视化☆13Jun 6, 2022Updated 3 years ago
- Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation" (ICLR2026)☆140Mar 3, 2026Updated 2 months ago
- ☆17Sep 2, 2017Updated 8 years ago
- Minimalist macOS OCR tool. Open-source, privacy-first, and built with SwiftUI.☆50Mar 2, 2026Updated 2 months ago
- 金智教育统一身份认证登录获取cookie☆16Apr 3, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- CUDA optimized code for solving MDPs, POMDPs, and Dec-POMDPs.☆18Jun 18, 2021Updated 4 years ago
- Build a VisionOS style web personal homepage that allows us to experience the visual experience of Apple Vision Pro on the front-end as w…☆11Sep 3, 2024Updated last year
- C++ implementation of iLQR☆13Mar 24, 2017Updated 9 years ago
- Generating minimum snap trajectories between waypoints for quadrotors☆18Sep 21, 2024Updated last year
- Analyse your YouTube watch history using Data Science, ML and NLP.☆14Jun 16, 2025Updated 10 months ago
- ☆13Oct 9, 2023Updated 2 years ago
- DIY Myo Gesture Control Arduino-based Controller. Muscle data (EMG, muscle electrodes) are read and processed by simple machine learning …☆17Jun 6, 2018Updated 7 years ago
- Project for 16-748 implementing CILQR for real-time vehicle trajectory planning.☆21Dec 14, 2018Updated 7 years ago
- Using recurrent neural networks with LSTM cells to predict stock prices. Takes into account twitter trends.☆17May 22, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Iterative LQR for a differential drive robot C++☆23Apr 21, 2019Updated 7 years ago
- 一款可以透明化的浏览网页应用☆23Sep 3, 2025Updated 8 months ago
- Implemenation of DQN for lane changes☆19Sep 6, 2018Updated 7 years ago
- Othello AI | Monte Carlo tree search☆16Sep 9, 2018Updated 7 years ago
- PasteMemo - Smart clipboard manager for macOS☆98Updated this week
- The Myo MATLAB data streaming interface relies on real time data read from the text files updated by Python. Python is used to communicat…☆20Dec 15, 2015Updated 10 years ago
- 📃 MkDocs with ▲ Vercel (minimal configuration)☆20Apr 23, 2025Updated last year