强化学习的数学原理代码练习
☆19Apr 17, 2024Updated last year
Alternatives and similar repositories for RL
Users that are interested in RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a Screeps World Bot☆14Nov 1, 2025Updated 4 months ago
- The graphics renderer library for the Screeps game☆28Mar 1, 2026Updated 3 weeks ago
- 🎮 AI plays the game Balatro with CV & LLM combined. Powered by YOLO / RapidOCR (PaddleOCR) / LLM.☆29Oct 24, 2025Updated 5 months ago
- Demo of using WASM to sandbox Plotly execution☆19Mar 30, 2025Updated 11 months ago
- 基于Alpha- Beta剪枝Max-Min博弈树的五子棋对战AI + 搜索优化(IDA*,A*,Zobrist,Ac自动机,贪心优化) + Qt-UI界面☆35Sep 14, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- C++版本的sort算法,可无缝添加在检测器后进行实时多目标跟踪☆12Dec 1, 2022Updated 3 years ago
- 亚博智能 Jetson Orin NX 课程资料文档个人汉化☆17Nov 7, 2024Updated last year
- Google《Introduction to Agents》中文翻译☆35Nov 14, 2025Updated 4 months ago
- ☆12Jan 14, 2025Updated last year
- h264的软解和硬解,基于FFmpeg和MPP☆11Mar 23, 2022Updated 4 years ago
- 今天Doro是什么结局?一个关于Doro结局的抽卡小游戏,基于HTML5+CSS+JS实现☆37Oct 11, 2025Updated 5 months ago
- Rime五笔☆45May 29, 2024Updated last year
- An artificial bee colony implementation in Python☆11Oct 7, 2020Updated 5 years ago
- a python tool for analyze can signal with ZLG can device☆42Jul 29, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆19Mar 9, 2023Updated 3 years ago
- ☆21Dec 24, 2024Updated last year
- Heima data structure course OOP implementations.☆47Feb 6, 2021Updated 5 years ago
- ☆823Jul 6, 2023Updated 2 years ago
- A lightweight image converter which supports PNG, PNM, BMP, QOI, JPEG-LS, and H.265 intra-frame.☆57Jun 7, 2025Updated 9 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆33Jul 25, 2025Updated 8 months ago
- ☆59May 30, 2024Updated last year
- PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency☆19Mar 29, 2024Updated last year
- BIBench:数据分析领域LLM评测基准☆22Mar 2, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 一大波学习onnx的案例☆25Sep 20, 2024Updated last year
- PAT题解(C/C++/JAVA)☆14Apr 3, 2020Updated 5 years ago
- ☆38Dec 5, 2025Updated 3 months ago
- 时间关键词正则提取以及标准化☆20Dec 19, 2021Updated 4 years ago
- Course Assignment Solutions for Motion Planning for Mobile Robots☆87Mar 14, 2022Updated 4 years ago
- This repository includes the source code associated with the paper "RACP: Risk-Aware Contingency Planning with Multi-Modal Predictions", …☆82May 29, 2024Updated last year
- The mirror of RL_Coding_Exercise.☆118Sep 4, 2024Updated last year
- ☆105Jan 21, 2025Updated last year
- This is a repository to practice multi-thread programming in C++☆28Feb 21, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- DistilBERT for Chinese 海量中文预训练蒸馏bert模型☆95Dec 5, 2019Updated 6 years ago
- 多路rtsp硬解码☆28Jan 22, 2024Updated 2 years ago
- Explore what LLMs are really leanring over SFT☆28Mar 30, 2024Updated last year
- 激光雷达的结构以及PCB设计文件☆120Jan 25, 2022Updated 4 years ago
- 使用强化学习训练PPT的Agent☆68Oct 16, 2025Updated 5 months ago
- ☆33Jul 14, 2021Updated 4 years ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆40May 2, 2024Updated last year