强化学习的数学原理代码练习
☆19Apr 17, 2024Updated 2 years ago
Alternatives and similar repositories for RL
Users that are interested in RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a Screeps World Bot☆15Nov 1, 2025Updated 5 months ago
- The graphics renderer library for the Screeps game☆28Apr 4, 2026Updated last week
- Demo of using WASM to sandbox Plotly execution☆19Mar 30, 2025Updated last year
- 基于Alpha- Beta剪枝Max-Min博弈树的五子棋对战AI + 搜索优化(IDA*,A*,Zobrist,Ac自动机,贪心优化) + Qt-UI界面☆33Sep 14, 2023Updated 2 years ago
- 🎮 AI plays the game Balatro with CV & LLM combined. Powered by YOLO / RapidOCR (PaddleOCR) / LLM.☆31Mar 23, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- C++版本的sort算法,可无缝添加在检测器后进行实时多目标跟踪☆12Dec 1, 2022Updated 3 years ago
- 亚博智能 Jetson Orin NX 课程资料文档个人汉化☆17Nov 7, 2024Updated last year
- ☆12Jan 14, 2025Updated last year
- 今天Doro是什么结局?一个关于Doro结局的抽卡小游戏,基于HTML5+CSS+JS实现☆36Oct 11, 2025Updated 6 months ago
- h264的软解和硬解,基于FFmpeg和MPP☆11Mar 23, 2022Updated 4 years ago
- Google《Introduction to Agents》中文翻译☆42Nov 14, 2025Updated 5 months ago
- Rime五笔☆47May 29, 2024Updated last year
- An artificial bee colony implementation in Python☆11Oct 7, 2020Updated 5 years ago
- a python tool for analyze can signal with ZLG can device☆42Jul 29, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆19Mar 9, 2023Updated 3 years ago
- ☆21Dec 24, 2024Updated last year
- Heima data structure course OOP implementations.☆47Feb 6, 2021Updated 5 years ago
- ☆841Jul 6, 2023Updated 2 years ago
- A lightweight image converter which supports PNG, PNM, BMP, QOI, JPEG-LS, and H.265 intra-frame.☆57Jun 7, 2025Updated 10 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆33Jul 25, 2025Updated 8 months ago
- ☆59May 30, 2024Updated last year
- PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency☆19Mar 29, 2024Updated 2 years ago
- BIBench:数据分析领域LLM评测基准☆23Mar 2, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 一大波学习onnx的案例☆25Sep 20, 2024Updated last year
- PAT题解(C/C++/JAVA)☆14Apr 3, 2020Updated 6 years ago
- ☆40Dec 5, 2025Updated 4 months ago
- 时间关键词正则提取以及标准化☆20Dec 19, 2021Updated 4 years ago
- Course Assignment Solutions for Motion Planning for Mobile Robots☆87Mar 14, 2022Updated 4 years ago
- This repository includes the source code associated with the paper "RACP: Risk-Aware Contingency Planning with Multi-Modal Predictions", …☆82May 29, 2024Updated last year
- The mirror of RL_Coding_Exercise.☆117Sep 4, 2024Updated last year
- ☆105Jan 21, 2025Updated last year
- This is a repository to practice multi-thread programming in C++☆28Feb 21, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- DistilBERT for Chinese 海量中文预训练蒸馏bert模型☆96Dec 5, 2019Updated 6 years ago
- 多路rtsp硬解码☆29Jan 22, 2024Updated 2 years ago
- Explore what LLMs are really leanring over SFT☆28Mar 30, 2024Updated 2 years ago
- 激光雷达的结构以及PCB设计文件☆122Jan 25, 2022Updated 4 years ago
- 使用强化学习训练PPT的Agent☆68Oct 16, 2025Updated 6 months ago
- ☆33Jul 14, 2021Updated 4 years ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆40May 2, 2024Updated last year