morphotherain / RLView external linksLinks
强化学习的数学原理代码练习
☆19Apr 17, 2024Updated last year
Alternatives and similar repositories for RL
Users that are interested in RL are comparing it to the libraries listed below
Sorting:
- Google《Introduction to Agents》中文翻译☆26Nov 14, 2025Updated 3 months ago
- ☆11Jan 14, 2025Updated last year
- An artificial bee colony implementation in Python☆11Oct 7, 2020Updated 5 years ago
- C++版本的sort算法,可无缝添加在检测器后进行实时多目标跟踪☆12Dec 1, 2022Updated 3 years ago
- Demo of using WASM to sandbox Plotly execution☆19Mar 30, 2025Updated 10 months ago
- h264的软解和硬解,基于FFmpeg和MPP☆11Mar 23, 2022Updated 3 years ago
- a Screeps World Bot☆14Nov 1, 2025Updated 3 months ago
- PAT题解(C/C++/JAVA)☆14Apr 3, 2020Updated 5 years ago
- ☆18Mar 9, 2023Updated 2 years ago
- ☆33Dec 5, 2025Updated 2 months ago
- ☆20Dec 24, 2024Updated last year
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆32Jul 25, 2025Updated 6 months ago
- 一大波学习onnx的案例☆25Sep 20, 2024Updated last year
- PET-SQL: A Prompt-enhanced Two-stage Text-to-SQL Framework with Cross-consistency☆19Mar 29, 2024Updated last year
- BIBench:数据分析领域LLM评测基准☆22Mar 2, 2024Updated last year
- 时间关键词正则提取以及标准化☆20Dec 19, 2021Updated 4 years ago
- 使用强化学习训练PPT的Agent☆58Oct 16, 2025Updated 3 months ago
- 多路rtsp硬解码☆28Jan 22, 2024Updated 2 years ago
- The graphics renderer library for the Screeps game☆28Feb 5, 2026Updated last week
- This is a repository to practice multi-thread programming in C++☆28Feb 21, 2024Updated last year
- Explore what LLMs are really leanring over SFT☆28Mar 30, 2024Updated last year
- 基于Alpha- Beta剪枝Max-Min博弈树的五子棋对战AI + 搜索优化(IDA*,A*,Zobrist,Ac自动机,贪心优化) + Qt-UI界面☆36Sep 14, 2023Updated 2 years ago
- ☆33Jul 14, 2021Updated 4 years ago
- [NeurIPS 2023] Large Language Models Are Semi-Parametric Reinforcement Learning Agents☆40May 2, 2024Updated last year
- Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)☆33Mar 16, 2020Updated 5 years ago
- code for paper Query-Dependent Prompt Evaluation and Optimization with Offline Inverse Reinforcement Learning☆43Mar 20, 2024Updated last year
- ☆798Jul 6, 2023Updated 2 years ago
- Pretrain、decay、SFT a CodeLLM from scratch 🧙♂️☆40May 15, 2024Updated last year
- 今天Doro是什么结局?一个关于Doro结局的抽卡小游戏,基于HTML5+CSS+JS实现☆38Oct 11, 2025Updated 4 months ago
- 将原本Keras版本的AdvancedEAST改写成PyTorch版,将数据集由.npy文件改成一个LMDB文件,加入Precision,Recall, F1 score方便训练以及调试,底层网络仍然用VGG16。☆38Aug 22, 2020Updated 5 years ago
- "Semantic Evaluation for Text-to-SQL with Distilled Test Suite", EMNLP2020☆42Dec 1, 2020Updated 5 years ago
- Rime五笔☆43May 29, 2024Updated last year
- The blog, read report and code example for AGI/LLM related knowledge.☆56Feb 1, 2025Updated last year
- ☆58Jun 3, 2024Updated last year
- 适用于2代live2d的Ai vtuber 适用于本地或网页部署☆56May 16, 2024Updated last year
- a python tool for analyze can signal with ZLG can device☆42Jul 29, 2020Updated 5 years ago
- Heima data structure course OOP implementations.☆46Feb 6, 2021Updated 5 years ago
- DJL Spring Boot Starter Demo apps☆45Nov 14, 2024Updated last year
- ☆54Feb 21, 2022Updated 3 years ago