An reconstruction of RL Introduction and its course materials for a more efficient entry
☆21Mar 4, 2026Updated this week
Alternatives and similar repositories for distil-rl-introduction
Users that are interested in distil-rl-introduction are comparing it to the libraries listed below
Sorting:
- 最基本最小白的自然语言处理入门读物,基于deepseek-r1,涵盖了传统NLP和现代大模型☆23Jan 16, 2026Updated last month
- Robot Learning Algorithms☆26Aug 19, 2024Updated last year
- This announcement is used in the ATMHUFK's video. The original is from the another up,Which is called 原无奇变in Chinese.You can use it to av…☆10Jan 26, 2025Updated last year
- 强化学习贪吃蛇☆14Oct 19, 2023Updated 2 years ago
- Datawhale开源教程《人工智能的数学基础》☆288Feb 14, 2026Updated 3 weeks ago
- This is official repository of Physics-AD☆18Feb 24, 2026Updated last week
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆27Apr 27, 2025Updated 10 months ago
- A simple 1-d diffusion/flow model tutorial for LeCAR group meeting☆16Sep 27, 2025Updated 5 months ago
- AI 可以在 50 trun 内实现一个简单的高性能向量数据库吗?☆24Feb 27, 2026Updated last week
- Intrinsic Curiosity Module (ICM) + PPO on the Pyramid and PushBlock environment.☆12Sep 3, 2019Updated 6 years ago
- Awesome_CV的中文版本,clone本项目到overleaf即可轻松愉快编写自己的CV☆15May 24, 2024Updated last year
- 从 NLP 到 LLM 的算法全栈教程,在线阅读地址:https://datawhalechina.github.io/base-llm/☆340Updated this week
- A platform for Applied Reinforcement Learning (Applied RL)☆13Jan 19, 2019Updated 7 years ago
- MLLM @ Game☆16May 12, 2025Updated 9 months ago
- Linux操作系统学习笔记☆20Jan 11, 2024Updated 2 years ago
- dieshot PSD files☆22Mar 6, 2025Updated last year
- ☆286Nov 26, 2025Updated 3 months ago
- A LaTeX beamer theme template for Jilin University students. 吉林大学beamer模板.☆18May 12, 2021Updated 4 years ago
- ☆21Dec 8, 2024Updated last year
- 利用kNN算法实现图书推荐系统,前台使用的是微信小程序,后台使用的是Spring Boot+MyBatis,数据库使用的是MySQL+Redis☆17May 19, 2022Updated 3 years ago
- 这是2023华为软件精英挑战赛 初赛阶段319万分的代码,广西省第一名,粤港澳区排名第8。该比赛要求选手在一个50m*50m的地图上,控制4台机器人进入任务调度,设计机器人的运动算法、路径规划算法、任务调度算法,去分布在地图上的各种类型的工作台购买或者出售商品,赚取差价,以…☆17Sep 2, 2023Updated 2 years ago
- ☆34Jul 12, 2025Updated 7 months ago
- 液体火箭推力矢量LQR控制算法☆31Apr 11, 2025Updated 10 months ago
- 🌟 Datawhale 贡献者可视化平台,在线地址:https://mv.datawhale.cc/☆31Updated this week
- Implementation of an RL based agent, which utilizes Q-Learning to develop a policy for effectively solving a 3x3x3 rubiks cube☆18Mar 12, 2019Updated 6 years ago
- Universal, language-agnostic development standards for software projects. Includes coding standards, git workflows, testing guidelines, d…☆45Updated this week
- 动手训练一个简单的CLIP模型,加深对CLIP的理解。☆23May 20, 2025Updated 9 months ago
- ☆25Jun 2, 2025Updated 9 months ago
- GenRec: Generative Recommender Systems with RQ-VAE semantic IDs, Transformer-based retrieval, and LLM integration. Built on PyTorch with …☆51Feb 24, 2026Updated last week
- CS109-23S course project example☆16May 30, 2023Updated 2 years ago
- An easy-use coroutine lib implement by C++ coroutine and liburing☆29Jun 6, 2025Updated 9 months ago
- Official implementation of TailedCore(CVPR25)☆25Jun 12, 2025Updated 8 months ago
- Flash-Linear-Attention models beyond language☆21Aug 28, 2025Updated 6 months ago
- ☆13Jan 31, 2023Updated 3 years ago
- Docker 教程:从零基础到掌握 Docker☆45Jan 2, 2026Updated 2 months ago
- MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering☆23Nov 28, 2025Updated 3 months ago
- ☆29Aug 16, 2025Updated 6 months ago
- ☆25Jan 19, 2026Updated last month
- Official implement of ICLR 2025 "One-for-All Few-Shot Anomaly Detection via Instance-Induced Prompt Learning"☆37May 8, 2025Updated 10 months ago