An reconstruction of RL Introduction and its course materials for a more efficient entry
☆23Mar 4, 2026Updated last month
Alternatives and similar repositories for distil-rl-introduction
Users that are interested in distil-rl-introduction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 最基本最小白的自然语言处理入门读物,基于deepseek-r1,涵盖了传统NLP和现代大模型☆27Jan 16, 2026Updated 3 months ago
- ☆17Sep 17, 2023Updated 2 years ago
- Robot Learning Algorithms☆26Aug 19, 2024Updated last year
- This announcement is used in the ATMHUFK's video. The original is from the another up,Which is called 原无奇变in Chinese.You can use it to av…☆10Jan 26, 2025Updated last year
- A platform for Applied Reinforcement Learning (Applied RL)☆13Jan 19, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- android compose catalog☆17Jul 4, 2025Updated 9 months ago
- Ossian generic framework☆12Aug 25, 2021Updated 4 years ago
- 《python并发编程》简体中文版克隆并完善的☆21Mar 7, 2023Updated 3 years ago
- 本项目是一个围绕 DeepLearning.AI 出品的 Post-Training for LLMs 系列课程,为国内学习者量身打造的中文翻译与知识整理教程。项目提供课程内容翻译、知识点梳理和示例代码等内容,旨在降低语言门槛,让更多学生、研究人员和开发者系统掌握大语言模型…☆179Jan 4, 2026Updated 3 months ago
- Datawhale开源教程《人工智能的数学基础》☆308Feb 14, 2026Updated 2 months ago
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆15Apr 24, 2024Updated last year
- Intrinsic Curiosity Module (ICM) + PPO on the Pyramid and PushBlock environment.☆12Sep 3, 2019Updated 6 years ago
- MLLM @ Game☆16May 12, 2025Updated 11 months ago
- ☆27Sep 1, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆19Oct 27, 2025Updated 5 months ago
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆36Apr 27, 2025Updated 11 months ago
- 从 NLP 到 LLM 的算法全栈教程,在线阅读地址:https://datawhalechina.github.io/base-llm/☆652Apr 2, 2026Updated 2 weeks ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆46Sep 19, 2025Updated 7 months ago
- Awesome_CV的中文版本,clone本项目到overleaf即可轻松愉快编写自己的CV☆17May 24, 2024Updated last year
- A Zola theme for hosting summary posts about academic papers☆20Oct 22, 2025Updated 5 months ago
- A curated list of awesome frameworks, libraries, tools, environments, tutorials, research papers, and resources for reinforcement learnin…☆41Mar 3, 2026Updated last month
- A collection of advanced tools for large-scale high-quality mesh data preparing☆34May 16, 2025Updated 11 months ago
- 强化学习贪吃蛇☆15Oct 19, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- typora免费版-源自于网络☆13Feb 12, 2025Updated last year
- A LaTeX beamer theme template for Jilin University students. 吉林大学beamer模板.☆18May 12, 2021Updated 4 years ago
- Projectwork of a mini-drone offboard application using PX4-ros2☆16Jan 25, 2024Updated 2 years ago
- [CoRL 2024] Official code for "Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation Models"☆29Dec 11, 2024Updated last year
- Ailanxier's note of Database Systems☆11Jan 18, 2022Updated 4 years ago
- The SAIL-VL2 series model developed by the BytedanceDouyinContent Group☆79Sep 18, 2025Updated 7 months ago
- Repository for ‘Anomaly Detection and Generation with Diffusion Models: A Survey’.☆37Jun 15, 2025Updated 10 months ago
- Multi-View Monocular 3D (MVM3D) detection dataset based on RoboMaster University AI Challenge.☆25Sep 6, 2022Updated 3 years ago
- Project Report Format in Typst☆23May 22, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆17Aug 5, 2025Updated 8 months ago
- A collection of themes for the unreal editor.☆46Dec 9, 2025Updated 4 months ago
- ☆27Jun 2, 2025Updated 10 months ago
- The old version of Hugo academic theme | 旧版Hugo学术主题☆14Aug 23, 2021Updated 4 years ago
- ☆291Nov 26, 2025Updated 4 months ago
- 这是2023华为软件精英挑战赛 初赛阶段319万分的代码,广西省第一名,粤港澳区排名第8。该比赛要求选手在一个50m*50m的地图上,控制4台机器人进入任务调度,设计机器人的运动算法、路径规划算法、任务调度算法,去分布在地图上的各种类型的工作台购买或者出售商品,赚取差价,以…☆17Sep 2, 2023Updated 2 years ago
- ☆37Apr 27, 2023Updated 2 years ago