An reconstruction of RL Introduction and its course materials for a more efficient entry
☆21Mar 4, 2026Updated 3 weeks ago
Alternatives and similar repositories for distil-rl-introduction
Users that are interested in distil-rl-introduction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 最基本最小白的自然语言处理入门读物,基于deepseek-r1,涵盖了传统NLP和现代大模型☆23Jan 16, 2026Updated 2 months ago
- A platform for Applied Reinforcement Learning (Applied RL)☆13Jan 19, 2019Updated 7 years ago
- android compose catalog☆17Jul 4, 2025Updated 8 months ago
- 本项目是一个围绕 DeepLearning.AI 出品的 Post-Training for LLMs 系列课程,为国内学习者量身打造的中文翻译与知识整理教程。项目提供课程内容翻译、知识点梳理和示例代码等内容,旨在降低语言门槛,让更多学生、研究人员和开发者系统掌握大语言模型…☆171Jan 4, 2026Updated 2 months ago
- Datawhale开源教程《人工智能的数学基础》☆296Feb 14, 2026Updated last month
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆32Apr 27, 2025Updated 11 months ago
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆15Apr 24, 2024Updated last year
- dieshot PSD files☆22Mar 6, 2025Updated last year
- MLLM @ Game☆16May 12, 2025Updated 10 months ago
- OmiAD: One-Step Adaptive Masked Diffusion Model for Multi-class Anomaly Detection via Adversarial Distillation(ICML 2025)☆16May 29, 2025Updated 10 months ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆45Sep 19, 2025Updated 6 months ago
- Awesome_CV的中文版本,clone本项目到overleaf即可轻松愉快编写自己的CV☆17May 24, 2024Updated last year
- [ArXiv 2025] Official Implementation for "CoPS: Conditional Prompt Synthesis for Zero-Shot Anomaly Detection"☆27Aug 11, 2025Updated 7 months ago
- 强化学习贪吃蛇☆15Oct 19, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- 从 NLP 到 LLM 的算法全栈教程,在线阅读地址:https://datawhalechina.github.io/base-llm/☆514Updated this week
- This is official repository of Physics-AD☆20Feb 24, 2026Updated last month
- [CoRL 2024] Official code for "Scaling Robot Policy Learning via Zero-Shot Labeling with Foundation Models"☆28Dec 11, 2024Updated last year
- The official repo for ”[WACV2025] Towards Accurate Unified Anomaly Segmentation“☆15Apr 14, 2025Updated 11 months ago
- Repository for ‘Anomaly Detection and Generation with Diffusion Models: A Survey’.☆35Jun 15, 2025Updated 9 months ago
- Multi-View Monocular 3D (MVM3D) detection dataset based on RoboMaster University AI Challenge.☆25Sep 6, 2022Updated 3 years ago
- Simulation for DJI Pupper v2 robot☆17Oct 17, 2023Updated 2 years ago
- TLoL (Python Module) - League of Legends Deep Learning AI (Research and Development)☆20Dec 23, 2023Updated 2 years ago
- Implementation of the DreamerV2 agent in torch☆20Sep 4, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 2024编译系统实现赛RISC-V赛道一等奖作品(A compiler of SysY (subset of C) )☆25Sep 4, 2024Updated last year
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Sep 10, 2020Updated 5 years ago
- 中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌 词生成器、职业名称词库、同义词库、反义词库、否定词库、汽…☆13Oct 2, 2020Updated 5 years ago
- ☆33Jul 12, 2025Updated 8 months ago
- ☆288Nov 26, 2025Updated 4 months ago
- The old version of Hugo academic theme | 旧版Hugo学术主题☆14Aug 23, 2021Updated 4 years ago
- 🌟 Datawhale 贡献者可视化平台,在线地址:https://mv.datawhale.cc/☆34Updated this week
- 这是2023华为软件精英挑战赛 初赛阶段319万分的代码,广西省第一名,粤港澳区排名第8。该比赛要求选手在一个50m*50m的地图上,控制4台机器人进入任务调度,设计机器人的运动算法、路径规划算法、任务调度算法,去分布在地图上的各种类型的工作台购买或者出售商品,赚取差价,以…☆17Sep 2, 2023Updated 2 years ago
- Official Pytorch Codebase for Towards Online Domain Adaptive Object Detection [WACV 2023]☆44Mar 20, 2023Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Implementation of an RL based agent, which utilizes Q-Learning to develop a policy for effectively solving a 3x3x3 rubiks cube☆18Mar 12, 2019Updated 7 years ago
- Benchmarks of different devices I have come across☆40Aug 28, 2025Updated 7 months ago
- ☆26Apr 6, 2024Updated last year
- A Really Scalable RL Framework to 10k+ CPUs☆39Feb 29, 2024Updated 2 years ago
- Official implementation for: Consistency Models as a Rich and Efficient Policy Class for Reinforcement Learning ICLR'24☆26Aug 28, 2024Updated last year
- RLA is a tool for managing your RL experiments automatically☆32Jan 11, 2025Updated last year
- ☆114Mar 18, 2026Updated last week