An reconstruction of RL Introduction and its course materials for a more efficient entry
☆22Mar 4, 2026Updated 2 months ago
Alternatives and similar repositories for distil-rl-introduction
Users that are interested in distil-rl-introduction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 这是对基于大模型的多智能体系统论文的总结☆10Jun 23, 2024Updated last year
- 最基本最小白的自然语言处理入门读物,基于deepseek-r1,涵盖了传统NLP和现代大模型☆27Jan 16, 2026Updated 3 months ago
- ☆17Sep 17, 2023Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆14May 25, 2023Updated 2 years ago
- 臸娥粂陆亩竟☆10May 11, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Robot Learning Algorithms☆26Aug 19, 2024Updated last year
- This announcement is used in the ATMHUFK's video. The original is from the another up,Which is called 原无奇变in Chinese.You can use it to av…☆10Jan 26, 2025Updated last year
- A platform for Applied Reinforcement Learning (Applied RL)☆13Jan 19, 2019Updated 7 years ago
- android compose catalog☆17Jul 4, 2025Updated 10 months ago
- Ossian generic framework☆12Aug 25, 2021Updated 4 years ago
- SRL: Scaling Distributed Reinforcement Learning to Over Ten Thousand Cores☆15Apr 24, 2024Updated 2 years ago
- 本项目是一个围绕 DeepLearning.AI 出品的 Post-Training for LLMs 系列课程,为国内学习者量身打造的中文翻译与知识整理教程。项目提供课程内容翻译、知识点梳理和示例代码等内容,旨在降低语言门槛,让更多学生、研究人员和开发者系统掌握大语言模型…☆197Jan 4, 2026Updated 4 months ago
- Intrinsic Curiosity Module (ICM) + PPO on the Pyramid and PushBlock environment.☆12Sep 3, 2019Updated 6 years ago
- A simple 1-d diffusion/flow model tutorial for LeCAR group meeting☆16Sep 27, 2025Updated 7 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- MLLM @ Game☆16May 12, 2025Updated 11 months ago
- OmiAD: One-Step Adaptive Masked Diffusion Model for Multi-class Anomaly Detection via Adversarial Distillation(ICML 2025)☆18May 29, 2025Updated 11 months ago
- ☆19Oct 27, 2025Updated 6 months ago
- Awesome_CV的中文版本,clone本项目到overleaf即可轻松愉快编写自己的CV☆17May 24, 2024Updated last year
- A collection of advanced tools for large-scale high-quality mesh data preparing☆34May 16, 2025Updated 11 months ago
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆42Apr 27, 2025Updated last year
- 强化学习贪吃蛇☆17Oct 19, 2023Updated 2 years ago
- typora免费版-源自于网络☆13Feb 12, 2025Updated last year
- The official repo for ”[WACV2025] Towards Accurate Unified Anomaly Segmentation“☆15Apr 14, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This is official repository of Physics-AD☆21Feb 24, 2026Updated 2 months ago
- The SAIL-VL2 series model developed by the BytedanceDouyinContent Group☆79Sep 18, 2025Updated 7 months ago
- Repository for ‘Anomaly Detection and Generation with Diffusion Models: A Survey’.☆38Jun 15, 2025Updated 10 months ago
- Simulation for DJI Pupper v2 robot☆17Oct 17, 2023Updated 2 years ago
- Project Report Format in Typst☆23May 22, 2023Updated 2 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Sep 10, 2020Updated 5 years ago
- ☆291Apr 21, 2026Updated 2 weeks ago
- The old version of Hugo academic theme | 旧版Hugo学术主题☆15Aug 23, 2021Updated 4 years ago
- AMD YES!☆47Apr 30, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 这是2023华为软件精英挑战赛 初赛阶段319万分的代码,广西省第一名,粤港澳区排名第8。该比赛要求选手在一个50m*50m的地图上,控制4台机器人进入任务调度,设计机器人的运动算法、路径规划算法、任务调度算法,去分布在地图上的各种类型的工作台购买或者出售商品,赚取差价,以…☆17Sep 2, 2023Updated 2 years ago
- Official Pytorch Codebase for Towards Online Domain Adaptive Object Detection [WACV 2023]☆44Mar 20, 2023Updated 3 years ago
- ☆37Apr 27, 2023Updated 3 years ago
- A community-driven open-source meme culture music community☆68Apr 7, 2026Updated last month
- Implementation of an RL based agent, which utilizes Q-Learning to develop a policy for effectively solving a 3x3x3 rubiks cube☆18Mar 12, 2019Updated 7 years ago
- ☆37Jul 12, 2025Updated 9 months ago
- A Really Scalable RL Framework to 10k+ CPUs☆38Feb 29, 2024Updated 2 years ago