An reconstruction of RL Introduction and its course materials for a more efficient entry
☆21Mar 4, 2026Updated 3 months ago
Alternatives and similar repositories for distil-rl-introduction
Users that are interested in distil-rl-introduction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆14May 25, 2023Updated 3 years ago
- Robot Learning Algorithms☆26Aug 19, 2024Updated last year
- This announcement is used in the ATMHUFK's video. The original is from the another up,Which is called 原无奇变in Chinese.You can use it to av…☆10Jan 26, 2025Updated last year
- A lightweight cross-platform prompt manager for researchers to organize, reuse, and iterate high-quality prompts.☆47Apr 12, 2026Updated 2 months ago
- A platform for Applied Reinforcement Learning (Applied RL)☆14Jan 19, 2019Updated 7 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Ossian generic framework☆12Aug 25, 2021Updated 4 years ago
- 华中科技大学计算机网络实验2019级☆12Oct 24, 2022Updated 3 years ago
- 《python并发编程》简体中文版克隆并完善的☆21Mar 7, 2023Updated 3 years ago
- Datawhale开源教程《人工智能的数学基础》☆343Feb 14, 2026Updated 4 months ago
- 本项目是一个围绕 DeepLearning.AI 出品的 Post-Training for LLMs 系列课程,为国内学习者量身打造的中文翻译与知识整理教程。项目提供课程内容翻译、知识点梳理和示例代码等内容,旨在降低语言门槛,让更多学生、研究人员和开发者系统掌握大语言模型…☆223Jan 4, 2026Updated 5 months ago
- MLLM @ Game☆17May 12, 2025Updated last year
- OmiAD: One-Step Adaptive Masked Diffusion Model for Multi-class Anomaly Detection via Adversarial Distillation(ICML 2025)☆19May 29, 2025Updated last year
- ☆29Sep 1, 2025Updated 9 months ago
- ☆19Oct 27, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- MathorCup杯数学建模论文模板☆22Jan 14, 2020Updated 6 years ago
- Awesome_CV的中文版本,clone本项目到overleaf即可轻松愉快编写自己的CV☆17May 24, 2024Updated 2 years ago
- ☆13Jan 31, 2023Updated 3 years ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models☆51Sep 19, 2025Updated 8 months ago
- A Zola theme for hosting summary posts about academic papers☆24Apr 17, 2026Updated 2 months ago
- 从 NLP 到 LLM 的算法全栈教程,在线阅读地址:https://datawhalechina.github.io/base-llm/☆835Apr 22, 2026Updated last month
- [ICLR 2025 SynthData Workshop Spotlight] Empowering LLMs in Decision Games through Algorithmic Data Synthesis☆43Apr 27, 2025Updated last year
- SHIT journal Latex template 《SHIT》期刊latex模板☆55Mar 7, 2026Updated 3 months ago
- 强化学习贪吃蛇☆17Oct 19, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- typora免费版-源自于网络☆13Feb 12, 2025Updated last year
- CS 294-112 @ UCB Deep RL☆30Mar 24, 2023Updated 3 years ago
- A LaTeX beamer theme template for Jilin University students. 吉林大学beamer模板.☆18May 12, 2021Updated 5 years ago
- [CVPR 2026 Findings] Official Implementation for "CoPS: Conditional Prompt Synthesis for Zero-Shot Anomaly Detection"☆42Jun 2, 2026Updated 2 weeks ago
- The official repo for ”[WACV2025] Towards Accurate Unified Anomaly Segmentation“☆15Apr 14, 2025Updated last year
- A curated list of awesome frameworks, libraries, tools, environments, tutorials, research papers, and resources for reinforcement learnin…☆50May 11, 2026Updated last month
- 液体火箭推力矢量LQR控制算法☆33Apr 11, 2025Updated last year
- Ailanxier's note of Database Systems☆11Jan 18, 2022Updated 4 years ago
- This is official repository of Physics-AD☆21Feb 24, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Multi-View Monocular 3D (MVM3D) detection dataset based on RoboMaster University AI Challenge.☆25Sep 6, 2022Updated 3 years ago
- Repository for ‘Anomaly Detection and Generation with Diffusion Models: A Survey’.☆39Jun 15, 2025Updated last year
- Official Pytorch implementation for "IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION" [ICLR 2025]☆63Mar 22, 2025Updated last year
- AI Creative Platform☆26Jan 18, 2026Updated 5 months ago
- TLoL (Python Module) - League of Legends Deep Learning AI (Research and Development)☆21Dec 23, 2023Updated 2 years ago
- Home page for Microsoft Phi-Ground tech-report☆23Sep 8, 2025Updated 9 months ago
- ☆21Dec 8, 2024Updated last year