zhaoxlpku / DASC7606-A3
☆13Updated 3 months ago
Alternatives and similar repositories for DASC7606-A3:
Users that are interested in DASC7606-A3 are comparing it to the libraries listed below
- ☆26Updated 5 months ago
- NJUAI-Master-Courses☆18Updated last year
- 这是一个高效,快捷的arXiv论文爬虫,它可以将指定时间范围,指定主题,包含指定关键词的论文信息爬取到本地,并且将其中的标题和摘要翻译成中文。☆70Updated 5 months ago
- Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.☆298Updated 5 months ago
- AutoDL平台服务器适配梯子, 使用 Clash 作为代理工具☆208Updated 3 months ago
- Official code for ICML 2024 paper, "RIME: Robust Preference-based Reinforcement Learning with Noisy Preferences" (ICML 2024 Spotlight)☆28Updated 4 months ago
- 强化学习第二版习题解答与代码案例 Solutions and codes for Reinforcement Learning second edition☆139Updated 3 years ago
- Implement a MNIST(also minimal) version of denoising diffusion probabilistic model from scratch.The model only has 4.55MB.☆97Updated 2 years ago
- Deep Learning For Computer Vision Winter 2022 By Prof. Justin Johnson☆22Updated 2 years ago
- 欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩🎓👨🎓☆541Updated this week
- 复旦大学体育场馆自动预约 FDU Sports Auto Reserve☆70Updated last year
- 本仓库是关于大模型面试中常见面试试题和面试经验的整理。这里收集了各类与大模型相关的面试题目,并提供详细的解答和分析。本仓库由上海交大交影社区维护☆72Updated 5 months ago
- ☆15Updated 10 months ago
- Official code for article "LLMLight: Large Language Models as Traffic Signal Control Agents".☆179Updated last week
- Download papers and supplemental materials from open-access paper website, such as AAAI, AAMAS, AISTATS, COLT, CORL, CVPR, ECCV, ICCV, IC…☆246Updated this week
- code for DyPS: Dynamic Parameter Sharing in Multi-Agent Reinforcement Learning for Spatio-Temporal Resource Allocation☆19Updated 3 months ago
- ☆323Updated 2 weeks ago
- ☆90Updated 2 years ago
- 必要的计算机科学及软件开发知识☆25Updated 3 weeks ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,661Updated last month
- Large Reasoning Models☆801Updated 2 months ago
- classification and solutions for PKU-CSSummerCamp-OnlineJudge☆14Updated last year
- LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmi…☆107Updated last month
- ☆476Updated last month
- modern AI for beginners☆102Updated 3 weeks ago
- Shanghai Jiao Tong University 2023-2024, CS3601 Operating System☆20Updated last year
- ☆20Updated 2 years ago