TianshuoY / HKU-DASC7606-A1Links
☆26Updated 8 months ago
Alternatives and similar repositories for HKU-DASC7606-A1
Users that are interested in HKU-DASC7606-A1 are comparing it to the libraries listed below
Sorting:
- ☆16Updated 8 months ago
- ☆13Updated 6 months ago
- 《EasyOffer》(<大模型面经合集>)是针对LLM宝宝们量身打造的大模型暑期实习Offer指南,主要记录大模型暑期实习和秋招准备的一些常见大厂手撕代码、大厂面经经验、常见大厂思考题等;小白一个,正在学习ing......有问题各位大佬随时指正,希望大家都能拿到心仪Of…☆231Updated 2 months ago
- ICLR 2025 Agent-Related Papers☆71Updated 6 months ago
- ☆383Updated last month
- Awesome RL Reasoning Recipes ("Triple R")☆605Updated this week
- Latest Advances on Long Chain-of-Thought Reasoning☆358Updated last week
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆531Updated last week
- Awesome RL-based LLM Reasoning☆511Updated last month
- ☆216Updated 2 weeks ago
- LLM大模型(重点)以及搜广推等 AI 算法中手写的面试题,(非 LeetCode),比如 Self-Attention, AUC等,一般比 LeetCode 更考察一个人的综合能力,又更贴近业务和基础知识一点☆283Updated 5 months ago
- AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)☆250Updated last month
- 关于LLM和Multimodal LLM的paper list☆40Updated last week
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆184Updated last month
- ☆540Updated 5 months ago
- ☆19Updated last week
- Latest Advances on System-2 Reasoning☆1,052Updated last month
- Survey on LLM Agents (Published on CoLing 2025)☆283Updated last month
- TTRL: Test-Time Reinforcement Learning☆592Updated 2 weeks ago
- 😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond☆237Updated this week
- Awesome Agent Training☆141Updated this week
- llm & rl☆139Updated this week
- minimal-cost for training 0.5B R1-Zero☆734Updated 3 weeks ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆391Updated last month
- llm相关内容,包括:基础知识、八股文、面经、经典论文☆130Updated last year
- Awesome-Long2short-on-LRMs is a collection of state-of-the-art, novel, exciting long2short methods on large reasoning models. It contains…☆221Updated this week
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆381Updated 4 months ago
- ☆76Updated 9 months ago
- Agentic Workflow - Daily Track on Arxiv.org Paper☆44Updated 3 months ago
- Paper list for Efficient Reasoning.☆467Updated last week