TianshuoY / HKU-DASC7606-A1Links
☆25Updated last year
Alternatives and similar repositories for HKU-DASC7606-A1
Users that are interested in HKU-DASC7606-A1 are comparing it to the libraries listed below
Sorting:
- ☆15Updated last year
- 这是一个open-r1的复现项目,对0.5B、1.5B、3B、7B的qwen模型进行GRPO训练,观察到一些有趣的现象。☆55Updated 9 months ago
- ICLR 2025 Agent-Related Papers☆75Updated last year
- ☆25Updated 6 months ago
- ☆1,513Updated 2 weeks ago
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"☆64Updated last month
- ☆13Updated last year
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆1,201Updated last week
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,471Updated last week
- Awesome List for Agentic RL☆760Updated last month
- A Collection of Papers about Memory for Language Agents☆310Updated 2 weeks ago
- ☆492Updated 2 weeks ago
- ☆24Updated 5 months ago
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆418Updated last month
- A Survey of Reinforcement Learning for Large Reasoning Models☆2,316Updated 2 months ago
- LLM 101: 一起入门大语言模型 课程网站☆14Updated last year
- Building a comprehensive and handy list of papers for GUI agents☆620Updated 3 months ago
- Latest Advances on System-2 Reasoning☆1,320Updated 7 months ago
- ☆490Updated 3 months ago
- [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"☆470Updated 2 months ago
- ☆25Updated last year
- ☆15Updated 10 months ago
- ☆17Updated last year
- 本仓库是关于大模型面试中常见面试试题和面试经验的整理。这里收集了各类与大模型相关的面试题目,并提供详细的解答和分析。本仓库由上海交大交影社区维护☆116Updated last year
- ☆25Updated 11 months ago
- ☆467Updated 6 months ago
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆209Updated 9 months ago
- An example reproduction checklist for AAAI-26 submissions.☆103Updated 6 months ago
- Large Language Model based Multi-Agents: A Survey of Progress and Challenges (In IJCAI 2024)☆1,196Updated 2 months ago
- A collection on the recent reproduction papers and projects on DeepSeek-R1☆32Updated 11 months ago