TianshuoY / HKU-DASC7606-A1Links
☆25Updated last year
Alternatives and similar repositories for HKU-DASC7606-A1
Users that are interested in HKU-DASC7606-A1 are comparing it to the libraries listed below
Sorting:
- ☆15Updated last year
- ☆13Updated last year
- ICLR 2025 Agent-Related Papers☆74Updated last year
- 这是一个open-r1的复现项目,对0.5B、1.5B、3B、7B的qwen模型进行GRPO训练,观察到一些有趣的现象。☆52Updated 8 months ago
- ☆1,292Updated 3 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆1,025Updated 3 weeks ago
- ☆441Updated 2 months ago
- ☆25Updated last year
- A multi-agent LaTeX translation system that converts English LaTeX documents (e.g., arXiv papers) into PDFs in other languages with a sin…☆20Updated 2 months ago
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,272Updated this week
- A collection on the recent reproduction papers and projects on DeepSeek-R1☆32Updated 9 months ago
- The official code of ARPO & AEPO☆822Updated last month
- 本仓库是关于大模型面试中常见面试试题和面试经验的整理。这里收集了各类与大模型相关的面试题目,并提供详细的解答和分析。本仓库由上海交大交影社区维护☆110Updated last year
- Survey on LLM Agents (Published on CoLing 2025)☆442Updated 2 months ago
- [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"☆448Updated last month
- An example reproduction checklist for AAAI-26 submissions.☆105Updated 4 months ago
- A Collection of Papers about Memory for Language Agents☆188Updated 3 weeks ago
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"☆48Updated 3 months ago
- 李宏毅2022机器学习作业HW01-HW15完整解答☆26Updated last month
- Large Language Model based Multi-Agents: A Survey of Progress and Challenges (In IJCAI 2024)☆1,152Updated 3 weeks ago
- This is the repository for the Tool Learning survey.☆461Updated 4 months ago
- Awesome List for Agentic RL☆585Updated last week
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo…☆94Updated 4 months ago
- ☆21Updated 9 months ago
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆477Updated 11 months ago
- ☆440Updated 4 months ago
- ☆466Updated 4 months ago
- Training VLM agents with multi-turn reinforcement learning☆342Updated 2 weeks ago
- Latest Advances on Long Chain-of-Thought Reasoning☆568Updated 4 months ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆665Updated 3 months ago