TianshuoY / HKU-DASC7606-A1
☆25Updated last year
Alternatives and similar repositories for HKU-DASC7606-A1
Users that are interested in HKU-DASC7606-A1 are comparing it to the libraries listed below
- ☆15Updated last year
- ICLR 2025 Agent-Related Papers☆75Updated last year
- A reproduction of open-r1 that runs GRPO training on the 0.5B, 1.5B, 3B, and 7B Qwen models and observes some interesting phenomena.☆55Updated 9 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆1,215Updated last week
- ☆25Updated 6 months ago
- ☆1,541Updated 3 weeks ago
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,501Updated last week
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆510Updated last month
- ☆13Updated last year
- ☆24Updated 5 months ago
- A collection of recent papers on building autonomous agents. Two topics included: RL-based / LLM-based agents.☆739Updated last year
- ☆467Updated 6 months ago
- Survey on LLM Agents (Published on CoLing 2025)☆470Updated 4 months ago
- Large Language Model based Multi-Agents: A Survey of Progress and Challenges (In IJCAI 2024)☆1,203Updated 2 months ago
- An introduction to the Knowledge Engineering Laboratory at Sun Yat-sen University.☆40Updated 5 months ago
- A Survey of Reinforcement Learning for Large Reasoning Models☆2,316Updated 3 months ago
- ☆25Updated last year
- The official code of ARPO & AEPO☆880Updated last week
- Awesome List for Agentic RL☆787Updated 2 months ago
- ☆493Updated 4 months ago
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆424Updated last month
- ☆456Updated last year
- ☆25Updated 11 months ago
- An example reproduction checklist for AAAI-26 submissions.☆103Updated 6 months ago
- The paper list of "Memory in the Age of AI Agents: A Survey"☆1,179Updated this week
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆712Updated 4 months ago
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆172Updated last month
- [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"☆470Updated 3 months ago
- Daily updated LLM papers. Subscriptions welcome 👏 If you like it, please give it a 🌟☆1,221Updated last year
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering☆209Updated 9 months ago