TianshuoY / HKU-DASC7606-A1Links
☆25Updated last year
Alternatives and similar repositories for HKU-DASC7606-A1
Users that are interested in HKU-DASC7606-A1 are comparing it to the libraries listed below
Sorting:
- ☆15Updated last year
- ☆24Updated 5 months ago
- An example reproduction checklist for AAAI-26 submissions.☆103Updated 6 months ago
- ☆13Updated last year
- ☆1,482Updated last week
- 这是一个open-r1的复现项目,对0.5B、1.5B、3B、7B的qwen模型进行GRPO训练,观察到一些有趣的现象。☆54Updated 9 months ago
- ICLR 2025 Agent-Related Papers☆75Updated last year
- ☆25Updated 6 months ago
- Latest Advances on System-2 Reasoning☆1,320Updated 7 months ago
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆1,471Updated this week
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆1,201Updated this week
- 中山大学知识工程实验室介绍。☆39Updated 5 months ago
- Official repository for "CODI: Compressing Chain-of-Thought into Continuous Space via Self-Distillation"☆64Updated last month
- Survey on LLM Agents (Published on CoLing 2025)☆467Updated 3 months ago
- Building a comprehensive and handy list of papers for GUI agents☆620Updated 3 months ago
- ☆412Updated 11 months ago
- Latest Advances on Long Chain-of-Thought Reasoning☆605Updated 6 months ago
- ☆25Updated 10 months ago
- ☆455Updated 11 months ago
- Awesome List for Agentic RL☆738Updated last month
- Daily updated LLM papers. 每日更新 LLM 相关的论文,欢迎订阅 👏 喜欢的话动动你的小手 🌟 一个☆1,219Updated last year
- A Survey of Reinforcement Learning for Large Reasoning Models☆2,291Updated 2 months ago
- ☆25Updated last year
- [ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"☆468Updated 2 months ago
- MAD: The first work to explore Multi-Agent Debate with Large Language Models :D☆508Updated last month
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,314Updated 8 months ago
- Analyze top AI conference papers to discover research hotspots and trends using topic modeling.☆127Updated 3 weeks ago
- The official code of ARPO & AEPO☆872Updated 3 weeks ago
- Large Language Model based Multi-Agents: A Survey of Progress and Challenges (In IJCAI 2024)☆1,196Updated 2 months ago
- 🔥🔥 🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆418Updated last month