HKAIR-Lab / HK-O1aw
☆41Updated 5 months ago
Alternatives and similar repositories for HK-O1aw:
Users who are interested in HK-O1aw are comparing it to the repositories listed below
- ☆115Updated 2 months ago
- Repo for the paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆64Updated 8 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆63Updated last month
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents". We will keep adding papers and improving the…☆50Updated this week
- Code and Data for Our NeurIPS 2024 paper "AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback"☆30Updated 4 months ago
- ☆44Updated 3 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆133Updated last week
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆60Updated 6 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆75Updated 2 weeks ago
- ☆47Updated last month
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆30Updated 10 months ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆59Updated last month
- ☆84Updated last month
- ☆88Updated last year
- Code and data for QueryAgent (ACL 2024)☆21Updated 3 months ago
- ☆47Updated last month
- ☆36Updated 6 months ago
- connecting humans and agents☆80Updated 3 months ago
- ☆51Updated 6 months ago
- ☆54Updated 5 months ago
- ☆54Updated 5 months ago
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking☆38Updated 2 months ago
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆63Updated 2 months ago
- 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability☆88Updated this week
- ☆142Updated 9 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated 9 months ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆43Updated 3 months ago
- ☆92Updated 3 months ago
- ☆125Updated 3 weeks ago
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆44Updated this week