HKAIR-Lab / HK-O1awLinks
☆41Updated 7 months ago
Alternatives and similar repositories for HK-O1aw
Users that are interested in HK-O1aw are comparing it to the libraries listed below
Sorting:
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆78Updated last month
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆155Updated last week
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆61Updated 9 months ago
- ☆145Updated 5 months ago
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking☆38Updated 5 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆116Updated 3 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆68Updated last month
- The demo, code and data of FollowRAG☆73Updated 2 months ago
- ☆47Updated 2 weeks ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆58Updated last month
- ☆36Updated 9 months ago
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆53Updated last month
- ☆142Updated 11 months ago
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆67Updated 11 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆65Updated last month
- MPO: Boosting LLM Agents with Meta Plan Optimization☆58Updated 3 months ago
- An Awesome List of Reinforcement Learning-based Large Language Agent Works. Collect directly from official code base.☆154Updated this week
- ☆50Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- ☆91Updated last year
- ☆103Updated 6 months ago
- Awesome Deep Research list☆104Updated last week
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆100Updated 4 months ago
- ☆53Updated 9 months ago
- ☆56Updated 7 months ago
- ☆241Updated 2 weeks ago
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆97Updated last week
- This repo aims to record resource of role-playing abilities in LLMs, including dataset, paper, application, etc.☆123Updated 9 months ago
- ☆43Updated 8 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆84Updated 2 weeks ago