HKAIR-Lab / HK-O1awLinks
☆43Updated 8 months ago
Alternatives and similar repositories for HK-O1aw
Users that are interested in HK-O1aw are comparing it to the libraries listed below
Sorting:
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆62Updated 9 months ago
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆67Updated 11 months ago
- ☆91Updated last year
- ☆147Updated 5 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆87Updated last month
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆79Updated last month
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud☆119Updated last month
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆60Updated 2 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆150Updated 3 months ago
- ☆94Updated 7 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆121Updated 4 months ago
- ☆36Updated 10 months ago
- Code and Data for Our NeurIPS 2024 paper "AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback"☆33Updated 8 months ago
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆73Updated 5 months ago
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆201Updated this week
- ☆69Updated 10 months ago
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆45Updated 5 months ago
- connecting humans and agents☆86Updated 7 months ago
- ☆53Updated 10 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆123Updated last week
- The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization☆115Updated 10 months ago
- ☆155Updated 2 months ago
- ☆144Updated last year
- Scaling Preference Data Curation via Human-AI Synergy☆80Updated 2 weeks ago
- ☆50Updated last year
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆36Updated 2 months ago
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆103Updated last month
- The demo, code and data of FollowRAG☆73Updated 2 weeks ago
- ☆47Updated last month
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆82Updated last week