HKAIR-Lab / HK-O1awLinks
☆42Updated last year
Alternatives and similar repositories for HK-O1aw
Users that are interested in HK-O1aw are comparing it to the libraries listed below
Sorting:
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆63Updated last year
- ☆161Updated 11 months ago
- Repo for for paper "AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction".☆71Updated last year
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆166Updated 9 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆130Updated 9 months ago
- ☆96Updated last year
- Code and Data for Our NeurIPS 2024 paper "AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback"☆33Updated last year
- ☆51Updated last year
- Code and data for QueryAgent(ACL 2024)☆20Updated last year
- Adapt an LLM model to a Mixture-of-Experts model using Parameter Efficient finetuning (LoRA), injecting the LoRAs in the FFN.☆72Updated 2 months ago
- ☆54Updated last year
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆44Updated last year
- Scaling Preference Data Curation via Human-AI Synergy☆133Updated 5 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆106Updated 6 months ago
- ☆147Updated last year
- ☆233Updated last year
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking☆39Updated 11 months ago
- IKEA: Reinforced Internal-External Knowledge Synergistic Reasoning for Efficient Adaptive Search Agent☆67Updated 7 months ago
- This is the reading list for the survey "A Survey on the Optimization of LLM-based Agents ". We will keep adding papers and improving the…☆178Updated 5 months ago
- ☆95Updated last year
- OpenRFT: Adapting Reasoning Foundation Model for Domain-specific Tasks with Reinforcement Fine-Tuning☆154Updated last year
- ☆175Updated 7 months ago
- ☆36Updated last year
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆232Updated 11 months ago
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆41Updated 7 months ago
- ☆58Updated last year
- The demo, code and data of FollowRAG☆75Updated 5 months ago
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'☆32Updated 7 months ago
- PGRAG☆51Updated last year
- The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization☆128Updated last year