MadeAgents / HammerLinks
Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
☆84Updated 2 weeks ago
Alternatives and similar repositories for Hammer
Users that are interested in Hammer are comparing it to the libraries listed below
Sorting:
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆50Updated 7 months ago
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models☆44Updated 4 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆68Updated last month
- ☆95Updated 6 months ago
- Code implementation of synthetic continued pretraining☆114Updated 5 months ago
- ☆50Updated last year
- ☆103Updated 6 months ago
- ☆86Updated last month
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆159Updated 3 weeks ago
- [ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios☆58Updated last year
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆64Updated 3 weeks ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆135Updated last week
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆155Updated last week
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆47Updated 2 weeks ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆57Updated 8 months ago
- Reformatted Alignment☆113Updated 9 months ago
- ☆142Updated 11 months ago
- ☆47Updated 2 weeks ago
- Official code for the publication "Large Language Models as Zero-shot Dialogue State Tracker through Function Calling" https//arxiv.org/a…☆62Updated 10 months ago
- A Comprehensive Survey on Long Context Language Modeling☆152Updated 3 weeks ago
- ☆63Updated 7 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆144Updated 7 months ago
- Official repository for ACL 2025 paper "Model Extrapolation Expedites Alignment"☆73Updated last month
- On Memorization of Large Language Models in Logical Reasoning☆67Updated 2 months ago
- Collection of papers for scalable automated alignment.☆91Updated 8 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆116Updated 3 months ago
- ☆121Updated last year
- Critique-out-Loud Reward Models☆66Updated 8 months ago
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆53Updated last month
- ☆42Updated 8 months ago