MadeAgents/Hammer

Hammer: Robust Function-Calling for On-Device Language Models via Function Masking

☆120

Alternatives and similar repositories for Hammer

Users who are interested in Hammer are comparing it to the libraries listed below.

swt-user / DMPO
☆54 · Oct 10, 2024 · Updated last year
NJUDeepEngine / CAEF
Code for paper: "Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines"
☆11 · Oct 11, 2024 · Updated last year
fairyshine / Seal-Tools
The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…
☆57 · Nov 5, 2024 · Updated last year
hrwise-nlp / ToolsMeetLLMs
☆33 · May 8, 2025 · Updated last year
SalesforceAIResearch / xLAM
xLAM: A Family of Large Action Models to Empower AI Agent Systems
☆634 · Jun 2, 2026 · Updated last month
bespokelabsai / verifiers
Verifiers for LLM Reinforcement Learning
☆81 · Jul 17, 2026 · Updated last week
hrwise-nlp / AppBench
This is for EMNLP 2024 Paper: AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction
☆16 · Nov 4, 2024 · Updated last year
IBM / API-BLEND
Companion code to https://arxiv.org/abs/2402.15491
☆22 · Sep 18, 2025 · Updated 10 months ago
kongjiellx / octupus-tool-call
☆64 · May 4, 2025 · Updated last year
emrecanacikgoz / Tool-R0
☆35 · Apr 3, 2026 · Updated 3 months ago
THUNLP-MT / StableToolBench
A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.
☆237 · Apr 15, 2025 · Updated last year
IBM / NESTFUL
Companion code to https://arxiv.org/abs/2409.03797v2
☆19 · Sep 18, 2025 · Updated 10 months ago
ulab-uiuc / AgentProtocols
Opensource code for ICML 2026 poster
☆15 · Nov 26, 2025 · Updated 7 months ago
open-compass / T-Eval
[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step
☆312 · Apr 3, 2024 · Updated 2 years ago
JoeYing1019 / UltraTool
[ACL2024] Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios
☆71 · Aug 5, 2025 · Updated 11 months ago
xverse-ai / XVERSE-MoE-A36B
XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.
☆37 · Sep 12, 2024 · Updated last year
USTC-StarTeam / ZIP
arXiv 2024 | ZIP: entropy-law data selection for efficient LLM alignment.
☆28 · Jun 10, 2026 · Updated last month
zai-org / ComplexFuncBench
Complex Function Calling Benchmark.
☆180 · Jan 20, 2025 · Updated last year
inclusionAI / AWorld-RL
Agentic Learning Powered by AWorld
☆117 · Jun 18, 2026 · Updated last month
yuyq18 / StepTool
☆36 · May 24, 2025 · Updated last year
Di-viner / LLM-Robustness-to-Irrelevant-Information
[COLM'24] How Easily do Irrelevant Inputs Skew the Responses of Large Language Models?
☆23 · Oct 13, 2024 · Updated last year
Junjie-Ye / ToolSword
[ACL 2024] ToolSword: Unveiling Safety Issues of Large Language Models in Tool Learning Across Three Stages
☆15 · Sep 12, 2024 · Updated last year
Reason-Wang / ToolGen
[ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"
☆183 · Mar 26, 2025 · Updated last year
zhaoxlpku / SubgoalXL
☆26 · Aug 23, 2024 · Updated last year
SIMONLQY / RethinkMCTS
☆34 · Oct 2, 2024 · Updated last year
zwq2018 / Agent-Pro
The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
☆129 · Sep 2, 2024 · Updated last year
qdrant / workshop-rag-optimization
Notebooks for RAG optimization workshop, using HackerNews data
☆21 · Updated this week
WangHanLinHenry / STeCa
(ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"
☆29 · Mar 2, 2026 · Updated 4 months ago
john-hewitt / implicit-ins
Codebase for Instruction Following without Instruction Tuning
☆36 · Sep 24, 2024 · Updated last year
emrecanacikgoz / awesome-conversational-agents
Awesome paper lists for "A Desideratum for Conversational Agents: Capabilities, Challenges, and Future Directions"
☆34 · Apr 25, 2025 · Updated last year
microsoft / ToolTalk
Evaluating tool-augmented LLMs in conversation settings
☆89 · May 31, 2024 · Updated 2 years ago
apple / ToolSandbox
☆269 · Nov 7, 2025 · Updated 8 months ago
ferologics / Piwork
Work with Pi
☆15 · Feb 9, 2026 · Updated 5 months ago
tile-ai / tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆20 · Updated this week
Ber666 / ToolkenGPT
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)
☆271 · Apr 18, 2024 · Updated 2 years ago
layer6ai-labs / msc-sql
Text-2-SQL
☆19 · Feb 21, 2025 · Updated last year
RUCKBReasoning / SoAy
Codes for paper SoAy: A Service-oriented APIs Applying Framework of Large Language Models
☆27 · Jul 14, 2025 · Updated last year
HuanzhiMao / BFCL-Result
Public Evaluation Result Archive for BFCL
☆30 · Dec 17, 2025 · Updated 7 months ago
divelab / Sys2Bench
Sys2Bench is a benchmarking suite designed to evaluate reasoning and planning capabilities of large language models across algorithmic, l…
☆31 · Mar 5, 2025 · Updated last year