MadeAgents / Hammer
Hammer: Robust Function-Calling for On-Device Language Models via Function Masking
☆31Updated last month
Related projects ⓘ
Alternatives and complementary repositories for Hammer
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆59Updated last month
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆25Updated 5 months ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆35Updated last week
- ☆48Updated 8 months ago
- ☆51Updated 3 weeks ago
- ☆34Updated 2 months ago
- ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆62Updated 7 months ago
- ☆78Updated 6 months ago
- Fantastic Data Engineering for Large Language Models☆49Updated 3 months ago
- Official repository for paper "GTA: A Benchmark for General Tool Agents" (NeurIPS 2024 D&B Track)☆43Updated this week
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆38Updated 4 months ago
- 代码大模型 预训练&微调&DPO 数据处理 业界处理pipeline sota☆26Updated 3 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆46Updated last month
- Reformatted Alignment☆112Updated last month
- ☆37Updated 3 weeks ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L…☆37Updated 4 months ago
- FuseAI Project☆76Updated 2 months ago
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆58Updated 7 months ago
- Code implementation of synthetic continued pretraining☆54Updated last month
- ☆37Updated 4 months ago
- CLongEval: A Chinese Benchmark for Evaluating Long-Context Large Language Models☆38Updated 8 months ago
- ☆56Updated 2 weeks ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆34Updated 10 months ago
- Automatic prompt optimization framework for multi-step agent tasks.☆20Updated 3 weeks ago
- code for paper 《RankingGPT: Empowering Large Language Models in Text Ranking with Progressive Enhancement》☆29Updated 10 months ago
- ☆129Updated 4 months ago
- ☆55Updated this week
- ☆29Updated this week
- The official implementation of "Ada-LEval: Evaluating long-context LLMs with length-adaptable benchmarks"☆50Updated 6 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆56Updated 8 months ago