☆261Nov 7, 2025Updated 7 months ago
Alternatives and similar repositories for ToolSandbox
Users that are interested in ToolSandbox are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code and Data for Tau-Bench☆1,302Mar 18, 2026Updated 3 months ago
- Complex Function Calling Benchmark.☆181Jan 20, 2025Updated last year
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 9 months ago
- [ICLR'24] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆114Mar 21, 2024Updated 2 years ago
- xLAM: A Family of Large Action Models to Empower AI Agent Systems☆630Jun 2, 2026Updated 3 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This is the repository for the Tool Learning survey.☆485Aug 9, 2025Updated 10 months ago
- ☆21Jul 25, 2025Updated 11 months ago
- Official repository of Graph RAG-Tool Fusion and ToolLinkOS dataset.☆24Feb 13, 2025Updated last year
- Nexusflow function call, tool use, and agent benchmarks.☆29Dec 13, 2024Updated last year
- Aioli: A unified optimization framework for language model data mixing☆32Jan 17, 2025Updated last year
- [ICLR 2025 SCI-FM Workshop] Lemur: Log Parsing with Entropy Sampling and Chain-of-Thought Merging☆14Mar 27, 2025Updated last year
- ☆419Feb 13, 2024Updated 2 years ago
- Verifiers for LLM Reinforcement Learning☆80Apr 15, 2025Updated last year
- [ICLR 2025] Automated Design of Agentic Systems☆1,598Jan 28, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents & [arXiv 2026] GTA-2☆146Apr 20, 2026Updated 2 months ago
- Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges☆30May 14, 2025Updated last year
- Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024☆27Jul 7, 2024Updated last year
- (ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation☆12May 21, 2025Updated last year
- ☆17Sep 1, 2024Updated last year
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆119Jun 13, 2025Updated last year
- ☆187Oct 29, 2025Updated 8 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆36Oct 3, 2024Updated last year
- Code for 'Contrastive Multi-Document Question Generation'☆11Oct 16, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Chat language model that can use tools and interpret the results☆1,596Dec 3, 2025Updated 6 months ago
- [ACL2025 Findings] Benchmarking Multihop Multimodal Internet Agents☆54Feb 27, 2025Updated last year
- RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best…☆62Mar 17, 2025Updated last year
- Federated Learning - PyTorch☆15Jun 27, 2021Updated 5 years ago
- ☆15Jul 1, 2020Updated 6 years ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,921Apr 13, 2026Updated 2 months ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- Companion code to https://arxiv.org/abs/2409.03797v2☆19Sep 18, 2025Updated 9 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Jun 3, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆120Mar 18, 2026Updated 3 months ago
- SKT A.X LLM 3.1☆13Jul 24, 2025Updated 11 months ago
- ☆16Mar 24, 2023Updated 3 years ago
- ☆27Feb 18, 2025Updated last year
- ☆646Jun 2, 2026Updated 3 weeks ago
- Adversarial Training and SFT for Bot Safety Models☆41Apr 18, 2023Updated 3 years ago
- Core ML Compiler is an iPadOS/iOS app to convert a .mlmodel file into a .mlmodelc file.☆17Jan 8, 2026Updated 5 months ago