ShortcutsBench: A Large-Scale Real-World Benchmark for API-Based Agents
☆112Jun 24, 2025Updated 10 months ago
Alternatives and similar repositories for ShortcutsBench
Users that are interested in ShortcutsBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [IJCAI 2023] official implementation of the paper SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation☆36Jun 20, 2023Updated 2 years ago
- ☆62Updated this week
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization☆21May 26, 2025Updated 11 months ago
- Berkeley Function Calling Leaderboard (BFCL) with Chinese-Language Evaluation☆25Apr 6, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆14Jun 3, 2025Updated 11 months ago
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆35Dec 23, 2024Updated last year
- A mobile GUI search engine using a vision-language model☆14May 5, 2025Updated 11 months ago
- 西北工业大学本科毕业设计论文模版ctex版☆14Mar 12, 2022Updated 4 years ago
- ☆102Jan 17, 2024Updated 2 years ago
- ☆15Jul 13, 2021Updated 4 years ago
- Automates the creation of JSON template files for Obsidian WebClipper.☆49Feb 19, 2025Updated last year
- [ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling☆21Jul 7, 2025Updated 9 months ago
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 7 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆22May 23, 2025Updated 11 months ago
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models☆36Nov 3, 2024Updated last year
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- ☆25Nov 16, 2023Updated 2 years ago
- ☆28Sep 15, 2025Updated 7 months ago
- ☆213Jan 17, 2024Updated 2 years ago
- ☆14Oct 18, 2024Updated last year
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆22Mar 2, 2025Updated last year
- ☆13May 11, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆10Mar 20, 2020Updated 6 years ago
- ☆14Jul 23, 2017Updated 8 years ago
- An Open Source Machine Learning Framework for Everyone☆16Aug 27, 2020Updated 5 years ago
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14Nov 17, 2025Updated 5 months ago
- Chain of Images for Intuitively Reasoning☆10Nov 29, 2023Updated 2 years ago
- [AAAI 2025] CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities☆53Jan 12, 2025Updated last year
- [ICLR 2024] Towards Robust Multi-Modal Reasoning via Model Selection☆14Mar 7, 2024Updated 2 years ago
- Allow you walk when others run☆10Dec 6, 2019Updated 6 years ago
- ☆20Apr 16, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- watch your screen while doing sales and fill your crm automatically☆17Jun 2, 2024Updated last year
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- KafeDB: End-to-End Structurally-Encrypted Database System. Based on Apache Spark SQL.☆12Nov 11, 2021Updated 4 years ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated 11 months ago
- 🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource…☆409Feb 17, 2026Updated 2 months ago
- ☆12Nov 26, 2019Updated 6 years ago