ShortcutsBench: A Large-Scale Real-World Benchmark for API-Based Agents
☆111Jun 24, 2025Updated 10 months ago
Alternatives and similar repositories for ShortcutsBench
Users that are interested in ShortcutsBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆73May 8, 2026Updated 2 weeks ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆56Nov 5, 2024Updated last year
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization☆21May 26, 2025Updated 11 months ago
- Berkeley Function Calling Leaderboard (BFCL) with Chinese-Language Evaluation☆25Apr 6, 2025Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆14Jun 3, 2025Updated 11 months ago
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆70Aug 9, 2024Updated last year
- Automate the Boring Stuff with Apple Shortcuts☆121May 3, 2026Updated 2 weeks ago
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆35Dec 23, 2024Updated last year
- A mobile GUI search engine using a vision-language model☆14May 5, 2025Updated last year
- iOS shortcut / Docker endpoint to access LLM models on iOS☆25Jan 13, 2024Updated 2 years ago
- ☆102Jan 17, 2024Updated 2 years ago
- Automates the creation of JSON template files for Obsidian WebClipper.☆49Feb 19, 2025Updated last year
- 计算语言学22-23学年秋季学期 课程大作业baseline实现☆38Dec 8, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- OpenAI Chat Completion right in your keyboard on iOS☆15Mar 7, 2023Updated 3 years ago
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 8 months ago
- ☆11Nov 8, 2023Updated 2 years ago
- ☆22May 23, 2025Updated last year
- ☆24Dec 9, 2024Updated last year
- Latest: 7.0.0 - Lightweight and ready-to-use services to easily connect an IDS-Connector to different IDS-Infrastructure-Components.☆14Mar 4, 2024Updated 2 years ago
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models☆36Nov 3, 2024Updated last year
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- ☆14Aug 3, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆25Nov 16, 2023Updated 2 years ago
- ☆28Sep 15, 2025Updated 8 months ago
- ☆215Jan 17, 2024Updated 2 years ago
- Collectd exec plugin for monitoring the network bandwitdh usage☆13Aug 20, 2015Updated 10 years ago
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆23Mar 2, 2025Updated last year
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆28Mar 14, 2024Updated 2 years ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆39Jan 16, 2026Updated 4 months ago
- Real time high precision network monitor☆10Feb 24, 2019Updated 7 years ago
- ☆21Dec 5, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Sync GitHub starred repos to a Raindrop.io collection☆89Feb 7, 2025Updated last year
- 西北工业大学本科毕业设计论文模版 | Thesis Template for Northwestern Polytechnical University☆290May 17, 2023Updated 3 years ago
- ☆21Apr 16, 2025Updated last year
- watch your screen while doing sales and fill your crm automatically☆17Jun 2, 2024Updated last year
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)☆13Mar 8, 2024Updated 2 years ago
- 服务器 GPU 监控程序,当 GPU 属性满足预设条件时通过微信发送提示消息☆34Aug 10, 2021Updated 4 years ago
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated last year