ShortcutsBench: A Large-Scale Real-World Benchmark for API-Based Agents
☆112Jun 24, 2025Updated 9 months ago
Alternatives and similar repositories for ShortcutsBench
Users that are interested in ShortcutsBench are comparing it to the libraries listed below.
- Official implementation of MASS: Multi-Agent Simulation Scaling for Portfolio Construction☆172Feb 9, 2026Updated 2 months ago
- MobiSys#114☆23Aug 17, 2023Updated 2 years ago
- [IJCAI 2023] official implementation of the paper SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation☆35Jun 20, 2023Updated 2 years ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆53Nov 5, 2024Updated last year
- This is the official implementation of VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Mode…☆17Mar 4, 2025Updated last year
- The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization☆21May 26, 2025Updated 10 months ago
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation☆67Aug 9, 2024Updated last year
- Automate the Boring Stuff with Apple Shortcuts☆109Mar 15, 2026Updated 3 weeks ago
- ☆49Jun 2, 2022Updated 3 years ago
- OVMR: Open-Vocabulary Recognition with Multi-Modal References (CVPR24)☆36Jun 16, 2025Updated 9 months ago
- Northwestern Polytechnical University undergraduate thesis template (ctex version)☆14Mar 12, 2022Updated 4 years ago
- ☆102Jan 17, 2024Updated 2 years ago
- Implementation of the logging layer of our SOSP '23 paper Halfmoon☆11Jul 28, 2023Updated 2 years ago
- Automates the creation of JSON template files for Obsidian WebClipper.☆46Feb 19, 2025Updated last year
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 6 months ago
- ☆11Nov 8, 2023Updated 2 years ago
- ☆22May 23, 2025Updated 10 months ago
- Latest: 7.0.0 - Lightweight and ready-to-use services to easily connect an IDS-Connector to different IDS-Infrastructure-Components.☆14Mar 4, 2024Updated 2 years ago
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models☆36Nov 3, 2024Updated last year
- [COLING 2025] NesTools: A Dataset for Evaluating Nested Tool Learning Abilities of Large Language Models☆18Jan 18, 2025Updated last year
- ☆45Apr 11, 2024Updated 2 years ago
- ☆14Aug 3, 2024Updated last year
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆28Mar 14, 2024Updated 2 years ago
- A personal shuttle bus assistant for people of the New Yanyuan campus (unofficial).☆71Jul 20, 2025Updated 8 months ago
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14Nov 17, 2025Updated 4 months ago
- ☆12Apr 26, 2023Updated 2 years ago
- A website for song obsessions☆15Mar 27, 2026Updated 2 weeks ago
- Automatic ReLU Reduction☆15Dec 20, 2023Updated 2 years ago
- ☆13Jan 22, 2025Updated last year
- [ICLR 2024] Towards Robust Multi-Modal Reasoning via Model Selection☆14Mar 7, 2024Updated 2 years ago
- Download Spotify Tracks, Albums, Playlists as MP3/OGG/Opus with High Quality.☆32Jan 13, 2026Updated 3 months ago
- The working repository of the IDSA Rulebook Working Group☆24Mar 30, 2026Updated 2 weeks ago
- Undergraduate Thesis Template for Northwestern Polytechnical University☆282May 17, 2023Updated 2 years ago
- Watches your screen while you do sales and fills in your CRM automatically☆17Jun 2, 2024Updated last year
- Python implementation for Zone Routing Protocol for satellite network☆13Jan 4, 2024Updated 2 years ago
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- Official Pytorch implementation of 'Facing the Elephant in the Room: Visual Prompt Tuning or Full Finetuning'? (ICLR2024)☆13Mar 8, 2024Updated 2 years ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- Demo project for Talk @ Swift Heroes 2024☆10Sep 5, 2024Updated last year