ShortcutsBench: A Large-Scale Real-World Benchmark for API-Based Agents
☆110 · Jun 24, 2025 · Updated 8 months ago
Alternatives and similar repositories for ShortcutsBench
Users who are interested in ShortcutsBench are comparing it to the repositories listed below.
- Official implementation of MASS: Multi-Agent Simulation Scaling for Portfolio Construction ☆166 · Feb 9, 2026 · Updated 3 weeks ago
- iOS shortcut / Docker endpoint to access LLM models on iOS ☆21 · Jan 13, 2024 · Updated 2 years ago
- ☆14 · Jun 3, 2025 · Updated 9 months ago
- A mobile GUI search engine using a vision-language model ☆14 · May 5, 2025 · Updated 9 months ago
- MobiSys#114 ☆23 · Aug 17, 2023 · Updated 2 years ago
- The implementation of RAGSynth: Synthetic Data for Robust and Faithful RAG Component Optimization ☆21 · May 26, 2025 · Updated 9 months ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar… ☆53 · Nov 5, 2024 · Updated last year
- watch your screen while doing sales and fill your crm automatically ☆17 · Jun 2, 2024 · Updated last year
- ☆48 · Jun 2, 2022 · Updated 3 years ago
- OpenAI Chat Completion right in your keyboard on iOS ☆15 · Mar 7, 2023 · Updated 2 years ago
- combine source code files into single prompt to chat with your repository ☆14 · May 15, 2024 · Updated last year
- LlamaTouch: A Faithful and Scalable Testbed for Mobile UI Task Automation ☆67 · Aug 9, 2024 · Updated last year
- ☆17 · Oct 7, 2025 · Updated 4 months ago
- ☆22 · May 23, 2025 · Updated 9 months ago
- ☆25 · Jan 13, 2026 · Updated last month
- ☆11 · Updated this week
- Accessible Python client to debug and interact with screenpipe. ☆25 · Jan 11, 2025 · Updated last year
- Johnny.Decimal folders normalization tool ☆19 · Dec 20, 2024 · Updated last year
- Companion code to https://arxiv.org/abs/2402.15491 ☆22 · Sep 18, 2025 · Updated 5 months ago
- Open-Source Signed Shortcut CLI tool for Linux+macOS ☆29 · Jun 12, 2025 · Updated 8 months ago
- MoonDAO documentation, planning, project notes, and other reference material. ☆11 · Feb 14, 2026 · Updated 2 weeks ago
- DELT: Data Efficacy for Language Model Training ☆43 · Feb 12, 2026 · Updated 2 weeks ago
- PSDify: A PowerShell module for workspace management for Dify, featuring various cmdlets for managing Apps, Knowledges, Models, and Membe… ☆21 · Feb 18, 2026 · Updated 2 weeks ago
- ☆24 · Nov 24, 2024 · Updated last year
- A personal shuttle bus assistant for people at the New Yanyuan (新燕园) campus (unofficial). ☆69 · Jul 20, 2025 · Updated 7 months ago
- Optimizing compiler for SysY (C subset) ☆44 · Apr 14, 2024 · Updated last year
- ☆24 · Nov 16, 2023 · Updated 2 years ago
- 🌴 Simple utility for managing parallel Claude Code instances ☆44 · Jul 7, 2025 · Updated 7 months ago
- Config files for my GitHub profile. ☆20 · Jan 27, 2026 · Updated last month
- ☆102 · Jan 17, 2024 · Updated 2 years ago
- Apache ECharts MCP Server ☆63 · Updated this week
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M… ☆28 · Mar 14, 2024 · Updated last year
- A simple WeChat Official Account layout tool based on Dify ☆17 · Jun 27, 2025 · Updated 8 months ago
- ☆23 · Updated this week
- ☆11 · Aug 29, 2025 · Updated 6 months ago
- A community built, component and styles library for Obsidian hosted on Figma. ☆10 · Feb 20, 2024 · Updated 2 years ago
- ☆28 · Dec 4, 2025 · Updated 2 months ago
- Obsidian Plugin for converting PDF files to Markdown ☆19 · Apr 17, 2025 · Updated 10 months ago
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte… ☆28 · Feb 13, 2026 · Updated 2 weeks ago