ShortcutsBench: A Large-Scale Real-World Benchmark for API-Based Agents
☆110Jun 24, 2025Updated 9 months ago
Alternatives and similar repositories for ShortcutsBench
Users that are interested in ShortcutsBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of MASS: Multi-Agent Simulation Scaling for Portfolio Construction☆171Feb 9, 2026Updated last month
- This repository contains the official implementation of the paper "GL2GPU: Accelerating WebGL Applications via Dynamic API Translation to…☆40Jun 10, 2025Updated 9 months ago
- MobiSys#114☆23Aug 17, 2023Updated 2 years ago
- [IJCAI 2023] official implementation of the paper SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation☆34Jun 20, 2023Updated 2 years ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆53Nov 5, 2024Updated last year
- This is the official implementation of VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Mode…☆16Mar 4, 2025Updated last year
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Aug 4, 2022Updated 3 years ago
- Berkeley Function Calling Leaderboard (BFCL) with Chinese-Language Evaluation☆23Apr 6, 2025Updated 11 months ago
- ☆14Jun 3, 2025Updated 9 months ago
- ☆48Jun 2, 2022Updated 3 years ago
- iOS shortcut / Docker endpoint to access LLM models on iOS☆22Jan 13, 2024Updated 2 years ago
- ☆15Jul 13, 2021Updated 4 years ago
- 计算语言学22-23学年秋季学期 课程大作业baseline实现☆38Dec 8, 2022Updated 3 years ago
- ☆22May 23, 2025Updated 10 months ago
- ☆14Aug 3, 2024Updated last year
- ☆24Nov 16, 2023Updated 2 years ago
- DELT: Data Efficacy for Language Model Training☆45Feb 12, 2026Updated last month
- ☆212Jan 17, 2024Updated 2 years ago
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆19Mar 2, 2025Updated last year
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆28Mar 14, 2024Updated 2 years ago
- ☆10Mar 20, 2020Updated 6 years ago
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆39Jan 16, 2026Updated 2 months ago
- An Open Source Machine Learning Framework for Everyone☆16Aug 27, 2020Updated 5 years ago
- Real time high precision network monitor☆10Feb 24, 2019Updated 7 years ago
- ☆21Dec 5, 2022Updated 3 years ago
- [ICLR 2024] Towards Robust Multi-Modal Reasoning via Model Selection☆15Mar 7, 2024Updated 2 years ago
- Optimizing compiler for SysY (C subset)☆43Apr 14, 2024Updated last year
- Allow you walk when others run☆10Dec 6, 2019Updated 6 years ago
- The working repository of the IDSA Rulebook Working Group☆24Mar 4, 2026Updated 2 weeks ago
- 服务器 GPU 监控程序,当 GPU 属性满足预设条件时通过微信发送提示消息☆34Aug 10, 2021Updated 4 years ago
- The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'☆24May 20, 2025Updated 10 months ago
- ☆13Jun 7, 2022Updated 3 years ago
- ☆15Oct 8, 2024Updated last year
- ☆15May 7, 2024Updated last year
- ☆13Feb 5, 2022Updated 4 years ago
- Cross-domain word representation learning☆10May 23, 2015Updated 10 years ago
- combine source code files into single prompt to chat with your repository☆14May 15, 2024Updated last year
- Bayesian Inverse Reinforcement Learning with simple environments