EachSheep / ShortcutsBenchLinks
ShortcutsBench: A Large-Scale Real-World Benchmark for API-Based Agents
☆104Updated 2 months ago
Alternatives and similar repositories for ShortcutsBench
Users that are interested in ShortcutsBench are comparing it to the libraries listed below
Sorting:
- Official implementation of MASS: Multi-Agent Simulation Scaling for Portfolio Construction☆147Updated 3 months ago
- Survey Paper List - Efficient LLM and Foundation Models☆255Updated 11 months ago
- Simple extension on vLLM to help you speed up reasoning model without training.☆181Updated 3 months ago
- A Stream-based LLM Agent Framework for Continuous Context Sensing and Sharing☆41Updated 9 months ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆31Updated 6 months ago
- Reproducing R1 for Code with Reliable Rewards☆251Updated 4 months ago
- a curated list of high-quality papers on resource-efficient LLMs 🌱☆134Updated 5 months ago
- ☆80Updated 5 months ago
- ☆27Updated 6 months ago
- ☆33Updated 5 months ago
- PyTorch implementation of paper "Response Length Perception and Sequence Scheduling: An LLM-Empowered LLM Inference Pipeline".☆90Updated 2 years ago
- Paper list for Personal LLM Agents☆406Updated last year
- A Comprehensive Benchmark for Software Development.☆113Updated last year
- Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**☆201Updated 6 months ago
- ☆128Updated 2 weeks ago
- [ICLR 2025] SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration☆52Updated 6 months ago
- A Comprehensive Survey on Long Context Language Modeling☆180Updated last month
- Repo for EmbedLLM: Learning Compact Representations of Large Language Models☆16Updated 6 months ago
- This is the official repo of "QuickLLaMA: Query-aware Inference Acceleration for Large Language Models"☆54Updated last year
- ☆100Updated last year
- This repository serves as a comprehensive survey of LLM development, featuring numerous research papers along with their corresponding co…☆187Updated last month
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆190Updated this week
- awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.☆209Updated last week
- ☆84Updated 2 weeks ago
- [OSDI'24] Serving LLM-based Applications Efficiently with Semantic Variable☆181Updated 11 months ago
- This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.☆153Updated 2 weeks ago
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆167Updated 2 months ago
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank☆58Updated 10 months ago
- Multi-Candidate Speculative Decoding☆36Updated last year
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆84Updated 9 months ago