duowuyms / OpenCATP-LLM
The official repository of ICCV 2025 paper "CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning".
☆18 Updated 2 months ago
Alternatives and similar repositories for OpenCATP-LLM
Users who are interested in OpenCATP-LLM are comparing it to the repositories listed below.
- [ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo ☆63 Updated 6 months ago
- PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale ☆14 Updated 2 years ago
- ☆11 Updated 2 years ago
- ☆43 Updated last year
- Artifacts for our SIGCOMM'23 paper Ditto ☆15 Updated 2 years ago
- AI model training on heterogeneous, geo-distributed resources ☆34 Updated 2 months ago
- Enabling High Quality Real-Time Communications with Adaptive Frame-Rate (USENIX NSDI 2023) ☆23 Updated 2 years ago
- Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and … ☆36 Updated 5 months ago
- ☆21 Updated 3 years ago
- Burstable Cloud Scheduler ☆16 Updated last year
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank ☆69 Updated last year
- ☆49 Updated last year
- Implementation of paper "MANSY: Generalizing Neural Adaptive Immersive Video Streaming With Ensemble and Representation Learning" and a t… ☆20 Updated last year
- ☆64 Updated last year
- ☆150 Updated last year
- ☆22 Updated 2 years ago
- ☆37 Updated 2 years ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow" ☆77 Updated 3 months ago
- ☆22 Updated last year
- Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23] ☆46 Updated 3 years ago
- GPU-accelerated LLM Training Simulator ☆51 Updated 7 months ago
- ☆174 Updated last year
- A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems ☆238 Updated last week
- Primo: Practical Learning-Augmented Systems with Interpretable Models ☆19 Updated 2 years ago
- ☆17 Updated last year
- "How to Do Great Research" Course for Ph.D. Students ☆137 Updated 3 months ago
- NetLLM: Adapting Large Language Models for Networking (SIGCOMM 2024) - Official Repository ☆190 Updated last year
- ☆102 Updated 2 years ago
- ☆51 Updated 9 months ago
- Official implementation of MASS: Multi-Agent Simulation Scaling for Portfolio Construction ☆164 Updated 2 months ago