duowuyms / OpenCATP-LLMLinks
The official repository of ICCV 2025 paper "CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning".
☆16Updated 3 weeks ago
Alternatives and similar repositories for OpenCATP-LLM
Users that are interested in OpenCATP-LLM are comparing it to the libraries listed below
Sorting:
- [ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo☆51Updated 3 months ago
- ☆40Updated last year
- ☆10Updated 2 years ago
- Enabling High Quality Real-Time Communications with Adaptive Frame-Rate (USENIX NSDI 2023)☆21Updated last year
- MetaOpt: Towards efficient heuristic design with quantifiable and confident performance☆21Updated this week
- ☆20Updated 2 years ago
- ☆140Updated last year
- Implementation of paper "MANSY: Generalizing Neural Adaptive Immersive Video Streaming With Ensemble and Representation Learning" and a t…☆20Updated 11 months ago
- ☆101Updated last year
- ☆37Updated 2 years ago
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow"☆73Updated last month
- [ACL'25] Code for ACL'25 paper "IRT-Router: Effective and Interpretable Multi-LLM Routing via Item Response Theory"☆19Updated 9 months ago
- AI model training on heterogeneous, geo-distributed resources☆21Updated this week
- ☆61Updated 11 months ago
- "How to Do Great Research" Course for Ph.D. Students☆131Updated last month
- [TBD] "m4: A Learned Flow-level Network Simulator" by Chenning Li, Anton A. Zabreyko, Om Chabra, Arash Nasr-Esfahany, Kevin Zhao, Pratees…☆15Updated last week
- NetLLM: Adapting Large Language Models for Networking (SIGCOMM 2024) - Official Repository☆172Updated 11 months ago
- Official implementation of MASS: Multi-Agent Simulation Scaling for Portfolio Construction☆153Updated last week
- ☆23Updated last year
- Benchmark for evaluating LLMs in network configuration problems.☆32Updated 8 months ago
- Burstable Cloud Scheduler☆15Updated last year
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank☆64Updated last year
- ☆18Updated last year
- Systems for GenAI☆147Updated 7 months ago
- PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale☆14Updated 2 years ago
- ☆21Updated last year
- Survey on LLM Inference via Search (TMLR 2025)☆14Updated 6 months ago
- ☆42Updated last year
- Artifacts for our SIGCOMM'23 paper Ditto☆15Updated 2 years ago
- PoC for "SpecReason: Fast and Accurate Inference-Time Compute via Speculative Reasoning" [NeurIPS '25]☆59Updated last month