duowuyms / OpenCATP-LLM
The official repository of the ICCV 2025 paper "CATP-LLM: Empowering Large Language Models for Cost-Aware Tool Planning".
☆16 Updated 2 months ago
Alternatives and similar repositories for OpenCATP-LLM
Users interested in OpenCATP-LLM are comparing it to the repositories listed below.
- PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale ☆13 Updated 2 years ago
- [ASPLOS'25] Towards End-to-End Optimization of LLM-based Applications with Ayo ☆45 Updated 2 months ago
- ☆37 Updated 2 years ago
- GPU-accelerated LLM Training Simulator ☆36 Updated 3 months ago
- ☆10 Updated 2 years ago
- MetaOpt: Towards efficient heuristic design with quantifiable and confident performance ☆20 Updated last month
- Artifacts for our SIGCOMM'23 paper Ditto ☆15 Updated 2 years ago
- ☆20 Updated 2 years ago
- ☆136 Updated last year
- "How to Do Great Research" Course for Ph.D. Students ☆130 Updated 2 years ago
- Enabling High Quality Real-Time Communications with Adaptive Frame-Rate (USENIX NSDI 2023) ☆21 Updated last year
- Codebase for Teal (SIGCOMM 2023) ☆53 Updated last year
- ☆164 Updated last year
- ☆41 Updated last year
- NetLLM: Adapting Large Language Models for Networking (SIGCOMM 2024) - Official Repository ☆166 Updated 10 months ago
- [TBD] "m4: A Learned Flow-level Network Simulator" by Chenning Li, Anton A. Zabreyko, Om Chabra, Arash Nasr-Esfahany, Kevin Zhao, Pratees… ☆15 Updated this week
- ☆60 Updated 10 months ago
- [NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank ☆60 Updated 11 months ago
- An evaluation framework for data center traffic engineering. ☆12 Updated last year
- ☆34 Updated last year
- ☆17 Updated last year
- THC: Accelerating Distributed Deep Learning Using Tensor Homomorphic Compression ☆19 Updated last year
- ☆23 Updated last year
- Open-source implementation for "Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow" ☆67 Updated this week
- Data repository of NAssim ☆27 Updated 3 years ago
- ☆22 Updated last year
- ☆68 Updated 3 years ago
- [ACM SIGCOMM 2024] "m3: Accurate Flow-Level Performance Estimation using Machine Learning" by Chenning Li, Arash Nasr-Esfahany, Kevin Zha… ☆24 Updated last year
- Repository for Reinforcement learning based bandwidth estimation challenge ☆36 Updated last year
- ☆50 Updated last month