☆15Aug 15, 2024Updated last year
Alternatives and similar repositories for cocktail
Users that are interested in cocktail are comparing it to the libraries listed below.
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago
- BATCH: Adaptive Batching for Efficient Machine Learning Serving on Serverless Platforms☆11Aug 7, 2021Updated 4 years ago
- Model-less Inference Serving☆94Nov 4, 2023Updated 2 years ago
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality☆18Oct 8, 2023Updated 2 years ago
- Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving☆37Dec 27, 2019Updated 6 years ago
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆12Jun 30, 2023Updated 2 years ago
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆36Nov 18, 2025Updated 4 months ago
- ☆21May 13, 2022Updated 3 years ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆58Aug 21, 2024Updated last year
- Notes for CMU Deep Learning Systems Course (2022 online public run)☆16Jan 31, 2023Updated 3 years ago
- [NeurIPS 2022] Code for paper "Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound Propagation"☆27Dec 10, 2023Updated 2 years ago
- A version of XRBench-MAESTRO used for MLSys 2023 publication☆26Jun 7, 2023Updated 2 years ago
- [SIGCOMM 2023] PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale☆15Jul 1, 2023Updated 2 years ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 2 years ago
- Secure Inference Resilient Against Malicious Clients☆15May 3, 2022Updated 3 years ago
- ☆10Jun 18, 2024Updated last year
- Prefix-Aware Attention for LLM Decoding☆33Jan 23, 2026Updated 2 months ago
- A simple C++ Linux console (operating systems assignment, Northwest University)☆11Jun 4, 2021Updated 4 years ago
- ☆10Mar 8, 2025Updated last year
- [ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun…☆40Mar 20, 2022Updated 4 years ago
- Graph algorithms to merge two graphs based on stitching.☆12Oct 18, 2019Updated 6 years ago
- Official code repository for "CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics [USENIX ATC 22]"☆18Sep 19, 2024Updated last year
- ☆34Jun 12, 2025Updated 9 months ago
- ☆16Oct 3, 2023Updated 2 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆36Jan 9, 2023Updated 3 years ago
- Belief-state planning for POMDPs using learned approximations☆23Jan 21, 2025Updated last year
- HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units (KDD 2020)☆12Jan 25, 2021Updated 5 years ago
- Source code for research papers about the semantic communication approach SINFONY☆24Feb 17, 2026Updated last month
- Distributed systems final project: simulating a simple distributed file system☆12Jan 6, 2019Updated 7 years ago
- Athena: A Framework for Defending Machine Learning Systems Against Adversarial Attacks☆44Sep 23, 2021Updated 4 years ago
- Some simple tutorials about Python☆12Oct 11, 2020Updated 5 years ago
- Implementation of an operating system: course design report☆14Aug 28, 2016Updated 9 years ago
- ☆12Apr 9, 2025Updated 11 months ago
- ☆15Jul 25, 2023Updated 2 years ago
- Edge Video Services (EVS) is a Microsoft platform for developing video analytics solutions that can be deployed across the edge and the c…☆30Jul 5, 2022Updated 3 years ago
- Pose refinement with differentiable rendering☆10Dec 27, 2020Updated 5 years ago
- iGniter, an interference-aware GPU resource provisioning framework for achieving predictable performance of DNN inference in the cloud.☆39Jun 11, 2024Updated last year
- This CG provides a safe space to assess use cases, modularization (role, scope, outcomes), existing and emerging AI architectures, progre…☆24Oct 9, 2025Updated 5 months ago