☆15Aug 15, 2024Updated last year
Alternatives and similar repositories for cocktail
Users that are interested in cocktail are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago
- BATCH: Adaptive Batching for Efficient MachineLearning Serving on Serverless Platforms☆11Aug 7, 2021Updated 4 years ago
- Model-less Inference Serving☆94Nov 4, 2023Updated 2 years ago
- Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving☆37Dec 27, 2019Updated 6 years ago
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆12Jun 30, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆36Nov 18, 2025Updated 5 months ago
- ☆21May 13, 2022Updated 3 years ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆59Aug 21, 2024Updated last year
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆107Dec 24, 2022Updated 3 years ago
- ☆12Sep 25, 2019Updated 6 years ago
- Notes for CMU Deep Learning Systems Course (2022 online public run)☆16Jan 31, 2023Updated 3 years ago
- [NeurIPS 2022] Code for paper "Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound Propagation"☆27Dec 10, 2023Updated 2 years ago
- A version of XRBench-MAESTRO used for MLSys 2023 publication☆27Jun 7, 2023Updated 2 years ago
- MIRIS: Fast Object Track Queries in Video☆17Mar 24, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 操作系统课程实验设计☆12Mar 16, 2025Updated last year
- [SIGCOMM 2023] PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale☆15Jul 1, 2023Updated 2 years ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- Fall Detection using Open Pose (Pose Detection)☆11Jul 23, 2019Updated 6 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 2 years ago
- Secure Inference Resilient Against Malicious Clients☆14May 3, 2022Updated 3 years ago
- ☆16Oct 18, 2023Updated 2 years ago
- Prefix-Aware Attention for LLM Decoding☆35Mar 31, 2026Updated 2 weeks ago
- ☆10Mar 8, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun…☆40Mar 20, 2022Updated 4 years ago
- Graph algorithms to merge two graphs based on stitching.☆12Oct 18, 2019Updated 6 years ago
- LaTeX report template for Nanjing University. 南京大学作业通用简易模板☆19Dec 30, 2025Updated 3 months ago
- Official code repository for "CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics [USENIX ATC 22]"☆18Sep 19, 2024Updated last year
- ☆34Jun 12, 2025Updated 10 months ago
- ☆16Oct 3, 2023Updated 2 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆36Jan 9, 2023Updated 3 years ago
- Simple Github Action that prints the go version.☆15Sep 24, 2022Updated 3 years ago
- HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units (KDD 2020)☆12Jan 25, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Source code for research papers about the semantic communication approach SINFONY☆24Feb 17, 2026Updated 2 months ago
- TensorFlow re-implementation of SQN for weakly supervised segmentation on point clouds.☆13Nov 5, 2021Updated 4 years ago
- TJ 计算机系统实验: 89条指令CPU☆12Nov 11, 2024Updated last year
- 分布式系统期末大作业:模拟一个简单的分布式文件系统☆12Jan 6, 2019Updated 7 years ago
- ☆13Jan 31, 2019Updated 7 years ago
- Athena: A Framework for Defending Machine Learning Systems Against Adversarial Attacks☆44Sep 23, 2021Updated 4 years ago
- Some simple tutorials about python☆12Oct 11, 2020Updated 5 years ago