☆15Aug 15, 2024Updated last year
Alternatives and similar repositories for cocktail
Users that are interested in cocktail are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago
- BATCH: Adaptive Batching for Efficient MachineLearning Serving on Serverless Platforms☆11Aug 7, 2021Updated 4 years ago
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality☆18Oct 8, 2023Updated 2 years ago
- Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving☆37Dec 27, 2019Updated 6 years ago
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆12Jun 30, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆21May 13, 2022Updated 4 years ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆58Aug 21, 2024Updated last year
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆37Nov 18, 2025Updated 7 months ago
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆108Dec 24, 2022Updated 3 years ago
- ☆12Sep 25, 2019Updated 6 years ago
- [NeurIPS 2022] Code for paper "Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound Propagation"☆28Dec 10, 2023Updated 2 years ago
- A version of XRBench-MAESTRO used for MLSys 2023 publication☆27Jun 7, 2023Updated 3 years ago
- MIRIS: Fast Object Track Queries in Video☆17Mar 24, 2023Updated 3 years ago
- [SIGCOMM 2023] PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale☆15Jul 1, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 3 years ago
- Secure Inference Resilient Against Malicious Clients☆14May 3, 2022Updated 4 years ago
- ☆11May 25, 2026Updated last month
- ☆10Mar 8, 2025Updated last year
- [ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun…☆40Mar 20, 2022Updated 4 years ago
- Implementation of SPW and DPW for Monte Carlo Tree Search in Continuous action/state space☆20Oct 3, 2023Updated 2 years ago
- Prefix-Aware Attention for LLM Decoding☆40May 26, 2026Updated last month
- Crawled Wikipedia Tables with Passages☆14Aug 19, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Graph algorithms to merge two graphs based on stitching.☆12Oct 18, 2019Updated 6 years ago
- Official code repository for "CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics [USENIX ATC 22]"☆18Sep 19, 2024Updated last year
- 阅读指南☆13Jul 13, 2020Updated 5 years ago
- The source code of INFless,a native serverless platform for AI inference.☆46Oct 10, 2022Updated 3 years ago
- ☆34Jun 12, 2025Updated last year
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆36Jan 9, 2023Updated 3 years ago
- Simple Github Action that prints the go version.☆15Sep 24, 2022Updated 3 years ago
- Buy Books Online☆10Jan 11, 2021Updated 5 years ago
- Source code for research papers about the semantic communication approach SINFONY☆25Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- TensorFlow re-implementation of SQN for weakly supervised segmentation on point clouds.☆14Apr 16, 2026Updated 2 months ago
- TJ 计算机系统实验: 89条指令CPU☆12Nov 11, 2024Updated last year
- 分布式系统期末大作业:模拟一个简单的分布式文件系统☆12Jan 6, 2019Updated 7 years ago
- ☆13Jan 31, 2019Updated 7 years ago
- 一个操作系统的实现-课程设计报告☆14Aug 28, 2016Updated 9 years ago
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq☆12Jun 27, 2024Updated 2 years ago
- ☆13Apr 9, 2025Updated last year