☆15Aug 15, 2024Updated last year
Alternatives and similar repositories for cocktail
Users that are interested in cocktail are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Source code for Jellyfish, a soft real-time inference serving system☆15Dec 20, 2022Updated 3 years ago
- BATCH: Adaptive Batching for Efficient MachineLearning Serving on Serverless Platforms☆11Aug 7, 2021Updated 4 years ago
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality☆18Oct 8, 2023Updated 2 years ago
- Exploiting Cloud Services for Cost-Effective, SLO-Aware Machine Learning Inference Serving☆37Dec 27, 2019Updated 6 years ago
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆12Jun 30, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆36Nov 18, 2025Updated 5 months ago
- ☆21May 13, 2022Updated 3 years ago
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆59Aug 21, 2024Updated last year
- ☆12Sep 25, 2019Updated 6 years ago
- Project for CS101016 and CS100160, Tongji University. Use Verilog HDL to build a CPU.☆10Mar 20, 2021Updated 5 years ago
- Notes for CMU Deep Learning Systems Course (2022 online public run)☆16Jan 31, 2023Updated 3 years ago
- [NeurIPS 2022] Code for paper "Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound Propagation"☆28Dec 10, 2023Updated 2 years ago
- A version of XRBench-MAESTRO used for MLSys 2023 publication☆27Jun 7, 2023Updated 2 years ago
- MIRIS: Fast Object Track Queries in Video☆17Mar 24, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 操作系统课程实验设计☆12Mar 16, 2025Updated last year
- [SIGCOMM 2023] PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale☆15Jul 1, 2023Updated 2 years ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 2 years ago
- Implementation of Proximal Policy Optimization (PPO) for continuous action space (`Pendulum-v1` from gym) using tensorflow2.x and pytorch…☆11Aug 8, 2022Updated 3 years ago
- Secure Inference Resilient Against Malicious Clients☆14May 3, 2022Updated 4 years ago
- ☆16Oct 18, 2023Updated 2 years ago
- ☆10Mar 8, 2025Updated last year
- [ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun…☆40Mar 20, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Prefix-Aware Attention for LLM Decoding☆37Mar 31, 2026Updated last month
- LaTeX report template for Nanjing University. 南京大学作业通用简易模板☆20Dec 30, 2025Updated 4 months ago
- Graph algorithms to merge two graphs based on stitching.☆12Oct 18, 2019Updated 6 years ago
- Official code repository for "CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics [USENIX ATC 22]"☆18Sep 19, 2024Updated last year
- The source code of INFless,a native serverless platform for AI inference.☆46Oct 10, 2022Updated 3 years ago
- 阅读指南☆13Jul 13, 2020Updated 5 years ago
- ☆34Jun 12, 2025Updated 10 months ago
- ☆16Oct 3, 2023Updated 2 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆36Jan 9, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Security Attacks on LLM-based Code Completion Tools (AAAI 2025)☆22Dec 31, 2025Updated 4 months ago
- HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units (KDD 2020)☆12Jan 25, 2021Updated 5 years ago
- Source code for research papers about the semantic communication approach SINFONY☆24Updated this week
- TJ 计算机系统实验: 89条指令CPU☆12Nov 11, 2024Updated last year
- 分布式系统期末大作业:模拟一个简单的分布式文件系统☆12Jan 6, 2019Updated 7 years ago
- Athena: A Framework for Defending Machine Learning Systems Against Adversarial Attacks☆44Sep 23, 2021Updated 4 years ago
- 一个操作系统的实现-课程设计报告☆14Aug 28, 2016Updated 9 years ago