☆15Aug 15, 2024Updated last year
Alternatives and similar repositories for cocktail
Users that are interested in cocktail are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- BATCH: Adaptive Batching for Efficient MachineLearning Serving on Serverless Platforms☆11Aug 7, 2021Updated 4 years ago
- Model-less Inference Serving☆94Nov 4, 2023Updated 2 years ago
- [MobiCom '23] AccuMO: Accuracy-Centric Multitask Offloading in Edge-Assisted Mobile Augmented Reality☆18Oct 8, 2023Updated 2 years ago
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆12Jun 30, 2023Updated 2 years ago
- ☆21May 13, 2022Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- DISB is a new DNN inference serving benchmark with diverse workloads and models, as well as real-world traces.☆58Aug 21, 2024Updated last year
- REEF is a GPU-accelerated DNN inference serving system that enables instant kernel preemption and biased concurrent execution in GPU sche…☆108Dec 24, 2022Updated 3 years ago
- ☆12Sep 25, 2019Updated 6 years ago
- Project for CS101016 and CS100160, Tongji University. Use Verilog HDL to build a CPU.☆10Mar 20, 2021Updated 5 years ago
- [NeurIPS 2022] Code for paper "Efficiently Computing Local Lipschitz Constants of Neural Networks via Bound Propagation"☆28Dec 10, 2023Updated 2 years ago
- A version of XRBench-MAESTRO used for MLSys 2023 publication☆26Jun 7, 2023Updated 3 years ago
- [SIGCOMM 2023] PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale☆15Jul 1, 2023Updated 2 years ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- FTPipe and related pipeline model parallelism research.☆44May 16, 2023Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Implementation of Proximal Policy Optimization (PPO) for continuous action space (`Pendulum-v1` from gym) using tensorflow2.x and pytorch…☆11Aug 8, 2022Updated 3 years ago
- Secure Inference Resilient Against Malicious Clients☆14May 3, 2022Updated 4 years ago
- 一个简单的 C++ Linux 控制台(西北大学操作系统作业)☆11Jun 4, 2021Updated 5 years ago
- ☆10Mar 8, 2025Updated last year
- [ICLR 2022] "Sparsity Winning Twice: Better Robust Generalization from More Efficient Training" by Tianlong Chen*, Zhenyu Zhang*, Pengjun…☆40Mar 20, 2022Updated 4 years ago
- Prefix-Aware Attention for LLM Decoding☆39May 26, 2026Updated 2 weeks ago
- Crawled Wikipedia Tables with Passages☆14Aug 19, 2021Updated 4 years ago
- Graph algorithms to merge two graphs based on stitching.☆12Oct 18, 2019Updated 6 years ago
- Official code repository for "CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics [USENIX ATC 22]"☆18Sep 19, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The source code of INFless,a native serverless platform for AI inference.☆46Oct 10, 2022Updated 3 years ago
- ☆16Oct 3, 2023Updated 2 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆36Jan 9, 2023Updated 3 years ago
- HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units (KDD 2020)☆12Jan 25, 2021Updated 5 years ago
- TensorFlow re-implementation of SQN for weakly supervised segmentation on point clouds.☆14Apr 16, 2026Updated last month
- TJ 计算机系统实验: 89条指令CPU☆12Nov 11, 2024Updated last year
- 分布式系统期末大作业:模拟一个简单的分布式文件系统☆12Jan 6, 2019Updated 7 years ago
- ☆13Jan 31, 2019Updated 7 years ago
- Athena: A Framework for Defending Machine Learning Systems Against Adversarial Attacks☆44Sep 23, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 一个操作系统的实现-课程设计报告☆14Aug 28, 2016Updated 9 years ago
- Source code of IPA, https://escholarship.org/uc/item/2p0805dq☆12Jun 27, 2024Updated last year
- Pose refinement with differentiable rendering☆10Dec 27, 2020Updated 5 years ago
- ☆30Jun 23, 2025Updated 11 months ago
- 高级语言程序设计课程用机器人☆19Oct 6, 2021Updated 4 years ago
- This CG provides a safe space to assess use cases, modularization (role, scope, outcomes), existing and emerging AI architectures, progre…☆30Oct 9, 2025Updated 7 months ago
- [DATE 2023] Pipe-BD: Pipelined Parallel Blockwise Distillation☆12Jul 13, 2023Updated 2 years ago