casys-kaist / CoVALinks
Official code repository for "CoVA: Exploiting Compressed-Domain Analysis to Accelerate Video Analytics [USENIX ATC 22]"
☆16Updated 9 months ago
Alternatives and similar repositories for CoVA
Users that are interested in CoVA are comparing it to the libraries listed below
Sorting:
- This is a list of awesome edgeAI inference related papers.☆95Updated last year
- ☆45Updated 2 years ago
- Source code and datasets for Ekya, a system for continuous learning on the edge.☆106Updated 3 years ago
- ☆52Updated 2 weeks ago
- [ACM EuroSys '23] Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access☆56Updated last year
- Source code for the paper: "A Latency-Predictable Multi-Dimensional Optimization Framework forDNN-driven Autonomous Systems"☆22Updated 4 years ago
- ☆28Updated 2 years ago
- PacketGame: Multi-Stream Packet Gating for Concurrent Video Inference at Scale☆11Updated last year
- Model-less Inference Serving☆88Updated last year
- [DATE 2023] Pipe-BD: Pipelined Parallel Blockwise Distillation☆11Updated last year
- ☆21Updated 2 years ago
- ☆56Updated 3 years ago
- [HPCA'24] Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System☆46Updated last year
- LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale☆120Updated last week
- Adaptive Model Streaming for real-time video inference on edge devices☆41Updated 3 years ago
- ☆25Updated last year
- A version of XRBench-MAESTRO used for MLSys 2023 publication☆23Updated 2 years ago
- Experimental deep learning framework written in Rust☆15Updated 2 years ago
- zTT: Learning-based DVFS with Zero Thermal Throttling for Mobile Devices [MobiSys'21] - Artifact Evaluation☆25Updated 4 years ago
- MobiSys#114☆21Updated last year
- FilterForward: Scaling Video Analytics on Constrained Edge Nodes☆28Updated 5 years ago
- To deploy Transformer models in CV to mobile devices.☆18Updated 3 years ago
- Multi-Instance-GPU profiling tool☆59Updated 2 years ago
- Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling☆13Updated last year
- ☆23Updated 2 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆55Updated 2 years ago
- Official Repo for "LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization"☆34Updated last year
- LLM serving cluster simulator☆106Updated last year
- ☆21Updated last year
- Source code for Jellyfish, a soft real-time inference serving system☆13Updated 2 years ago