Know Your Enemy To Save Cloud Energy: Energy-Performance Characterization of Machine Learning Serving (HPCA '23)
☆14Jun 20, 2025Updated 9 months ago
Alternatives and similar repositories for power-aware-triton
Users who are interested in power-aware-triton are comparing it to the libraries listed below.
- Intel staging area for llvm.org contribution. Home for Intel LLVM-based projects.☆38Jun 20, 2025Updated 9 months ago
- ☆10Jun 20, 2025Updated 9 months ago
- ☆23Dec 29, 2021Updated 4 years ago
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization☆14Nov 27, 2024Updated last year
- Effect-ive Programming in Go☆39Oct 21, 2025Updated 5 months ago
- A canonical source of GenAI energy benchmark and measurements☆50Nov 29, 2025Updated 4 months ago
- ☆14Jun 20, 2025Updated 9 months ago
- Summary of system papers/frameworks/codes/tools on training or serving large model☆57Dec 17, 2023Updated 2 years ago
- U-Net network architecture for semantic segmentation with EfficientNet-B0 encoder☆15Oct 13, 2020Updated 5 years ago
- Predict the performance of LLM inference services☆23Sep 18, 2025Updated 6 months ago
- PiCAS executor + ROS 2 Real-Time Working Group's reference system☆12Oct 4, 2023Updated 2 years ago
- Financial literacy for computer science students☆32Mar 4, 2024Updated 2 years ago
- ☆17Nov 29, 2023Updated 2 years ago
- Deduplication over dis-aggregated memory for Serverless Computing☆14Mar 21, 2022Updated 4 years ago
- Deep Learning NLP 101☆16Feb 4, 2022Updated 4 years ago
- ☆22Dec 7, 2023Updated 2 years ago
- ☆15Apr 18, 2023Updated 2 years ago
- Sejong Corpus processed data repository☆14Sep 11, 2018Updated 7 years ago
- SmartTLS is the project introduced at the paper "A Case for SmartNIC-accelerated Private Communication" (APNET 20). It accelerates web se…☆17Feb 20, 2025Updated last year
- ☆11Nov 21, 2022Updated 3 years ago
- UT Campus Object Dataset (CODa): Models for 3D Object Detection☆17Feb 4, 2025Updated last year
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Dec 11, 2020Updated 5 years ago
- Relevant Material for SIGCOMM 2020 paper "Contention-Aware Performance Prediction for Virtualized Network Functions"☆17Jul 1, 2020Updated 5 years ago
- Apollo: Online Machine Learning for Performance Portability☆26Aug 27, 2024Updated last year
- MISO: Exploiting Multi-Instance GPU Capability on Multi-Tenant GPU Clusters☆21Apr 21, 2023Updated 2 years ago
- A P.O.S. system built with Python for a company (Rozeriya Enterprise). For the database I just use SQLite for this time becaus…☆21Sep 2, 2024Updated last year
- Places where you can find software-related books, by☆31Aug 29, 2019Updated 6 years ago
- A Generic Resource-Aware Hyperparameter Tuning Execution Engine☆15Jan 8, 2022Updated 4 years ago
- (ICPP '20) ShadowTutor: Distributed Partial Distillation for Mobile Video DNN Inference☆12Jun 22, 2020Updated 5 years ago
- ☆325Jan 22, 2024Updated 2 years ago
- Chunky Loop Analyzer: A Polyhedral Representation Extraction Tool for High Level Programs☆25Dec 19, 2022Updated 3 years ago
- A fast and accurate reuse distance analyzer for multi-threaded applications. It leverages existing hardware features in commodity CPUs.☆21Feb 3, 2023Updated 3 years ago
- ☆102Jul 2, 2023Updated 2 years ago
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆25Feb 24, 2023Updated 3 years ago
- A python based exploit to test out rapid reset attack (CVE-2023-44487)☆20Oct 16, 2023Updated 2 years ago
- ☆26Jan 3, 2024Updated 2 years ago
- [ACM SIGCOMM 2024] "m3: Accurate Flow-Level Performance Estimation using Machine Learning" by Chenning Li, Arash Nasr-Esfahany, Kevin Zha…☆25Oct 2, 2024Updated last year
- Dockerfiles and scripts to build Kathará Docker images.☆33Mar 23, 2026Updated last week
- Releasing the spot availability traces used in "Can't Be Late" paper.☆25Mar 31, 2024Updated last year