ml-energy / zeusLinks

Measure and optimize the energy consumption of your AI applications!

☆276

Alternatives and similar repositories for zeus

Users that are interested in zeus are comparing it to the libraries listed below

Sorting:

ml-energy / leaderboard
How much energy do GenAI models consume?
☆46Updated 2 months ago
SymbioticLab / Oobleck
A resilient distributed training framework
☆94Updated last year
microsoft / vidur
A large-scale simulation framework for LLM inference
☆414Updated 2 weeks ago
facebookresearch / ACT
ACT An Architectural Carbon Modeling Tool for Designing Sustainable Computer Systems
☆40Updated 2 weeks ago
Azure / msccl
Microsoft Collective Communication Library
☆65Updated 8 months ago
Hsword / SpotServe
SpotServe: Serving Generative Large Language Models on Preemptible Instances
☆126Updated last year
facebookresearch / HolisticTraceAnalysis
A library to analyze PyTorch traces.
☆402Updated this week
EfficientMoE / MoE-Infinity
PyTorch library for cost-effective, fast and easy serving of MoE models.
☆215Updated last month
project-etalon / etalon
LLM Serving Performance Evaluation Harness
☆79Updated 5 months ago
HuaizhengZhang / MIGProfiler
Multi-Instance-GPU profiling tool
☆60Updated 2 years ago
facebookresearch / CarbonExplorer
Carbon Explorer helps evaluating solutions make datacenters operate on renewable energy.
☆81Updated 9 months ago
microsoft / sarathi-serve
A low-latency & high-throughput serving engine for LLMs
☆400Updated 2 months ago
facebookresearch / param
PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…
☆149Updated this week
NVIDIA / kvpress
LLM KV cache compression made easy
☆566Updated last week
HPMLL / BurstGPT
A ChatGPT(GPT-3.5) & GPT-4 Workload Trace to Optimize LLM Serving Systems
☆191Updated 2 weeks ago
microsoft / varuna
☆251Updated last year
eth-easl / orion
An interference-aware scheduler for fine-grained GPU sharing
☆143Updated 6 months ago
mobiusml / gemlite
Fast low-bit matmul kernels in Triton
☆339Updated this week
Shenggan / awesome-distributed-ml
A curated list of awesome projects and papers for distributed training or inference
☆241Updated 10 months ago
efeslab / fiddler
[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration
☆225Updated 8 months ago
WukLab / preble
Stateful LLM Serving
☆79Updated 4 months ago
CentML / DeepView.Profile
🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
☆64Updated 6 months ago
stanford-mast / INFaaS
Model-less Inference Serving
☆90Updated last year
tyler-griggs / melange-release
☆47Updated last year
geoffxy / habitat
🔮 Execution time predictions for deep neural network training iterations across different GPUs.
☆63Updated 2 years ago
hao-ai-lab / MuxServe
☆67Updated last year
ppl-ai / pplx-kernels
Perplexity GPU Kernels
☆418Updated 3 weeks ago
ServerlessLLM / ServerlessLLM
Serverless LLM Serving for Everyone.
☆515Updated last week
uccl-project / uccl
Ultra and Unified CCL
☆459Updated this week
UMass-LIDS / Proteus
Proteus: A High-Throughput Inference-Serving System with Accuracy Scaling
☆13Updated last year