efeslab / fiddler

[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration
209 · Updated 5 months ago

Alternatives and similar repositories for fiddler:

Users who are interested in fiddler are comparing it to the libraries listed below.