efeslab / fiddler

[ICLR'25] Fast Inference of MoE Models with CPU-GPU Orchestration
201Updated 4 months ago

Alternatives and similar repositories for fiddler:

Users that are interested in fiddler are comparing it to the libraries listed below