efeslab / fiddler

Fast Inference of MoE Models with CPU-GPU Orchestration
171Updated this week

Related projects

Alternatives and complementary repositories for fiddler