SJTU-IPADS / PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
7,965Updated 2 months ago

Related projects

Alternatives and complementary repositories for PowerInfer