ArtificialZeng / baichuan-speedup

纯c++的全平台llm加速库,支持python调用,支持baichuan, glm, llama, moss基座,手机端流畅运行chatglm-6B级模型单卡可达10000+token / s,
45Updated last year

Alternatives and similar repositories for baichuan-speedup:

Users that are interested in baichuan-speedup are comparing it to the libraries listed below