caiwanxianhust / FasterLLaMA

使用 CUDA C++ 实现的 llama 模型推理框架
45Updated 3 months ago

Alternatives and similar repositories for FasterLLaMA:

Users that are interested in FasterLLaMA are comparing it to the libraries listed below