caiwanxianhust / FasterLLaMA

A LLaMA model inference framework implemented in CUDA C++
48 · Updated 4 months ago

Alternatives and similar repositories for FasterLLaMA:

Users who are interested in FasterLLaMA are comparing it to the libraries listed below.