caiwanxianhust / FasterLLaMA

A LLaMA model inference framework implemented in CUDA C++
24 · Updated 2 weeks ago

Related projects

Alternatives and complementary repositories for FasterLLaMA