rogerallen / llama2.cu

Inference Llama 2 in one file of pure C & one file with CUDA
16 · Updated last year

Related projects

Alternatives and complementary repositories for llama2.cu