aahouzi / llama2-chatbot-cpu
A LLaMA2-7b chatbot with memory that runs on CPU, optimized using smooth quantization, 4-bit quantization, or Intel® Extension for PyTorch with bfloat16.
15 · Feb 27, 2024 · Updated 2 years ago

Alternatives and similar repositories for llama2-chatbot-cpu

Users who are interested in llama2-chatbot-cpu are comparing it to the libraries listed below.
