NSTiwari / Llama3-on-Mobile

This repository is an implementation of quantizing and converting the Llama3-8B-Instruct model weights and deploying it on Android for on-device inference.
60Updated 5 months ago

Related projects

Alternatives and complementary repositories for Llama3-on-Mobile