NSTiwari / Llama3-on-Mobile

This repository quantizes and converts the Llama3-8B-Instruct model weights and deploys the model on Android for on-device inference.
70 · Updated last year

Alternatives and similar repositories for Llama3-on-Mobile

Users who are interested in Llama3-on-Mobile are comparing it to the libraries listed below.
