shixiangcap / llama-jniLinks
Android JNI for port of Facebook's LLaMA model in C/C++
☆25Updated 2 years ago
Alternatives and similar repositories for llama-jni
Users that are interested in llama-jni are comparing it to the libraries listed below
Sorting:
- Demonstration of running a native LLM on Android device.☆202Updated this week
- Inference Llama 2 in one file of pure C☆43Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆88Updated last week
- This repository is an implementation of quantizing and converting the Llama3-8B-Instruct model weights and deploying it on Android for on…☆73Updated last year
- 参考自mlc-llm,个人尝试在android手机上部署大模型并运行☆91Updated last year
- A mobile Implementation of llama.cpp☆322Updated last year
- 使用Android手机的CPU推理stable diffusion☆159Updated 2 years ago
- ☆17Updated 11 months ago
- 基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat☆85Updated last year
- llama.cpp tutorial on Android phone☆137Updated 7 months ago
- This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation…☆132Updated last year
- Android app for running transformers locally using LLama.cpp & Whisper.cpp☆27Updated last year
- ☆157Updated 3 weeks ago
- ☆125Updated last year
- 使用Android cpu 运行 RWKV V4 ONNX☆69Updated 2 years ago
- High-speed and easy-use LLM serving framework for local deployment☆137Updated 4 months ago
- Inference RWKV with multiple supported backends.☆70Updated this week
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆68Updated last year
- llm-export can export llm model to onnx.☆334Updated last month
- 支持中文场景的的小语言模型 llama2.c-zh☆150Updated last year
- ☆170Updated 3 weeks ago
- ☆268Updated 2 weeks ago
- LLaMa/RWKV onnx models, quantization and testcase☆367Updated 2 years ago
- ☆65Updated last year
- Awesome Mobile LLMs☆280Updated last week
- a lightweight LLM model inference framework☆744Updated last year
- qwen2 and llama3 cpp implementation☆48Updated last year
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆147Updated last year
- C++ implementation of Qwen-LM☆610Updated last year
- ☆70Updated 2 years ago