shixiangcap / llama-jni

Android JNI for a port of Facebook's LLaMA model in C/C++

☆23

Alternatives and similar repositories for llama-jni

Users who are interested in llama-jni are comparing it to the libraries listed below.


MollySophia / rwkv-qualcomm
Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK
☆72 · Updated last week
yynil / RWKVinLLAMA
☆18 · Updated 5 months ago
MollySophia / rwkv-mobile
Inference RWKV with multiple supported backends.
☆50 · Updated this week
daquexian / faster-rwkv
☆124 · Updated last year
RWKV / rwkv-onnx
A converter and basic tester for RWKV ONNX
☆42 · Updated last year
cryscan / web-rwkv-inspector
☆13 · Updated 6 months ago
JL-er / RWKV-PEFT
☆129 · Updated last week
ZeldaHuang / rwkv-cpp-server
Easily deploy your RWKV model
☆19 · Updated 2 years ago
DakeQQ / Native-LLM-for-Android
Demonstration of running a native LLM on an Android device.
☆146 · Updated 2 weeks ago
RWKV / RWKV-infctx-trainer
RWKV infctx trainer, for training arbitrary context sizes, to 10k and beyond!
☆148 · Updated 10 months ago
JackZeng0208 / llama.cpp-android-tutorial
llama.cpp tutorial for Android phones
☆110 · Updated last month
Abel2076 / json2binidx_tool
☆82 · Updated last year
Durham / RWKV-finetune-script
Script and instructions on how to fine-tune a large RWKV model on your data for the Alpaca dataset
☆31 · Updated 2 years ago
saic-fi / MobileQuant
[EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models
☆63 · Updated 9 months ago
neromous / RWKV-Ouroboros
This project is established for real-time training of the RWKV model.
☆49 · Updated last year
yvonwin / qwen2.cpp
qwen2 and llama3 cpp implementation
☆44 · Updated last year
DataXujing / Qwen1.5-0.5b-chat-android
Deploying a large language model on Android phones with MNN-llm: Qwen1.5-0.5B-Chat
☆79 · Updated last year
ssbuild / rwkv_finetuning
RWKV fine-tuning
☆36 · Updated last year
yuunnn-w / RWKV_Pytorch
This is an inference framework for the RWKV large language model implemented purely in native PyTorch. The official native implementation…
☆128 · Updated 11 months ago
mlc-ai / relax
☆158 · Updated last week
TroyTzou / mlc-llm-android
Based on mlc-llm; a personal attempt to deploy and run large models on Android phones
☆86 · Updated 10 months ago
jiaohuix / ppllama
The Paddle implementation of Meta's LLaMA.
☆45 · Updated 2 years ago
Manuel030 / llama2.c-android
Inference Llama 2 in one file of pure C
☆42 · Updated last year
clcarwin / alpaca-weight
Train LLaMA with LoRA on one 4090 and merge the LoRA weights to work as Stanford Alpaca.
☆51 · Updated 2 years ago
mzbac / qlora-inference-multi-gpu
☆12 · Updated 2 years ago
OpenMOSE / RWKV-LM-RLHF
Reinforcement Learning Toolkit for RWKV (v6, v7, ARWKV). Distillation, SFT, RLHF (DPO, ORPO), infinite context training, aligning. Exploring the…
☆44 · Updated last month
lx200916 / ChatBotApp
☆35 · Updated 2 months ago
wejoncy / QLLM
A general 2-8 bit quantization toolbox with GPTQ/AWQ/HQQ/VPTQ, with easy export to onnx/onnx-runtime.
☆172 · Updated 2 months ago
leonsama / web-rwkv-realweb
☆10 · Updated 2 weeks ago
Triang-jyed-driung / RWKV-LM-RLHF-DPO
Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.
☆11 · Updated last year