TroyTzou / mlc-llm-android
Based on mlc-llm; a personal attempt to deploy and run large models on Android phones.
☆62 Updated 3 months ago
Related projects
Alternatives and complementary repositories for mlc-llm-android
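- A text-to-image project that, based on the open-source Stable Diffusion V1.5 model, produces models that can run on a phone's CPU and NPU, together with the accompanying model runtime framework. ☆105 Updated 7 months ago
- Run RWKV V4 ONNX on an Android CPU ☆65 Updated last year
- Deploying a large language model on Android phones with MNN-llm: Qwen1.5-0.5B-Chat ☆48 Updated 7 months ago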
- Demonstration of running a native LLM on Android device. ☆69 Updated last week
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including … ☆135 Updated 2 months ago
- Stable Diffusion inference on an Android phone's CPU ☆141 Updated 11 months ago
- llm-export can export LLM models to ONNX. ☆226 Updated last week
- Run ChatGLM2-6B on BM1684X ☆48 Updated 8 months ago
- Stable Diffusion using MNN ☆62 Updated last year
- ☆123 Updated 10 months ago
- GPT2⚡NCNN⚡Chinese dialogue⚡x86⚡Android ☆79 Updated 2 years ago
- CLIP⚡NCNN⚡Natural-language image search⚡Search images by text⚡x86⚡Android ☆220 Updated last year
- llama2.c-zh, a small language model that supports Chinese-language scenarios ☆143 Updated 8 months ago
- A lightweight LLM inference framework ☆699 Updated 7 months ago
- MiniCPM on Android platform. ☆557 Updated 7 months ago
- Port of Facebook's LLaMA model in C/C++ ☆72 Updated this week
- C++ implementation of Qwen-LM ☆551 Updated 10 months ago
- VITS deployment on Android ☆318 Updated 7 months ago
- Inference rwkv5 or rwkv6 with Qualcomm AI Engine Direct SDK ☆36 Updated this week
- Run generative AI models in sophgo BM1684X ☆122 Updated this week
- Efficient inference of large language models. ☆143 Updated 3 weeks ago
- ☆82 Updated last year
- ☆49 Updated 8 months ago
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi… ☆113 Updated 2 weeks ago
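- Phi3 Chinese repository ☆319 Updated 6 months ago
- Real-time STT connected to the OpenAI API / Zhipu AI (streaming LLM) and GPT-SOVITS / Edge-TTS; services are called across networks through a web page to achieve real-time conversation. ☆248 Updated 4 months ago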
- [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a V… ☆315 Updated this week
- ☆433 Updated last year
- qwen2 and llama3 cpp implementation ☆34 Updated 5 months ago
- Phi2-Chinese-0.2B: train your own small Chinese Phi2 chat model from scratch, with support for LangChain to load a local knowledge base for retrieval-augmented generation (RAG). ☆483 Updated 4 months ago