lx200916 / ChatBotApp
☆23Updated 2 months ago
Alternatives and similar repositories for ChatBotApp:
Users that are interested in ChatBotApp are comparing it to the libraries listed below
- Inference RWKV v5, v6 and (WIP) v7 with Qualcomm AI Engine Direct SDK☆49Updated last week
- 基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat☆63Updated 9 months ago
- llm deploy project based onnx.☆30Updated 3 months ago
- Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"☆47Updated last week
- ☆124Updated last year
- 参考自mlc-llm,个人尝试在android手机上部署大模型并运行☆71Updated 5 months ago
- A converter for llama2.c legacy models to ncnn models.☆86Updated last year
- stable diffusion using mnn☆65Updated last year
- ☆38Updated 2 years ago
- Demonstration of running a native LLM on Android device.☆106Updated this week
- ☆24Updated 2 years ago
- A light llama-like llm inference framework based on the triton kernel.☆78Updated 3 weeks ago
- LLaMa/RWKV onnx models, quantization and testcase☆356Updated last year
- ggml学习笔记,ggml是一个机器学习的推理框架☆14Updated 10 months ago
- NVIDIA TensorRT Hackathon 2023复赛选题:通义千问Qwen-7B用TensorRT-LLM模型搭建及优化☆41Updated last year
- Run generative AI models in sophgo BM1684X☆155Updated this week
- Snapdragon Neural Processing Engine (SNPE) SDKThe Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerate…☆33Updated 2 years ago
- A Toolkit to Help Optimize Onnx Model☆106Updated this week
- Standalone Flash Attention v2 kernel without libtorch dependency☆99Updated 4 months ago
- Tiny C++11 GPT-2 inference implementation from scratch☆53Updated last month
- 使用 CUDA C++ 实现的 llama 模型推理框架☆44Updated 2 months ago
- Inference deployment of the llama3☆11Updated 9 months ago
- Explore LLM model deployment based on AXera's AI chips☆71Updated last month
- ☢️ TensorRT 2023复赛——基于TensorRT-LLM的Llama模型推断加速优化☆45Updated last year
- DragGan in NCNN with c++☆49Updated last year
- Common libraries for PPL projects☆29Updated 3 months ago
- Large Language Model Onnx Inference Framework☆28Updated 2 weeks ago
- Efficient inference of large language models.☆145Updated last month
- llm-export can export llm model to onnx.☆257Updated last week
- 本项目是一个通过文字生成图片的项目,基于开源模型Stable Diffusion V1.5生成可以在手机的CPU和NPU上运行的模型,包括其配套的模型运行框架。☆135Updated 10 months ago