lx200916 / ChatBotApp
☆28Updated 4 months ago
Alternatives and similar repositories for ChatBotApp:
Users that are interested in ChatBotApp are comparing it to the libraries listed below
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆60Updated this week
- High-speed and easy-use LLM serving framework for local deployment☆94Updated last week
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆56Updated 6 months ago
- ☆45Updated this week
- Demonstration of running a native LLM on Android device.☆126Updated this week
- Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"☆51Updated 2 months ago
- Snapdragon Neural Processing Engine (SNPE) SDKThe Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerate…☆34Updated 2 years ago
- ☆32Updated 3 weeks ago
- ☆124Updated last year
- A converter for llama2.c legacy models to ncnn models.☆87Updated last year
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆26Updated this week
- llm deploy project based onnx.☆35Updated 5 months ago
- 参考自mlc-llm,个人尝试在android手机上部署大模型并运行☆85Updated 7 months ago
- ☆23Updated last month
- Awesome Mobile LLMs☆156Updated last week
- Compare different hardware platforms via the Roofline Model for LLM inference tasks.☆93Updated last year
- ☆61Updated 4 months ago
- LLM inference in C/C++☆16Updated this week
- Recording models☆13Updated last year
- ☆40Updated 2 years ago
- 本项目是一个通过文字生成图片的项目,基于开源模型Stable Diffusion V1.5生成可以在手机的CPU和NPU上运行的模型,包括其配套的模型运行框架。☆147Updated last year
- Explore LLM model deployment based on AXera's AI chips☆87Updated last week
- ☆157Updated last week
- ncnn android yolov8 realtime detection, segmentation, pose estimation, classification and obb☆75Updated 2 months ago
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆18Updated 2 years ago
- Fast Multimodal LLM on Mobile Devices☆781Updated last week
- LLM inference in C/C++☆34Updated last week
- Efficient inference of large language models.☆146Updated 3 months ago
- mperf是一个面向移动/嵌入式平台的算子性能调优工具箱☆179Updated last year
- QQQ is an innovative and hardware-optimized W4A8 quantization solution for LLMs.☆109Updated 2 weeks ago