lx200916 / ChatBotApp
☆33Updated last month
Alternatives and similar repositories for ChatBotApp
Users that are interested in ChatBotApp are comparing it to the libraries listed below
Sorting:
- High-speed and easy-use LLM serving framework for local deployment☆104Updated last month
- try to build a fully open-source ggml-hexagon backend for llama.cpp on Android phone equipped with Qualcomm's Hexagon NPU, details can b…☆19Updated this week
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆37Updated this week
- Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"☆54Updated 3 months ago
- ☆36Updated this week
- llm deploy project based onnx.☆36Updated 7 months ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆64Updated last week
- [EMNLP Findings 2024] MobileQuant: Mobile-friendly Quantization for On-device Language Models☆56Updated 7 months ago
- My study note for mlsys☆15Updated 6 months ago
- LLM inference in C/C++☆41Updated this week
- Fast Multimodal LLM on Mobile Devices☆861Updated last month
- ☆36Updated 3 weeks ago
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆19Updated 2 years ago
- ☆156Updated last month
- Awesome Mobile LLMs☆184Updated last month
- 🛠 A lite C++ toolkit: contains 100+ Awesome AI models, support MNN, NCNN, TNN, ONNXRuntime and TensorRT. 🎉🎉☆14Updated last month
- LLM inference in C/C++☆16Updated 2 weeks ago
- ☆58Updated this week
- 基于MNN-llm的安卓手机部署大语言模型:Qwen1.5-0.5B-Chat☆77Updated last year
- ☆24Updated 2 years ago
- ☆11Updated 2 months ago
- Penn CIS 5650 (GPU Programming and Architecture) Final Project☆30Updated last year
- 分层解耦的深度学习推理引擎☆73Updated 3 months ago
- Summary of some awesome work for optimizing LLM inference☆73Updated last month
- ☆124Updated last year
- Summary of the Specs of Commonly Used GPUs for Training and Inference of LLM☆39Updated 2 months ago
- mperf是一个面向移动/嵌入式平台的算子性能调优工具箱☆185Updated last year
- ☆65Updated 6 months ago
- ☆32Updated 9 months ago
- RISCV C and Triton AI-Benchmark☆16Updated 6 months ago