lx200916 / ChatBotApp
☆14Updated this week
Related projects
Alternatives and complementary repositories for ChatBotApp
- Run RWKV5 or RWKV6 inference with the Qualcomm AI Engine Direct SDK☆38Updated this week
- Fast Multimodal LLM on Mobile Devices☆537Updated this week
- Based on mlc-llm: a personal attempt to deploy and run large models on Android phones☆67Updated 3 months ago
- ☆123Updated 11 months ago
- Deploying a large language model on Android phones with MNN-llm: Qwen1.5-0.5B-Chat☆50Updated 7 months ago
- Code for ACM MobiCom 2024 paper "FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices"☆41Updated last month
- The open-source project for "Mandheling: Mixed-Precision On-Device DNN Training with DSP Offloading"[MobiCom'2022]☆18Updated 2 years ago
- Demonstration of running a native LLM on an Android device.☆75Updated this week
- C++ implementation of Qwen2 and Llama 3☆34Updated 5 months ago
- A converter for llama2.c legacy models to ncnn models.☆82Updated 11 months ago
- Explore LLM model deployment based on AXera's AI chips☆54Updated last week
- Tiny C++11 GPT-2 inference implementation from scratch☆48Updated 10 months ago
- LLM deployment project based on ONNX.☆26Updated last month
- Awesome list for LLM quantization☆127Updated this week
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆137Updated 2 months ago
- GPT2⚡NCNN⚡Chinese dialogue⚡x86⚡Android☆79Updated 2 years ago
- mllm-npu: training multimodal large language models on Ascend NPUs☆83Updated 2 months ago
- Llama 2 inference☆37Updated last year
- [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a V…☆325Updated this week
- Compare different hardware platforms via the Roofline Model for LLM inference tasks.☆76Updated 8 months ago
- ☆24Updated last year
- Export Llama to ONNX☆98Updated 5 months ago
- ☆25Updated 11 months ago
- ☆19Updated last month
- stable diffusion using mnn☆64Updated last year
- Simplify large (>2GB) ONNX models☆45Updated 8 months ago
- ☆28Updated 4 months ago
- Run generative AI models on Sophgo BM1684X☆126Updated this week
- Summary of system papers/frameworks/codes/tools on training or serving large models☆56Updated 11 months ago
- 📒A small curated list of Awesome Diffusion Inference Papers with codes.☆97Updated last week