mlc-ai / binary-mlc-llm-libs
☆228Updated 3 months ago
Alternatives and similar repositories for binary-mlc-llm-libs:
Users that are interested in binary-mlc-llm-libs are comparing it to the libraries listed below
- A mobile Implementation of llama.cpp☆303Updated last year
- llama.cpp tutorial on Android phone☆94Updated 6 months ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆117Updated last year
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆184Updated 2 weeks ago
- An unsupervised model merging algorithm for Transformers-based language models.☆105Updated 9 months ago
- MiniCPM on Android platform.☆625Updated 10 months ago
- A multimodal, function calling powered LLM webui.☆214Updated 4 months ago
- Locally run an Instruction-Tuned Chat-Style LLM (Android/Linux/Windows/Mac)☆265Updated last year
- automatically quant GGUF models☆154Updated this week
- A mobile Implementation of llama.cpp☆25Updated last year
- 使用Android手机的CPU推理stable diffusion☆146Updated last year
- Cortex.Tensorrt-LLM is a C++ inference library that can be loaded by any server at runtime. It submodules NVIDIA’s TensorRT-LLM for GPU a…☆42Updated 4 months ago
- Inference of Mamba models in pure C☆183Updated 11 months ago
- Train your own small bitnet model☆64Updated 4 months ago
- llama.cpp fork with additional SOTA quants and improved performance☆155Updated this week
- A benchmark for emotional intelligence in large language models☆224Updated 6 months ago
- ☆117Updated 10 months ago
- ☆527Updated 3 months ago
- ☆18Updated last month
- Octogen is an Open-Source Code Interpreter Agent Framework☆254Updated 6 months ago
- ☆60Updated 10 months ago
- LLM inference in C/C++☆14Updated this week
- ☆152Updated 7 months ago
- MiniCPM on iOS.☆65Updated 8 months ago
- Visual Studio Code extension for WizardCoder☆145Updated last year
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)☆514Updated this week
- 支持中文场景的的小语言模型 llama2.c-zh☆145Updated 11 months ago
- MobiLlama : Small Language Model tailored for edge devices☆620Updated 11 months ago
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆371Updated last year