mlc-ai / binary-mlc-llm-libs
☆259Updated 2 months ago
Alternatives and similar repositories for binary-mlc-llm-libs
Users who are interested in binary-mlc-llm-libs are comparing it to the repositories listed below.
- A mobile Implementation of llama.cpp☆320Updated last year
- llama.cpp tutorial on Android phone☆133Updated 5 months ago
- IRIS is an android app for interfacing with GGUF / llama.cpp models locally.☆237Updated 8 months ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆130Updated 2 years ago
- A mobile Implementation of llama.cpp☆26Updated 2 years ago
- MiniCPM on Android platform.☆635Updated 7 months ago
- [ICLR-2025-SLLM Spotlight 🔥] MobiLlama: Small Language Model tailored for edge devices☆663Updated 5 months ago
- Tool to download models from Huggingface Hub and convert them to GGML/GGUF for llama.cpp☆160Updated 5 months ago
- Locally run an Instruction-Tuned Chat-Style LLM (Android/Linux/Windows/Mac)☆262Updated 2 years ago
- automatically quant GGUF models☆214Updated last week
- High-speed and easy-use LLM serving framework for local deployment☆130Updated 2 months ago
- Running any GGUF SLMs/LLMs locally, on-device in Android☆546Updated last month
- Falcon LLM ggml framework with CPU and GPU support☆247Updated last year
- Making offline AI models accessible to all types of edge devices.☆142Updated last year
- C++ implementation for 💫StarCoder☆455Updated 2 years ago
- AMD related optimizations for transformer models☆92Updated last week
- This reference can be used with any existing OpenAI integrated apps to run with TRT-LLM inference locally on GeForce GPU on Windows inste…☆126Updated last year
- INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model☆1,550Updated 7 months ago
- AI for all: Build the large graph of the language models☆276Updated last year
- My development fork of llama.cpp. For now working on RK3588 NPU and Tenstorrent backend☆107Updated last week
- An Ollama client for Android!☆86Updated last year
- Inference Llama 2 in one file of pure C☆43Updated 2 years ago
- MiniCPM on iOS.☆67Updated 7 months ago
- Extension for using alternative GitHub Copilot (StarCoder API) in VSCode☆100Updated last year
- Octogen is an Open-Source Code Interpreter Agent Framework☆257Updated last year
- Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.☆569Updated last year
- StarCoder server for huggingface-vscode custom endpoint☆175Updated last year
- Demonstration of running a native LLM on Android device.☆191Updated last month
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)☆727Updated this week
- ggml implementation of BERT☆494Updated last year