mlc-ai / binary-mlc-llm-libs
☆205 Updated this week
Related projects
Alternatives and complementary repositories for binary-mlc-llm-libs
- A mobile Implementation of llama.cpp ☆291 Updated 9 months ago
- llama.cpp tutorial on Android phone ☆76 Updated 3 months ago
- A mobile Implementation of llama.cpp ☆25 Updated last year
- ☆148 Updated 3 months ago
- automatically quant GGUF models ☆137 Updated this week
- Python bindings for ggml ☆132 Updated 2 months ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge prompts ☆111 Updated last year
- Local LLM App ☆134 Updated last month
- Falcon LLM ggml framework with CPU and GPU support ☆244 Updated 9 months ago
- C++ implementation for 💫StarCoder ☆445 Updated last year
- MobiLlama : Small Language Model tailored for edge devices ☆593 Updated 8 months ago
- Visual Studio Code extension for WizardCoder ☆144 Updated last year
- EfficientQAT: Efficient Quantization-Aware Training for Large Language Models ☆222 Updated last month
- Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations. ☆311 Updated this week
- Octogen is an Open-Source Code Interpreter Agent Framework ☆252 Updated 3 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆124 Updated last year
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI. ☆112 Updated last year
- 1.58-bit LLaMa model ☆79 Updated 7 months ago
- A multimodal, function calling powered LLM webui. ☆205 Updated last month
- LLaVA server (llama.cpp). ☆177 Updated last year
- Run LLMs in the Browser with MLC / WebLLM ✨ ☆81 Updated last month
- An innovative library for efficient LLM inference via low-bit quantization ☆348 Updated 2 months ago
- ☆103 Updated 7 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around… ☆56 Updated 2 months ago
- Extend the original llama.cpp repo to support redpajama model. ☆117 Updated 2 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes… ☆143 Updated last year
- ☆501 Updated last week
- A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ, and export to onnx/onnx-runtime easily. ☆148 Updated last month
- Like ChatGPT's voice conversations with an AI, but entirely offline/private/trade-secret-friendly, using local AI models such as LLama 2 … ☆136 Updated 2 months ago
- WebAssembly (Wasm) Build and Bindings for llama.cpp ☆208 Updated 3 months ago