mlc-ai / mlc-llmLinks
Universal LLM Deployment Engine with ML Compilation
☆21,691Updated last week
Alternatives and similar repositories for mlc-llm
Users that are interested in mlc-llm are comparing it to the libraries listed below
Sorting:
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆39,287Updated 6 months ago
- Code and documentation to train Stanford's Alpaca models, and generate the data.☆30,236Updated last year
- Instruct-tune LLaMA on consumer hardware☆18,983Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,469Updated 5 months ago
- RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…☆14,177Updated 3 weeks ago
- Inference code for Llama models☆58,968Updated 10 months ago
- The definitive Web UI for local AI, with powerful features and easy setup.☆45,504Updated last week
- JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf☆24,472Updated 4 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,778Updated last year
- Python bindings for llama.cpp☆9,786Updated 3 months ago
- Running large language models on a single GPU for throughput-oriented scenarios.☆9,381Updated last year
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,758Updated last year
- StableLM: Stability AI Language Models☆15,786Updated last year
- Tensor library for machine learning☆13,648Updated last week
- ☆21,983Updated last year
- ☆9,013Updated last year
- High-performance In-browser LLM Inference Engine☆16,885Updated last week
- The simplest way to run LLaMA on your local machine☆13,027Updated last year
- ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.☆9,513Updated 2 months ago
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset☆7,528Updated 2 years ago
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.☆76,936Updated 6 months ago
- LLM inference in C/C++☆90,838Updated this week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,064Updated last year
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆5,001Updated 7 months ago
- High-speed Large Language Model Serving for Local Deployment☆8,420Updated 4 months ago
- Large Language Model Text Generation Inference☆10,684Updated 2 weeks ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,087Updated 5 months ago
- A guidance language for controlling large language models.☆20,971Updated 2 weeks ago
- Locally run an Instruction-Tuned Chat-Style LLM☆10,197Updated 2 years ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,135Updated last year