mlc-ai / mlc-llm

Universal LLM Deployment Engine with ML Compilation

☆21,691

Alternatives and similar repositories for mlc-llm

Users who are interested in mlc-llm are comparing it to the libraries listed below.

lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆39,287 Updated 6 months ago
tatsu-lab / stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
☆30,236 Updated last year
tloen / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆18,983 Updated last year
nlpxucan / WizardLM
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
☆9,469 Updated 5 months ago
BlinkDL / RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…
☆14,177 Updated 3 weeks ago
meta-llama / llama
Inference code for Llama models
☆58,968 Updated 10 months ago
oobabooga / text-generation-webui
The definitive Web UI for local AI, with powerful features and easy setup.
☆45,504 Updated last week
microsoft / JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
☆24,472 Updated 4 months ago
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,778 Updated last year
abetlen / llama-cpp-python
Python bindings for llama.cpp
☆9,786 Updated 3 months ago
FMInference / FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
☆9,381 Updated last year
Vision-CAIR / MiniGPT-4
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
☆25,758 Updated last year
Stability-AI / StableLM
StableLM: Stability AI Language Models
☆15,786 Updated last year
ggml-org / ggml
Tensor library for machine learning
☆13,648 Updated last week
yoheinakajima / babyagi
☆21,983 Updated last year
togethercomputer / OpenChatKit
☆9,013 Updated last year
mlc-ai / web-llm
High-performance In-browser LLM Inference Engine
☆16,885 Updated last week
cocktailpeanut / dalai
The simplest way to run LLaMA on your local machine
☆13,027 Updated last year
BlinkDL / ChatRWKV
ChatRWKV is like ChatGPT but open source and powered by the RWKV (100% RNN) language model.
☆9,513 Updated 2 months ago
openlm-research / open_llama
OpenLLaMA, a permissively licensed open-source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
☆7,528 Updated 2 years ago
nomic-ai / gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
☆76,936 Updated 6 months ago
ggml-org / llama.cpp
LLM inference in C/C++
☆90,838 Updated this week
haotian-liu / LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
☆24,064 Updated last year
AutoGPTQ / AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
☆5,001 Updated 7 months ago
SJTU-IPADS / PowerInfer
High-speed Large Language Model Serving for Local Deployment
☆8,420 Updated 4 months ago
huggingface / text-generation-inference
Large Language Model Text Generation Inference
☆10,684 Updated 2 weeks ago
Lightning-AI / lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…
☆6,087 Updated 5 months ago
guidance-ai / guidance
A guidance language for controlling large language models.
☆20,971 Updated 2 weeks ago
antimatter15 / alpaca.cpp
Locally run an Instruction-Tuned Chat-Style LLM
☆10,197 Updated 2 years ago
mit-han-lab / streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆7,135 Updated last year