mlc-ai / web-llm

High-performance In-browser LLM Inference Engine

☆15,962

Alternatives and similar repositories for web-llm

Users who are interested in web-llm are comparing it to the libraries listed below.

mlc-ai / web-stable-diffusion
Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
☆3,672 Updated last year
mlc-ai / mlc-llm
Universal LLM Deployment Engine with ML Compilation
☆20,983 Updated last week
lm-sys / FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
☆38,882 Updated last month
bentoml / OpenLLM
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.
☆11,570 Updated last week
openlm-research / open_llama
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
☆7,509 Updated 2 years ago
huggingface / chat-ui
Open source codebase powering the HuggingChat app
☆8,986 Updated last week
tloen / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆18,926 Updated 11 months ago
huggingface / transformers.js
State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!
☆14,139 Updated this week
run-llama / llama_index
LlamaIndex is the leading framework for building LLM-powered agents over your data.
☆43,107 Updated this week
meta-llama / codellama
Inference code for CodeLlama models
☆16,352 Updated 11 months ago
ggml-org / ggml
Tensor library for machine learning
☆12,831 Updated last week
chroma-core / chroma
the AI-native open-source embedding database
☆21,160 Updated this week
LAION-AI / Open-Assistant
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamical…
☆37,417 Updated 11 months ago
nlpxucan / WizardLM
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
☆9,430 Updated last month
0hq / WebGPT
Run GPT model on the browser with WebGPU. An implementation of GPT inference in less than ~1500 lines of vanilla Javascript.
☆3,715 Updated last year
Stability-AI / StableLM
StableLM: Stability AI Language Models
☆15,828 Updated last year
mudler / LocalAI
The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on…
☆33,969 Updated this week
artidoro / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆10,561 Updated last year
zilliztech / GPTCache
Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
☆7,637 Updated last week
antimatter15 / alpaca.cpp
Locally run an Instruction-Tuned Chat-Style LLM
☆10,225 Updated 2 years ago
bigscience-workshop / petals
🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
☆9,726 Updated 10 months ago
meta-llama / llama
Inference code for Llama models
☆58,524 Updated 5 months ago
nat / openplayground
An LLM playground you can run on your laptop
☆6,357 Updated last week
bigcode-project / starcoder
Home of StarCoder: fine-tuning & inference!
☆7,430 Updated last year
yoheinakajima / babyagi
☆21,656 Updated 8 months ago
cocktailpeanut / dalai
The simplest way to run LLaMA on your local machine
☆13,071 Updated last year
microsoft / JARVIS
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
☆24,225 Updated 9 months ago
BlinkDL / RWKV-LM
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable)…
☆13,796 Updated last week
BlinkDL / ChatRWKV
ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.
☆9,502 Updated 2 months ago
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆52,682 Updated this week