mlc-ai / web-llmLinks
High-performance In-browser LLM Inference Engine
☆15,547Updated 3 weeks ago
Alternatives and similar repositories for web-llm
Users that are interested in web-llm are comparing it to the libraries listed below
Sorting:
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆11,283Updated last week
- Inference code for CodeLlama models☆16,312Updated 9 months ago
- State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!☆13,674Updated this week
- Universal LLM Deployment Engine with ML Compilation☆20,685Updated 3 weeks ago
- Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.☆3,660Updated last year
- Large Language Model Text Generation Inference☆10,155Updated this week
- Tensor library for machine learning☆12,591Updated this week
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als…☆17,355Updated this week
- LLM inference in C/C++☆80,984Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆38,670Updated last week
- Open source codebase powering the HuggingChat app☆8,748Updated this week
- A guidance language for controlling large language models.☆20,238Updated last week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆23,128Updated this week
- the AI-native open-source embedding database☆20,090Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,101Updated this week
- OpenChat: Advancing Open-source Language Models with Imperfect Data☆5,351Updated 8 months ago
- Home of StarCoder: fine-tuning & inference!☆7,416Updated last year
- LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath☆9,411Updated 9 months ago
- Run GPT model on the browser with WebGPU. An implementation of GPT inference in less than ~1500 lines of vanilla Javascript.☆3,705Updated last year
- Instruct-tune LLaMA on consumer hardware☆18,904Updated 10 months ago
- Official inference library for Mistral models☆10,262Updated 2 months ago
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset☆7,493Updated last year
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆22,643Updated 9 months ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆6,898Updated 10 months ago
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index.☆7,566Updated 8 months ago
- ☆21,498Updated 6 months ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆9,634Updated 8 months ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆14,662Updated 2 months ago
- 🥷 Run AI-agents with an API☆5,850Updated last month
- DSPy: The framework for programming—not prompting—language models☆24,538Updated this week