mlc-ai / web-llm
High-performance In-browser LLM Inference Engine
★ 13,554 · Updated this week
Related projects
Alternatives and complementary repositories for web-llm
- State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server! ★ 11,900 · Updated this week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on… ★ 24,466 · Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ★ 36,902 · Updated this week
- Universal LLM Deployment Engine with ML Compilation ★ 19,130 · Updated this week
- A guidance language for controlling large language models. ★ 19,051 · Updated this week
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ★ 9,224 · Updated 2 months ago
- The AI-native open-source embedding database ★ 15,276 · Updated this week
- LlamaIndex is a data framework for your LLM applications ★ 36,534 · Updated this week
- Run any open-source LLMs, such as Llama, Gemma, as OpenAI compatible API endpoint in the cloud. ★ 9,993 · Updated this week
- Drag & drop UI to build your customized LLM flow ★ 31,263 · Updated this week
- Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support. ★ 3,590 · Updated 7 months ago
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ★ 13,615 · Updated this week
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use. ★ 70,518 · Updated this week
- Instruct-tune LLaMA on consumer hardware ★ 18,630 · Updated 3 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ★ 11,442 · Updated this week
- The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language. ★ 21,059 · Updated 4 months ago
- StableLM: Stability AI Language Models ★ 15,831 · Updated 7 months ago
- Inference code for CodeLlama models ★ 16,022 · Updated 2 months ago
- QLoRA: Efficient Finetuning of Quantized LLMs ★ 10,028 · Updated 4 months ago
- OpenLLaMA, a permissively licensed open source reproduction of Meta AI's LLaMA 7B trained on the RedPajama dataset ★ 7,383 · Updated last year
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ★ 12,312 · Updated last month
- Code and documentation to train Stanford's Alpaca models, and generate the data. ★ 29,518 · Updated 3 months ago
- Letta (formerly MemGPT) is a framework for creating LLM services with memory. ★ 12,082 · Updated this week
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index. ★ 7,212 · Updated last month
- ★ 20,393 · Updated this week
- LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath ★ 9,257 · Updated 3 months ago
- 🙌 OpenHands: Code Less, Make More ★ 34,952 · Updated this week
- LLM-based autonomous agent that conducts local and web research on any topic and generates a comprehensive report with citations. ★ 14,782 · Updated this week
- Open source codebase powering the HuggingChat app ★ 7,561 · Updated this week