mlc-ai / web-llm
High-performance In-browser LLM Inference Engine
★16,571 · Updated 3 weeks ago
Alternatives and similar repositories for web-llm
Users interested in web-llm are comparing it to the libraries listed below.
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ★9,803 · Updated last year
- State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server! ★14,647 · Updated last week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ★39,144 · Updated 4 months ago
- A guidance language for controlling large language models. ★20,807 · Updated last week
- Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support. ★3,683 · Updated last year
- Open source codebase powering the HuggingChat app ★9,187 · Updated last week
- LlamaIndex is the leading framework for building LLM-powered agents over your data. ★44,575 · Updated this week
- Run any open-source LLM, such as DeepSeek or Llama, as an OpenAI-compatible API endpoint in the cloud. ★11,817 · Updated 2 weeks ago
- Universal LLM Deployment Engine with ML Compilation ★21,423 · Updated last week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ★29,564 · Updated this week
- LLMs built upon Evol Instruct: WizardLM, WizardCoder, WizardMath ★9,458 · Updated 4 months ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ★12,451 · Updated this week
- Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time. ★18,569 · Updated this week
- Large Language Model Text Generation Inference ★10,539 · Updated 2 weeks ago
- Open-source search and retrieval database for AI applications. ★23,665 · Updated this week
- Structured Outputs ★12,648 · Updated this week
- Tensor library for machine learning ★13,222 · Updated last week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ★8,763 · Updated last year
- Instruct-tune LLaMA on consumer hardware ★18,961 · Updated last year
- ★21,860 · Updated 11 months ago
- An LLM playground you can run on your laptop ★6,364 · Updated last week
- 🦜🔗 Build context-aware reasoning applications ★116,508 · Updated this week
- Run GPT model on the browser with WebGPU. An implementation of GPT inference in less than ~1500 lines of vanilla Javascript. ★3,735 · Updated last year
- 🦜🔗 Build context-aware reasoning applications 🦜🔗 ★15,791 · Updated this week
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index. ★7,783 · Updated 2 months ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models. ★16,097 · Updated last week
- QLoRA: Efficient Finetuning of Quantized LLMs ★10,679 · Updated last year
- StableLM: Stability AI Language Models ★15,803 · Updated last year
- Convert documents to structured data effortlessly. Unstructured is an open-source ETL solution for transforming complex documents into clean… ★12,796 · Updated last week
- Distribute and run LLMs with a single file. ★23,173 · Updated 3 months ago