tangledgroup / llama-cpp-wasm
WebAssembly (Wasm) Build and Bindings for llama.cpp
★273 · Updated last year
Alternatives and similar repositories for llama-cpp-wasm
Users who are interested in llama-cpp-wasm are comparing it to the libraries listed below.
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference ★785 · Updated last week
- Run Large-Language Models (LLMs) directly in your browser! ★212 · Updated 10 months ago
- Vercel and web-llm template to run wasm models directly in the browser. ★160 · Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ★139 · Updated last year
- JavaScript bindings for the ggml-js library ★43 · Updated 4 months ago
- JS tokenizer for LLaMA 3 and LLaMA 3.1 ★116 · Updated last week
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ★102 · Updated 2 years ago
- Inference Llama 2 in one file of pure JavaScript(HTML) ★33 · Updated 2 months ago
- Browser-compatible JS library for running language models ★228 · Updated 2 years ago
- A cross-platform browser ML framework. ★709 · Updated 8 months ago
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssembly ★194 · Updated 2 months ago
- WebGPU LLM inference tuned by hand ★151 · Updated 2 years ago
- A JavaScript library that brings vector search and RAG to your browser! ★135 · Updated 11 months ago
- JS tokenizer for LLaMA 1 and 2 ★355 · Updated last year
- LLM-based code completion engine ★193 · Updated 6 months ago
- SemanticFinder - frontend-only live semantic search with transformers.js ★287 · Updated 4 months ago
- Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses O… ★232 · Updated 7 months ago
- A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenA… ★216 · Updated last year
- Vectra is a local vector database for Node.js with features similar to pinecone but built using local files. ★503 · Updated 2 months ago
- A fully in-browser privacy solution to make Conversational AI privacy-friendly ★227 · Updated 9 months ago
- Web-optimized vector database (written in Rust). ★249 · Updated 5 months ago
- JavaScript implementation of LiteLLM. ★130 · Updated 4 months ago
- Library to generate vector embeddings in NodeJS ★136 · Updated 3 months ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files. ★48 · Updated last year
- LLM-powered lossless compression tool ★285 · Updated 11 months ago
- Tensor library for machine learning ★275 · Updated 2 years ago
- Universal LLM Interface ★88 · Updated last month
- Record and stream WAV audio data in the browser across all platforms ★86 · Updated 8 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc. ★66 · Updated last year
- 💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client ★314 · Updated last year