tangledgroup / llama-cpp-wasm
WebAssembly (Wasm) Build and Bindings for llama.cpp
☆280 · Updated last year
Alternatives and similar repositories for llama-cpp-wasm
Users who are interested in llama-cpp-wasm are comparing it to the libraries listed below.
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference ☆875 · Updated 2 weeks ago
- Vercel and web-llm template to run wasm models directly in the browser. ☆160 · Updated last year
- Run Large-Language Models (LLMs) 🚀 directly in your browser! ☆214 · Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆140 · Updated last year
- JS tokenizer for LLaMA 3 and LLaMA 3.1 ☆115 · Updated last month
- A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenA… ☆217 · Updated last year
- Browser-compatible JS library for running language models ☆231 · Updated 3 years ago
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssembly ☆213 · Updated 4 months ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆103 · Updated 2 years ago
- A JavaScript library that brings vector search and RAG to your browser! ☆141 · Updated last year
- Vector Storage is a vector database that enables semantic similarity searches on text documents in the browser's local storage. It uses O… ☆234 · Updated 9 months ago
- SemanticFinder - frontend-only live semantic search with transformers.js ☆296 · Updated 5 months ago
- JS tokenizer for LLaMA 1 and 2 ☆359 · Updated last year
- Web-optimized vector database (written in Rust). ☆255 · Updated 6 months ago
- Vectra is a local vector database for Node.js with features similar to pinecone but built using local files. ☆518 · Updated 4 months ago
- On-device LLM Inference Powered by X-Bit Quantization ☆267 · Updated last month
- Inference Llama 2 in one file of pure JavaScript (HTML) ☆33 · Updated 3 months ago
- JavaScript bindings for the ggml-js library ☆44 · Updated 5 months ago
- LLM-based code completion engine ☆193 · Updated 7 months ago
- LLM-powered lossless compression tool ☆288 · Updated last year
- WebGPU LLM inference tuned by hand ☆151 · Updated 2 years ago
- Universal LLM Interface ☆105 · Updated 2 months ago
- Tensor library for machine learning ☆274 · Updated 2 years ago
- A fully in-browser privacy solution to make Conversational AI privacy-friendly ☆228 · Updated 10 months ago
- ggml implementation of BERT ☆492 · Updated last year
- Record and stream WAV audio data in the browser across all platforms ☆89 · Updated 10 months ago
- Python bindings for ggml ☆146 · Updated last year
- Library to generate vector embeddings in NodeJS ☆142 · Updated 5 months ago
- 💬 Chatbot web app + HTTP and Websocket endpoints for LLM inference with the Petals client ☆315 · Updated last year
- A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files. ☆51 · Updated last year