tangledgroup / llama-cpp-wasmLinks
WebAssembly (Wasm) Build and Bindings for llama.cpp
β269Updated 11 months ago
Alternatives and similar repositories for llama-cpp-wasm
Users that are interested in llama-cpp-wasm are comparing it to the libraries listed below
Sorting:
- Run Large-Language Models (LLMs) π directly in your browser!β209Updated 9 months ago
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inferenceβ750Updated 2 weeks ago
- Vercel and web-llm template to run wasm models directly in the browser.β152Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfacesβ137Updated 11 months ago
- A cross-platform browser ML framework.β702Updated 7 months ago
- JS tokenizer for LLaMA 3 and LLaMA 3.1β110Updated 3 months ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPUβ102Updated 2 years ago
- JS tokenizer for LLaMA 1 and 2β353Updated 11 months ago
- WebGPU LLM inference tuned by handβ151Updated 2 years ago
- Inference Llama 2 in one file of pure JavaScript(HTML)β33Updated last month
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssemblyβ178Updated last month
- On-device LLM Inference Powered by X-Bit Quantizationβ249Updated last week
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM β¦β573Updated 4 months ago
- LLM-powered lossless compression toolβ285Updated 10 months ago
- A ggml (C++) re-implementation of tortoise-ttsβ186Updated 10 months ago
- Record and stream WAV audio data in the browser across all platformsβ83Updated 7 months ago
- JavaScript bindings for the ggml-js libraryβ43Updated 3 months ago
- A fast batching API to serve LLM modelsβ183Updated last year
- Falcon LLM ggml framework with CPU and GPU supportβ246Updated last year
- WebContainers, except it's a million times easier to useβ83Updated 2 years ago
- Library to generate vector embeddings in NodeJSβ128Updated 2 months ago
- A mobile Implementation of llama.cppβ312Updated last year
- A JavaScript library that brings vector search and RAG to your browser!β128Updated 10 months ago
- The RunPod worker template for serving our large language model endpoints. Powered by vLLM.β322Updated last week
- A simple vector database built on idbβ90Updated last year
- A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenAβ¦β209Updated last year
- LLM-based code completion engineβ194Updated 5 months ago
- Run LLMs in the Browser with MLC / WebLLM β¨β135Updated 8 months ago
- Tensor library for machine learningβ276Updated 2 years ago
- ggml implementation of BERTβ493Updated last year