tangledgroup / llama-cpp-wasm
WebAssembly (Wasm) Build and Bindings for llama.cpp
☆257 Updated 9 months ago
Alternatives and similar repositories for llama-cpp-wasm:
Users who are interested in llama-cpp-wasm are comparing it to the libraries listed below.
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference ☆690 Updated last week
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆135 Updated 10 months ago
- Run Large-Language Models (LLMs) 🚀 directly in your browser! ☆203 Updated 8 months ago
- A cross-platform browser ML framework. ☆689 Updated 5 months ago
- Vercel and web-llm template to run wasm models directly in the browser. ☆148 Updated last year
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆102 Updated last year
- SemanticFinder - frontend-only live semantic search with transformers.js ☆268 Updated last month
- JS tokenizer for LLaMA 1 and 2 ☆351 Updated 10 months ago
- LLM-powered lossless compression tool ☆280 Updated 8 months ago
- WebGPU LLM inference tuned by hand ☆149 Updated last year
- Python bindings for ggml ☆140 Updated 8 months ago
- Train your own small bitnet model ☆70 Updated 6 months ago
- ggml implementation of BERT ☆488 Updated last year
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssembly ☆150 Updated 4 months ago
- LLM-based code completion engine ☆185 Updated 3 months ago
- JS tokenizer for LLaMA 3 and LLaMA 3.1 ☆108 Updated 2 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs), allowing users to chat with LLM … ☆558 Updated 2 months ago
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙 ☆753 Updated last week
- A JavaScript library that brings vector search and RAG to your browser! ☆116 Updated 8 months ago
- Web-optimized vector database (written in Rust). ☆229 Updated 2 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation ☆809 Updated 5 months ago
- On-device LLM Inference Powered by X-Bit Quantization ☆237 Updated this week
- A simple vector database built on idb ☆85 Updated last year
- Inference Llama 2 in one file of pure JavaScript (HTML) ☆33 Updated 10 months ago
- Simple repo that compiles and runs llama2.c on the Web ☆54 Updated last year
- Tensor library for machine learning ☆278 Updated 2 years ago
- jsgrad is a dependency-free ML library in TypeScript for model inference and training with support for WebGPU and other runtimes. ☆54 Updated 2 weeks ago
- Port of Suno AI's Bark in C/C++ for fast inference ☆53 Updated last year
- A ggml (C++) re-implementation of tortoise-tts ☆178 Updated 8 months ago
- Simple LLM library for JavaScript ☆58 Updated last week