WebAssembly (Wasm) Build and Bindings for llama.cpp
☆293Jul 23, 2024Updated last year
Alternatives and similar repositories for llama-cpp-wasm
Users who are interested in llama-cpp-wasm are comparing it to the libraries listed below.
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference☆1,126Jun 17, 2026Updated last week
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆15Jun 16, 2026Updated 2 weeks ago
- ☆19Feb 7, 2024Updated 2 years ago
- Simple Tool Caller for llama.cpp☆11Aug 12, 2024Updated last year
- A collection of experiments related to LLM inference with llama.cpp/mlx☆40Jun 18, 2026Updated last week
- Run AI models locally on your machine with node.js bindings for llama.cpp. Enforce a JSON schema on the model output on the generation le…☆2,109Jun 21, 2026Updated last week
- Run Large-Language Models (LLMs) 🚀 directly in your browser!☆230Sep 8, 2024Updated last year
- Tensor library for machine learning☆273Apr 23, 2023Updated 3 years ago
- Inference of Mamba, Mamba2 and Mamba3 models in pure C☆202Mar 18, 2026Updated 3 months ago
- High-level, optionally asynchronous Rust bindings to llama.cpp☆246Jun 5, 2024Updated 2 years ago
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆25Sep 1, 2025Updated 10 months ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆52Jul 30, 2024Updated last year
- A live multiplayer trivia game where users can bid for the subject of the next question☆29Jan 9, 2026Updated 5 months ago
- Testing LLM reasoning abilities with family relationship quizzes.☆63Jan 28, 2025Updated last year
- ☆12Jun 27, 2024Updated 2 years ago
- Llama.cui is a small llama.cpp-based chat application for Node.js☆20Jul 10, 2025Updated 11 months ago
- High-performance In-browser LLM Inference Engine☆18,279Jun 9, 2026Updated 3 weeks ago
- Thin wrapper around GGML to make life easier☆47Nov 5, 2025Updated 7 months ago
- Web browser version of StarCoder.cpp☆47Jul 30, 2023Updated 2 years ago
- Kitten TTS web demo using transformers.js☆98Aug 13, 2025Updated 10 months ago
- Parallel wasm Barnes-Hut t-SNE implementation written in Rust.☆23Apr 18, 2026Updated 2 months ago
- Examples for using the SiLLM framework for training and running Large Language Models (LLMs) on Apple Silicon☆16May 8, 2025Updated last year
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆865Nov 16, 2024Updated last year
- Minimalist stable-diffusion desktop application with only one executable file, written in Golang (no Python).☆18Apr 16, 2025Updated last year
- iterate quickly with llama.cpp hot reloading. use the llama.cpp bindings with bun.sh☆51Oct 30, 2023Updated 2 years ago
- Structured inference with Llama 2 in your browser☆52Nov 1, 2024Updated last year
- The first contrastive learning work for Active Learning.☆18Mar 8, 2023Updated 3 years ago
- A De/CompressionStream for Bun☆14Jul 9, 2025Updated 11 months ago
- A Rust library for using stable diffusion functions when the Wasi is being executed on WasmEdge.☆13Oct 31, 2024Updated last year
- Dart binding for llama.cpp☆296Jun 18, 2026Updated last week
- Demos for AI assistants using NLUX, Next.js, React, and Node.js☆17Jun 24, 2024Updated 2 years ago
- A cross-platform browser ML framework.☆767May 26, 2026Updated last month
- a pseudo-repo for discussion on Unix-like software in JS+Wasm ... and also about *browser* Python, Lua, Tcl.☆11Jan 25, 2023Updated 3 years ago
- Inference Llama 2 in one file of pure C#☆26Sep 3, 2023Updated 2 years ago
- State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!☆16,152Updated this week
- Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++☆6,398Updated this week
- Distributed LLM inference. Connect home devices into a powerful cluster to accelerate LLM inference. More devices means faster inference.☆2,964Apr 14, 2026Updated 2 months ago
- ☆49Mar 9, 2025Updated last year
- LLama.cpp rust bindings☆422Jun 27, 2024Updated 2 years ago