ngxson / wllama
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
★653 · Updated last month
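In practice, on-browser inference with wllama follows a simple pattern: load the WebAssembly build, fetch a GGUF model over HTTP, and request a completion. Below is a minimal sketch, assuming the `@wllama/wllama` npm package and the `loadModelFromUrl`/`createCompletion` calls shown in the project's examples; the asset paths, model URL, and sampling option names are illustrative and may differ across versions.

```ts
// Minimal on-browser inference sketch (illustrative; check the wllama docs
// for the exact asset paths and options in your version).
import { Wllama } from '@wllama/wllama';

// Map wasm asset names to the URLs they are served from (hypothetical paths).
const CONFIG_PATHS = {
  'single-thread/wllama.wasm': '/esm/single-thread/wllama.wasm',
  'multi-thread/wllama.wasm': '/esm/multi-thread/wllama.wasm',
};

async function main(): Promise<void> {
  const wllama = new Wllama(CONFIG_PATHS);

  // Any GGUF model small enough for the browser; placeholder URL.
  await wllama.loadModelFromUrl('https://example.com/models/tiny-model.gguf');

  // Request a completion with a short generation budget.
  const output = await wllama.createCompletion('Once upon a time,', {
    nPredict: 50,
    sampling: { temp: 0.5, top_k: 40, top_p: 0.9 },
  });
  console.log(output);
}

main();
```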
Alternatives and similar repositories for wllama:
Users interested in wllama are comparing it to the libraries listed below
- WebAssembly (Wasm) Build and Bindings for llama.cpp ★249 · Updated 8 months ago
- A cross-platform browser ML framework. ★683 · Updated 4 months ago
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙 ★737 · Updated last week
- VS Code extension for LLM-assisted code/text completion ★656 · Updated 3 weeks ago
- FastMLX is a high-performance, production-ready API to host MLX models. ★288 · Updated 3 weeks ago
- 🕸️🦀 A WASM vector similarity search written in Rust ★944 · Updated last year
- Python & JS/TS SDK for running AI-generated code/code interpreting in your AI app ★1,662 · Updated last week
- Apple MLX engine for LM Studio ★506 · Updated this week
- MLX Omni Server is a local inference server powered by Apple's MLX framework, specifically designed for Apple Silicon (M-series) chips. I… ★301 · Updated last week
- Replace OpenAI with Llama.cpp Automagically. ★313 · Updated 10 months ago
- Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX. ★435 · Updated 2 months ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs), allowing users to chat with LLM… ★551 · Updated 2 months ago
- Big & Small LLMs working together ★686 · Updated this week
- Gemma 2 optimized for your local machine. ★367 · Updated 8 months ago
- Vercel and web-llm template to run wasm models directly in the browser. ★146 · Updated last year
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework. ★262 · Updated this week
- Minimal LLM inference in Rust ★985 · Updated 5 months ago
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX. ★1,155 · Updated this week
- A client-side vector search library that can embed, store, search, and cache vectors. Works in the browser and Node. It outperforms OpenA… ★197 · Updated 10 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses. ★577 · Updated 5 months ago
- The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge ★1,342 · Updated this week
- Connect home devices into a powerful cluster to accelerate LLM inference. More devices mean faster inference. ★2,019 · Updated this week
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU) ★570 · Updated this week
- Suno AI's Bark model in C/C++ for fast text-to-speech generation ★796 · Updated 5 months ago
- Local AI API Platform ★2,615 · Updated this week
- E2B Desktop Sandbox for LLMs: an E2B sandbox with a desktop graphical environment that you can connect to any LLM for secure computer use. ★587 · Updated this week
- EntityDB is an in-browser vector database wrapping IndexedDB and Transformers.js over WebAssembly ★146 · Updated 3 months ago
- An extremely fast implementation of whisper optimized for Apple Silicon using MLX. ★686 · Updated 11 months ago
- Qwen 2.5 Coder 1.5B with Code Interpreter ★280 · Updated 5 months ago
- SemanticFinder - frontend-only live semantic search with transformers.js ★268 · Updated 2 weeks ago