ngxson / wllamaLinks
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
☆960Updated last week
Alternatives and similar repositories for wllama
Users that are interested in wllama are comparing it to the libraries listed below
Sorting:
- WebAssembly (Wasm) Build and Bindings for llama.cpp☆285Updated last year
- A cross-platform browser ML framework.☆733Updated last year
- Open-source LLM load balancer and serving platform for self-hosting LLMs at scale 🏓🦙☆1,402Updated this week
- ☆617Updated this week
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssembly☆260Updated 7 months ago
- 🕸️🦀 A WASM vector similarity search written in Rust☆1,030Updated 2 years ago
- VS Code extension for LLM-assisted code/text completion☆1,106Updated last month
- Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.☆913Updated 3 months ago
- Run Large-Language Models (LLMs) 🚀 directly in your browser!☆222Updated last year
- Vercel and web-llm template to run wasm models directly in the browser.☆166Updated 2 years ago
- The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …☆611Updated 10 months ago
- On-device LLM Inference Powered by X-Bit Quantization☆273Updated this week
- FastMLX is a high performance production ready API to host MLX models.☆337Updated 9 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation☆850Updated last year
- The easiest & fastest way to run customized and fine-tuned LLMs locally or on the edge☆1,557Updated this week
- Pure C++ implementation of several models for real-time chatting on your computer (CPU & GPU)☆757Updated last week
- Big & Small LLMs working together☆1,230Updated this week
- LM Studio Apple MLX engine☆842Updated last week
- Vectra is a local vector database for Node.js with features similar to pinecone but built using local files.☆551Updated 7 months ago
- SemanticFinder - frontend-only live semantic search with transformers.js☆313Updated 8 months ago
- A JavaScript library that brings vector search and RAG to your browser!☆157Updated last year
- Large-scale LLM inference engine☆1,611Updated last month
- MLX-VLM is a package for inference and fine-tuning of Vision Language Models (VLMs) on your Mac using MLX.☆1,949Updated this week
- 💬 A proposal for a web API for prompting browser-provided language models☆627Updated last month
- Super-fast Structured Outputs☆640Updated 3 weeks ago
- LLM-powered lossless compression tool☆295Updated last year
- VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.☆183Updated 10 months ago
- Docs for GGUF quantization (unofficial)☆340Updated 5 months ago
- ML-powered speech synthesis directly in your browser☆170Updated 10 months ago
- Local AI API Platform☆2,764Updated 5 months ago