tangledgroup / llama-cpp-wasm
WebAssembly (Wasm) Build and Bindings for llama.cpp
☆225 Updated 5 months ago
Alternatives and similar repositories for llama-cpp-wasm:
Users who are interested in llama-cpp-wasm are comparing it to the libraries listed below.
- WebAssembly binding for llama.cpp - Enabling on-browser LLM inference ☆485 Updated this week
- Vercel and web-llm template to run wasm models directly in the browser. ☆133 Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆131 Updated 6 months ago
- A client side vector search library that can embed, store, search, and cache vectors. Works on the browser and node. It outperforms OpenA… ☆177 Updated 7 months ago
- Run Large-Language Models (LLMs) 🚀 directly in your browser! ☆172 Updated 4 months ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆102 Updated last year
- A cross-platform browser ML framework. ☆652 Updated last month
- Tensor library for machine learning ☆278 Updated last year
- Web-optimized vector database (written in Rust). ☆198 Updated this week
- SemanticFinder - frontend-only live semantic search with transformers.js ☆247 Updated 3 weeks ago
- Browser-compatible JS library for running language models ☆227 Updated 2 years ago
- JS tokenizer for LLaMA 1 and 2 ☆350 Updated 6 months ago
- LLM-based code completion engine ☆178 Updated last month
- LLM-powered lossless compression tool ☆260 Updated 5 months ago
- A ggml (C++) re-implementation of tortoise-tts ☆174 Updated 4 months ago
- WebGPU LLM inference tuned by hand ☆148 Updated last year
- llama.cpp fork with additional SOTA quants and improved performance ☆126 Updated this week
- Port of Suno AI's Bark in C/C++ for fast inference ☆55 Updated 9 months ago
- A simple vector database built on idb ☆74 Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆66 Updated last year
- Run GGML models with Kubernetes. ☆173 Updated last year
- ☆31 Updated last year
- Inference Llama 2 in one file of pure JavaScript (HTML) ☆30 Updated 6 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆222 Updated 8 months ago
- Generates grammar files from TypeScript for LLM generation ☆35 Updated 11 months ago
- Simple repo that compiles and runs llama2.c on the Web ☆54 Updated last year
- ☆138 Updated 2 months ago
- JavaScript bindings for the ggml-js library ☆40 Updated last year
- ggml implementation of BERT ☆474 Updated 10 months ago
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework. ☆240 Updated this week