lucasgelfond / webgpu-sam2
Segment Anything 2, 100% in the browser (with WebGPU!)
☆158 Updated last week
Alternatives and similar repositories for webgpu-sam2
Users that are interested in webgpu-sam2 are comparing it to the libraries listed below
- A JavaScript library that brings vector search and RAG to your browser! ☆158 Updated last year
- Cog inference for flux models ☆365 Updated 5 months ago
- ML-powered speech synthesis directly in your browser ☆171 Updated 11 months ago
- OpenAI's CLIP model ported to JavaScript using the ONNX web runtime ☆157 Updated 2 years ago
- Run Large Language Models (LLMs) 🚀 directly in your browser! ☆222 Updated last year
- A PoC to run Segment Anything Model (SAM) entirely in the browser without any backend ☆79 Updated 2 years ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU ☆107 Updated 2 years ago
- A Next.js app for fast image generation with Flux on Replicate ☆114 Updated last month
- Full in-browser Semantic Search with Huggingface Transformers.js and ElectricSQL's PGlite! ☆107 Updated last year
- diffusers implementation for Node.js and browser ☆353 Updated last year
- Physics in tldraw ☆150 Updated last year
- A JavaScript implementation of Llama 3 using node-mlx. ☆74 Updated last year
- Gradio UI for a Cog API ☆71 Updated last year
- ☆77 Updated 11 months ago
- WebAssembly (Wasm) Build and Bindings for llama.cpp ☆285 Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆141 Updated last year
- A demo for running comfy deploy api via Next.js ☆171 Updated last year
- A client-side vector search library that can embed, store, search, and cache vectors. Works in the browser and Node. It outperforms OpenA… ☆224 Updated last year
- ☆90 Updated 2 months ago
- Browse, search, and visualize ONNX models. ☆34 Updated 8 months ago
- SemanticFinder - frontend-only live semantic search with transformers.js ☆319 Updated 9 months ago
- A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files. ☆51 Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆66 Updated last year
- ☆49 Updated 9 months ago
- ☆58 Updated last year
- Kokoro text-to-speech using JavaScript ☆63 Updated 11 months ago
- Generates grammar files from TypeScript for LLM generation ☆38 Updated last year
- Use the Moondream 2 model to detect faces and their gaze directions in videos. ☆46 Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal. ☆267 Updated 10 months ago
- Web-optimized vector database (written in Rust). ☆259 Updated 10 months ago