lucasgelfond / webgpu-sam2
Segment Anything 2, 100% in the browser (with WebGPU!)
☆150Updated 10 months ago
Alternatives and similar repositories for webgpu-sam2
Users that are interested in webgpu-sam2 are comparing it to the libraries listed below
- ML-powered speech synthesis directly in your browser☆167Updated 8 months ago
- A PoC to run Segment Anything Model (SAM) entirely in the browser without any backend☆76Updated 2 years ago
- OpenAI's CLIP model ported to JavaScript using the ONNX web runtime☆158Updated 2 years ago
- A JavaScript library that brings vector search and RAG to your browser!☆152Updated last year
- A Next.js app for fast image generation with Flux on Replicate☆110Updated 2 months ago
- Full in-browser Semantic Search with Huggingface Transformers.js and ElectricSQL's PGlite!☆106Updated last year
- kokoro text to speech using javascript☆62Updated 9 months ago
- Run Large-Language Models (LLMs) 🚀 directly in your browser!☆219Updated last year
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssembly☆232Updated 5 months ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆104Updated 2 years ago
- A demo for running comfy deploy api via nextjs☆170Updated last year
- diffusers implementation for node.js and browser☆347Updated last year
- WebAssembly (Wasm) Build and Bindings for llama.cpp☆283Updated last year
- ☆72Updated 8 months ago
- ☆46Updated 6 months ago
- In-browser LLM website generator☆50Updated 9 months ago
- A JavaScript implementation of Llama 3 using node-mlx.☆72Updated last year
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆46Updated 4 months ago
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆46Updated 9 months ago
- Gradio UI for a Cog API☆69Updated last year
- Kitten TTS web demo using transformers.js☆71Updated 2 months ago
- Browse, search, and visualize ONNX models.☆35Updated 5 months ago
- Browser-compatible JS library for running language models☆232Updated 3 years ago
- AI Powered search tool offers content-based, text, and visual similarity system-wide search.☆269Updated 5 months ago
- Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments☆45Updated 8 months ago
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.☆60Updated 4 months ago
- Sort a folder of images according to their similarity with provided text in your browser (uses a browser-ported version of OpenAI's CLIP …☆186Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆140Updated last year
- Vercel and web-llm template to run wasm models directly in the browser.☆164Updated last year
- A prompt to code site using mistral-7b and ollama.☆78Updated last year