lucasgelfond / webgpu-sam2
Segment Anything 2, 100% in the browser (with WebGPU!)
☆128Updated 4 months ago
Alternatives and similar repositories for webgpu-sam2
Users that are interested in webgpu-sam2 are comparing it to the libraries listed below
Sorting:
- Cog inference for flux models☆347Updated last month
- Gradio UI for a Cog API☆67Updated last year
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆253Updated 2 months ago
- OpenAI's CLIP model ported to JavaScript using the ONNX web runtime☆150Updated last year
- ☆79Updated this week
- ☆50Updated 6 months ago
- This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th…☆89Updated 9 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆63Updated 7 months ago
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆55Updated 3 months ago
- kokoro text to speech using javascript☆56Updated 3 months ago
- A PoC to run Segment Anything Model (SAM) entirely in the browser without any backend☆70Updated last year
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆39Updated 4 months ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆135Updated 10 months ago
- A Next.js app for fast image generation with Flux on Replicate☆107Updated 7 months ago
- A demo for running comfy deploy api via nextjs☆167Updated last year
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssembly☆152Updated last week
- Demonstration of MobileSAM in the browser enabled through ONNX runtime web☆108Updated last year
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated last year
- Using the moondream VLM with optical flow for promptable object tracking☆54Updated 2 months ago
- In-browser LLM website generator☆49Updated 3 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- A JavaScript implementation of Llama 3 using node-mlx.☆72Updated 9 months ago
- Gradio Client in Rust.☆28Updated 7 months ago
- Doodle Dash, an ML-powered web game that runs completely in your browser, thanks to Transformers.js!☆61Updated last year
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.☆58Updated last year
- ☆40Updated 3 months ago
- Sort a folder of images according to their similarity with provided text in your browser (uses a browser-ported version of OpenAI's CLIP …☆184Updated last year
- Guaranteed Structured Output from any Language Model via Hierarchical State Machines☆128Updated 2 weeks ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆47Updated 2 months ago
- Image Generation API Server - Similar to https://text-generator.io but for images☆50Updated 5 months ago