lucasgelfond / webgpu-sam2
Segment Anything 2, 100% in the browser (with WebGPU!)
☆105 Updated 2 months ago
Alternatives and similar repositories for webgpu-sam2:
Users who are interested in webgpu-sam2 are comparing it to the repositories listed below
- A JavaScript library that brings vector search and RAG to your browser! ☆93 Updated 6 months ago
- An OpenAI API-compatible API for chat with image input and questions about the images (aka multimodal). ☆225 Updated 2 months ago
- Use the Moondream 2 model to detect faces and their gaze directions in videos. ☆40 Updated last month
- ☆77 Updated this week
- A PoC to run Segment Anything Model (SAM) entirely in the browser without any backend ☆65 Updated last year
- A Next.js app for fast image generation with Flux on Replicate ☆105 Updated 4 months ago
- Demonstration of MobileSAM in the browser enabled through ONNX Runtime Web ☆98 Updated last year
- GPU-accelerated client-side embeddings for vector search, RAG, etc. ☆65 Updated last year
- This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th… ☆88 Updated 6 months ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files ☆45 Updated last month
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces ☆134 Updated 7 months ago
- A JavaScript library (with TypeScript types) to parse metadata of GGML-based GGUF files. ☆46 Updated 6 months ago
- A demo for running the Comfy Deploy API via Next.js ☆165 Updated 10 months ago
- Gradio UI for a Cog API ☆66 Updated 10 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆63 Updated 4 months ago
- Simple LaMa Inpainting: An easy-to-use implementation of the LaMa (Large Mask) inpainting model. Remove unwanted objects or fill in missi… ☆21 Updated 3 months ago
- Kokoro text-to-speech using JavaScript ☆52 Updated 3 weeks ago
- Add captions to any video ☆45 Updated last year
- linked list tldraw + LCM model (real-time Stable Diffusion) ☆66 Updated last year
- ☆43 Updated 3 months ago
- EntityDB is an in-browser vector database wrapping IndexedDB and Transformers.js over WebAssembly ☆116 Updated last month
- A JavaScript implementation of Llama 3 using node-mlx. ☆72 Updated 7 months ago
- In-browser LLM website generator ☆34 Updated 3 weeks ago
- OpenAI's CLIP model ported to JavaScript using ONNX Runtime Web ☆146 Updated last year
- Vercel and web-llm template to run wasm models directly in the browser. ☆137 Updated last year
- Cog inference for Flux models ☆323 Updated last week
- How to use bounding boxes with the Gemini API ☆102 Updated 7 months ago
- Useful resources for LLM-based diarization and transcription. ☆56 Updated 4 months ago