lucasgelfond / webgpu-sam2Links
Segment Anything 2, 100% in the browser (with WebGPU!)
☆135Updated 6 months ago
Alternatives and similar repositories for webgpu-sam2
Users that are interested in webgpu-sam2 are comparing it to the libraries listed below
Sorting:
- OpenAI's CLIP model ported to JavaScript using the ONNX web runtime☆154Updated last year
- A JavaScript library that brings vector search and RAG to your browser!☆131Updated 11 months ago
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssembly☆189Updated 2 months ago
- A Next.js app for fast image generation with Flux on Replicate☆109Updated 9 months ago
- Run Large-Language Models (LLMs) 🚀 directly in your browser!☆210Updated 10 months ago
- A PoC to run Segment Anything Model (SAM) entirely in the browser without any backend☆72Updated 2 years ago
- kokoro text to speech using javascript☆59Updated 5 months ago
- Gradio UI for a Cog API☆69Updated last year
- In-browser LLM website generator☆50Updated 5 months ago
- ML-powered speech synthesis directly in your browser☆160Updated 5 months ago
- Browse, search, and visualize ONNX models.☆32Updated 2 months ago
- Cog inference for flux models☆356Updated 3 weeks ago
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆47Updated 3 weeks ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆64Updated 9 months ago
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆55Updated last month
- Using headless Chrome on server side environments for true client side browser emulation with NVIDIA T4 GPUs for Web AI model testing or …☆79Updated last year
- ☆84Updated this week
- A demo for running comfy deploy api via nextjs☆167Updated last year
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆137Updated last year
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆47Updated 11 months ago
- This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th…☆89Updated 11 months ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated 2 years ago
- linked list tldraw + LCM model (realtime stable diffusion)☆68Updated last year
- diffusers implementation for node.js and browser☆343Updated last year
- Image Generation API Server - Similar to https://text-generator.io but for images☆50Updated this week
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 9 months ago
- Simple LaMa Inpainting: An easy-to-use implementation of the LaMa (Large Mask) inpainting model. Remove unwanted objects or fill in missi…☆22Updated 8 months ago
- GPU accelerated client-side embeddings for vector search, RAG etc.☆66Updated last year
- Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments☆42Updated 5 months ago
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆44Updated 6 months ago