lucasgelfond / webgpu-sam2
Segment Anything 2, 100% in the browser (with WebGPU!)
☆118Updated 3 months ago
Alternatives and similar repositories for webgpu-sam2:
Users that are interested in webgpu-sam2 are comparing it to the libraries listed below
- Gradio UI for a Cog API☆66Updated 11 months ago
- A JavaScript library that brings vector search and RAG to your browser!☆104Updated 7 months ago
- In-browser LLM website generator☆45Updated last month
- kokoro text to speech using javascript☆55Updated last month
- Simple LaMa Inpainting: An easy-to-use implementation of the LaMa (Large Mask) inpainting model. Remove unwanted objects or fill in missi…☆21Updated 4 months ago
- ☆46Updated 4 months ago
- ⚙️ | REPLACED BY https://github.com/runpod-workers | Official set of serverless worker provided by RunPod as endpoints.☆57Updated last year
- Cog inference for flux models☆332Updated this week
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆39Updated 2 months ago
- OpenAI's CLIP model ported to JavaScript using the ONNX web runtime☆146Updated last year
- A Next.js app for fast image generation with Flux on Replicate☆106Updated 5 months ago
- This project is a reverse-engineered version of Figma's tone changer. It uses Groq's Llama-3-8b for high-speed inference and to adjust th…☆88Updated 7 months ago
- Add local LLMs to your Web or Electron apps! Powered by Rust + WebGPU☆102Updated last year
- A demo for running comfy deploy api via nextjs☆167Updated 11 months ago
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆55Updated last month
- A SQLite extension for generating text embeddings from remote APIs (OpenAI, Nomic, Ollama, llamafile...)☆112Updated 4 months ago
- Useful resources for LLM-based Diarization and Transcription.☆55Updated 5 months ago
- Dynamically structure language models to produce outputs that adhere to specific requirements without sacrificing their creative capabili…☆118Updated last week
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆71Updated 6 months ago
- Run Large-Language Models (LLMs) 🚀 directly in your browser!☆188Updated 6 months ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆47Updated 7 months ago
- TypeScript generator for llama.cpp Grammar directly from TypeScript interfaces☆136Updated 8 months ago
- Simple text to phones converter using eSpeak NG.☆25Updated 2 months ago
- EntityDB is an in-browser vector database wrapping indexedDB and Transformers.js over WebAssembly☆137Updated 2 months ago
- The collaborative multiplayer game engine for the web.☆122Updated this week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆77Updated 5 months ago
- An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.☆232Updated 2 weeks ago
- A JavaScript implementation of Llama 3 using node-mlx.☆72Updated 8 months ago