SauravMaheshkar / samv2Links
CPU compatible fork of the official SAMv2 implementation aimed at more accessible and documented tutorials
☆74Updated 10 months ago
Alternatives and similar repositories for samv2
Users that are interested in samv2 are comparing it to the libraries listed below
Sorting:
- Demonstration of MobileSAM in the browser enabled through ONNX runtime web☆109Updated last month
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models.☆64Updated 11 months ago
- Segment anything UI for annotations☆100Updated 4 months ago
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model"☆488Updated 2 months ago
- 2nd place solution for the Generative Interior Design 2024 competition☆118Updated 6 months ago
- A Gradio web UI for Depth-Pro, Sharp Monocular Metric Depth Estimation☆51Updated 9 months ago
- FRP Fork☆171Updated 3 months ago
- SAM Annotaton Tool☆37Updated last year
- an optimized, production-ready implementation of active speaker detection☆67Updated last year
- A simple web application that lets you replace any part of an image with an image generated based on your description.☆117Updated 2 years ago
- GroundedSAM Base Model plugin for Autodistill☆51Updated last year
- 4bit bitsandbytes quants of the best 7B vlms☆33Updated 9 months ago
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard.☆81Updated this week
- Image Prompter for Gradio☆92Updated last year
- A PoC to run Segment Anything Model (SAM) entirely in the browser without any backend☆72Updated 2 years ago
- Playground Web UI using segment-anything-2 models from the Meta.☆54Updated 7 months ago
- Build your own Face App with Stable Diffusion 2.1☆151Updated 6 months ago
- ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing☆69Updated last year
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat…☆64Updated 9 months ago
- Gradio UI for a Cog API☆69Updated last year
- Passively collect images for computer vision datasets on the edge.☆34Updated last year
- ☆359Updated 9 months ago
- ☆46Updated last year
- Docker image for LLaVA: Large Language and Vision Assistant☆3Updated 2 months ago
- LLaVA server (llama.cpp).☆180Updated last year
- stable-diffusion.cpp bindings for python☆54Updated last week
- Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait☆269Updated last month
- Creation of annotated datasets from scratch using Generative AI and Foundation Computer Vision models☆120Updated this week
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- Official repository of "TryOffAnyone: Tiled Cloth Generation from a Dressed Person"☆177Updated 5 months ago