SauravMaheshkar / samv2
CPU-compatible fork of the official SAMv2 implementation aimed at more accessible and documented tutorials
☆75 · Updated 10 months ago
Alternatives and similar repositories for samv2
Users who are interested in samv2 compare it to the libraries listed below.
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆126 · Updated last year
- [CVPR 2025] Official PyTorch implementation of "EdgeTAM: On-Device Track Anything Model" ☆534 · Updated 3 months ago
- Segment anything UI for annotations ☆102 · Updated 5 months ago
- A Gradio web UI for Depth-Pro, Sharp Monocular Metric Depth Estimation ☆51 · Updated 10 months ago
- Demonstration of MobileSAM in the browser enabled through ONNX runtime web ☆111 · Updated last month
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆65 · Updated 11 months ago
- ☆367 · Updated 10 months ago
- 2nd place solution for the Generative Interior Design 2024 competition ☆119 · Updated 7 months ago
- Efficient Track Anything ☆610 · Updated 7 months ago
- ☆51 · Updated 11 months ago
- Build your own Face App with Stable Diffusion 2.1 ☆151 · Updated 6 months ago
- Official Code for Tracking Any Object Amodally ☆118 · Updated last year
- Docker image for LLaVA: Large Language and Vision Assistant ☆3 · Updated 2 months ago
- The repo for "Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator" ☆629 · Updated 3 months ago
- Image Prompter for Gradio ☆92 · Updated last year
- Use Grounding DINO, Segment Anything, and GPT-4V to label images with segmentation masks for use in training smaller, fine-tuned models. ☆66 · Updated last year
- stable-diffusion.cpp bindings for python ☆56 · Updated last month
- ☆60 · Updated last year
- [ECCV 2024 & NeurIPS 2024] Official implementation of the paper TAPTR & TAPTRv2 & TAPTRv3 ☆266 · Updated 7 months ago
- LLaVA server (llama.cpp). ☆181 · Updated last year
- Which model is the best at object detection? Which is best for small or large objects? We compare the results in a handy leaderboard. ☆84 · Updated last week
- ☆48 · Updated 4 months ago
- GroundedSAM Base Model plugin for Autodistill ☆51 · Updated last year
- Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait ☆269 · Updated last week
- A PoC to run Segment Anything Model (SAM) entirely in the browser without any backend ☆72 · Updated 2 years ago
- ☆46 · Updated last year
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX. ☆49 · Updated 10 months ago
- SAM Annotation Tool ☆39 · Updated last year
- Inference and fine-tuning examples for vision models from 🤗 Transformers ☆158 · Updated 3 months ago
- A streamlined implementation of Grounding DINO and SAM for advanced image segmentation. This lightweight solution simplifies the integrat… ☆64 · Updated 10 months ago