autodistill / autodistill-grounded-sam-2
Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.
☆134 · Aug 7, 2024 · Updated last year
Alternatives and similar repositories for autodistill-grounded-sam-2
Users that are interested in autodistill-grounded-sam-2 are comparing it to the libraries listed below
- A multimodal large-scale model, which performs close to the closed-source Qwen-VL-PLUS on many datasets and significantly surpasses the p… ☆14 · Feb 5, 2024 · Updated 2 years ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆69 · Aug 15, 2024 · Updated last year
- Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2 ☆3,275 · Nov 11, 2025 · Updated 3 months ago
- A simple demo for using the Grounding DINO and Segment Anything v2 models together ☆21 · Jul 31, 2024 · Updated last year
- ☆14 · Aug 10, 2025 · Updated 6 months ago
- Evaluate the performance of computer vision models and prompts for zero-shot models (Grounding DINO, CLIP, BLIP, DINOv2, ImageBind, model… ☆37 · Oct 18, 2023 · Updated 2 years ago
- Experimental AI chat app ☆24 · Jan 3, 2025 · Updated last year
- Aerosol Optical Depth Statistical Analysis ☆11 · Jun 1, 2016 · Updated 9 years ago
- The sparse Bayesian learning sandbox ☆11 · Jul 4, 2021 · Updated 4 years ago
- ODLabel is a powerful tool for zero-shot object detection, labeling and visualization. It provides an intuitive graphical user interface … ☆10 · May 19, 2024 · Updated last year
- Examples in the MLX framework ☆11 · Sep 23, 2024 · Updated last year
- ☆10 · Jul 29, 2024 · Updated last year
- Simple Video Summarization using Text-to-Segment Anything (Florence2 + SAM2). This project provides a video processing tool that utilizes… ☆10 · Feb 20, 2025 · Updated 11 months ago
- Scripts, data and research related to cow weight and breed prediction ☆13 · Aug 24, 2025 · Updated 5 months ago
- Unofficial implementation for SOLO instance segmentation ☆25 · Mar 29, 2020 · Updated 5 years ago
- Real-time video understanding and interaction through text, audio, image and video with a large multimodal model. A real-time video understanding and interaction framework using large multimodal models, through text… ☆26 · Jan 26, 2024 · Updated 2 years ago
- A Desktop Application to showcase primary OpenCV functions. With OpenCV Catalogue one can create a chain of various available OpenCV function… ☆11 · May 10, 2024 · Updated last year
- Mahalanobis Distance-based Multi-view Optimal Transport for Multi-view Crowd Localization, ECCV 2024 ☆15 · Nov 20, 2024 · Updated last year
- Real-time object detection using Florence-2 with a user-friendly GUI. ☆30 · Aug 7, 2025 · Updated 6 months ago
- [MM 2024 Oral] Refiner for AIGC ☆29 · Jul 29, 2024 · Updated last year
- ☆31 · Dec 20, 2022 · Updated 3 years ago
- ☆29 · Jul 6, 2022 · Updated 3 years ago
- GroundedSAM Base Model plugin for Autodistill ☆55 · Apr 17, 2024 · Updated last year
- A deep learning-powered visual navigation engine that enables autonomous navigation of a pocket-size quadrotor - running on PULP ☆13 · Oct 30, 2024 · Updated last year
- ☆12 · Jan 25, 2023 · Updated 3 years ago
- Official code for "EgoVSR: Towards High-Quality Egocentric Video Super-Resolution" ☆15 · Jul 26, 2023 · Updated 2 years ago
- Compare Savant and PyTorch performance ☆13 · Feb 9, 2024 · Updated 2 years ago
- This ComfyUI node pack allows the user to take a panoramic photo and a corresponding depth map, and turn it into a 3D environment that ca… ☆13 · Mar 29, 2025 · Updated 10 months ago
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. ☆62 · Apr 7, 2024 · Updated last year
- Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series ☆1,086 · Jan 21, 2025 · Updated last year
- Diffusion Model for Voice Conversion ☆17 · Oct 11, 2022 · Updated 3 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks ☆17 · Aug 18, 2023 · Updated 2 years ago
- Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC ☆14 · Apr 19, 2025 · Updated 9 months ago
- Images to inference with no labeling (use foundation models to train supervised models). ☆2,624 · May 14, 2025 · Updated 9 months ago
- ☆78 · Mar 25, 2025 · Updated 10 months ago
- A simple YOLOv5 demonstration that runs inference on multiple video files or IP cameras concurrently. Detection results are saved to video file… ☆15 · Mar 16, 2021 · Updated 4 years ago
- A collection of ROS 2 packages for autonomous systems, supporting self-driving cars, mobile robots, quadcopters, and other robotic platfo… ☆28 · Jan 17, 2026 · Updated last month
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc… ☆17 · Dec 8, 2023 · Updated 2 years ago
- NCNN deployment of the lightweight FastestDet network algorithm ☆17 · Jul 7, 2022 · Updated 3 years ago