Use Florence 2 to auto-label data for use in training fine-tuned object detection models.
☆69 · Aug 15, 2024 · Updated last year
Alternatives and similar repositories for autodistill-florence-2
Users who are interested in autodistill-florence-2 are comparing it to the libraries listed below.
- EdgeYOLO + ROS 2 object detection package ☆29 · Mar 28, 2023 · Updated 2 years ago
- K-FACE Analysis Project on PyTorch ☆11 · Sep 6, 2021 · Updated 4 years ago
- Multi-temporal Scene dataset for Scene Change Detection. ☆15 · Apr 14, 2021 · Updated 4 years ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models. ☆134 · Aug 7, 2024 · Updated last year
- This ComfyUI node pack allows the user to take a panoramic photo and a corresponding depth map, and turn it into a 3D environment that ca… ☆13 · Mar 29, 2025 · Updated 11 months ago
- A collection of tools for solving the Perspective-n-Point (PnP) problem in computer vision ☆14 · Mar 11, 2016 · Updated 9 years ago
- Quick exploration into fine-tuning Florence-2 ☆338 · Sep 19, 2024 · Updated last year
- A composition of offline tools to achieve high-quality multilingual speech-to-text transcription ☆23 · Feb 2, 2026 · Updated last month
- A small deep learning model based on atrous convolutional feature fusion for the application of emergency response. ☆21 · Feb 26, 2024 · Updated 2 years ago
- Code for Efficient Test-Time Scaling via Self-Calibration ☆19 · Sep 13, 2025 · Updated 5 months ago
- ☆21 · Sep 28, 2024 · Updated last year
- FlowFeat: Pixel-Dense Embedding of Motion Profiles (NeurIPS 2025 Spotlight) ☆112 · Feb 13, 2026 · Updated 3 weeks ago
- This project combines YOLO object detection with Intel's MiDaS depth estimation. ☆20 · Nov 25, 2024 · Updated last year
- A bidirectional ROS to GStreamer bridge and utilities for dynamic pipelines ☆18 · Dec 9, 2023 · Updated 2 years ago
- Flux Pro via Replicate API ☆23 · Dec 26, 2024 · Updated last year
- ☆26 · May 26, 2024 · Updated last year
- ☆30 · Nov 26, 2025 · Updated 3 months ago
- ☆21 · Aug 3, 2023 · Updated 2 years ago
- GroundedSAM Base Model plugin for Autodistill ☆55 · Apr 17, 2024 · Updated last year
- Repo for 'VLM-Auto: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes' ☆27 · Oct 10, 2024 · Updated last year
- This repo provides a starting point for Docker. ☆37 · Jan 17, 2024 · Updated 2 years ago
- ☆27 · Oct 25, 2022 · Updated 3 years ago
- Our idea is to combine the power of computer vision models and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from imag… ☆118 · May 30, 2023 · Updated 2 years ago
- Real-time video understanding and interaction through text, audio, image and video with a large multi-modal model. A framework for real-time video understanding and interaction using large multimodal models, via text… ☆26 · Jan 26, 2024 · Updated 2 years ago
- Vecna is a Python chatbot that recommends songs and movies based on your feelings ☆12 · Jun 28, 2022 · Updated 3 years ago
- ☆17 · Aug 18, 2022 · Updated 3 years ago
- Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-lan… ☆155 · Jul 3, 2024 · Updated last year
- ☆31 · Jul 25, 2023 · Updated 2 years ago
- DreamDA: Generative Data Augmentation with Diffusion Models (Official Implementation) ☆27 · Oct 13, 2024 · Updated last year
- Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching" ☆37 · Aug 18, 2024 · Updated last year
- CR3DT Fork of the BEVDet Model ☆31 · Aug 6, 2024 · Updated last year
- An integration of Segment Anything Model, Molmo, and Whisper to segment objects using voice and natural language. ☆30 · Feb 28, 2025 · Updated last year
- Robot registration (roboreg): eye-to-hand calibration from RGB / RGB-D images using robot mesh as calibration target. ☆101 · Feb 8, 2026 · Updated last month
- Finetune SAM3 with LoRA, optimized for images. A simple setup for training SAM3 on image datasets. Video finetuning is not yet supported… ☆96 · Feb 3, 2026 · Updated last month
- 3D Kalman Filter - C++ Implementation ☆16 · Jun 2, 2024 · Updated last year
- 3D Gaussian Splatting for underwater scene reconstruction via physically based appearance-medium decoupling ☆23 · Feb 13, 2026 · Updated 3 weeks ago
- ☆36 · Feb 6, 2026 · Updated last month
- ☆30 · Dec 16, 2025 · Updated 2 months ago
- I'm LPR (LiDAR Place Recognition), even if built upon a Vision Foundation Model. ☆61 · Dec 1, 2025 · Updated 3 months ago