autodistill / autodistill-florence-2
Use Florence 2 to auto-label data for use in training fine-tuned object detection models.
☆69Aug 15, 2024Updated last year
Alternatives and similar repositories for autodistill-florence-2
Users who are interested in autodistill-florence-2 are comparing it to the libraries listed below.
- Code and pruned models for our paper: K. Gkrispanis, N. Gkalelis, V. Mezaris, "Filter-Pruning of Lightweight Face Detectors Using a Geome…☆14May 8, 2024Updated last year
- EdgeYOLO + ROS 2 object detection package☆29Mar 28, 2023Updated 2 years ago
- Real-time object detection using Florence-2 with a user-friendly GUI.☆30Aug 7, 2025Updated 6 months ago
- RAG-QA is a free, containerised question-answer framework that allows you to ask questions to your documents in an intuitive way☆20Jan 25, 2024Updated 2 years ago
- K-FACE Analysis Project on PyTorch☆11Sep 6, 2021Updated 4 years ago
- Multi-temporal Scene dataset for Scene Change Detection.☆15Apr 14, 2021Updated 4 years ago
- Use Segment Anything 2, grounded with Florence-2, to auto-label data for use in training vision models.☆134Aug 7, 2024Updated last year
- a collection of tools for solving the Perspective-n-Point (PnP) problem in computer vision☆14Mar 11, 2016Updated 9 years ago
- This ComfyUI node pack allows the user to take a panoramic photo and a corresponding depth map, and turn it into a 3D environment that ca…☆13Mar 29, 2025Updated 10 months ago
- The Florence Tool CLI provides a command-line interface for processing images using the Florence-2 model. This tool allows users to apply…☆16Jan 21, 2025Updated last year
- Quick exploration into fine-tuning Florence-2☆339Sep 19, 2024Updated last year
- A composition of offline tools to achieve high quality multilingual speech to text transcription☆23Feb 2, 2026Updated 2 weeks ago
- Official implementation of "Confidence-Calibrated Face and Kinship Verification" (T-IFS 2023)☆24Oct 2, 2023Updated 2 years ago
- ☆15Apr 28, 2023Updated 2 years ago
- A small deep learning model based on atrous convolutional feature fusion for the application of emergency response.☆21Feb 26, 2024Updated last year
- ☆29Nov 26, 2025Updated 2 months ago
- a bidirectional ROS to GStreamer bridge and utilities for dynamic pipelines☆18Dec 9, 2023Updated 2 years ago
- ☆26May 26, 2024Updated last year
- Repo for 'VLM-Auto: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes'☆26Oct 10, 2024Updated last year
- Flask-based web application designed to compare text and image embeddings using the CLIP model.☆22Jan 22, 2024Updated 2 years ago
- ☆22Aug 3, 2023Updated 2 years ago
- GroundedSAM Base Model plugin for Autodistill☆55Apr 17, 2024Updated last year
- [NeurIPS'25 Spotlight🔥] Official Implementation of RobustMerge: Parameter-Efficient Model Merging for MLLMs with Direction Robustness☆59Dec 25, 2025Updated last month
- Nougat is Meta AI's revolutionary OCR model designed to transcribe scientific PDFs into an easy-to-use Markdown format.☆27Oct 14, 2023Updated 2 years ago
- ☆27Oct 25, 2022Updated 3 years ago
- Our idea is to combine the power of computer vision model and LLMs. We use YOLO, CLIP and DINOv2 to extract high-level features from imag…☆118May 30, 2023Updated 2 years ago
- ☆17Aug 18, 2022Updated 3 years ago
- Finetune SAM3 with LoRA — optimized for images. A simple setup for training SAM3 on image datasets. Video finetuning is not yet supported…☆75Feb 3, 2026Updated 2 weeks ago
- Real-time video understanding and interaction through text, audio, image, and video with a large multimodal model.☆26Jan 26, 2024Updated 2 years ago
- ☆15Feb 6, 2026Updated last week
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆11Jun 28, 2022Updated 3 years ago
- Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-lan…☆152Jul 3, 2024Updated last year
- ☆31Jul 25, 2023Updated 2 years ago
- DreamDA: Generative Data Augmentation with Diffusion Models (Official Implementation)☆27Oct 13, 2024Updated last year
- An integration of Segment Anything Model, Molmo, and Whisper to segment objects using voice and natural language.☆30Feb 28, 2025Updated 11 months ago
- mHC-lite: You Don’t Need 20 Sinkhorn-Knopp Iterations☆65Jan 12, 2026Updated last month
- Eye-to-hand calibration from RGB / RGB-D images using robot mesh as calibration target.☆97Feb 8, 2026Updated last week
- ☆28Dec 16, 2025Updated 2 months ago
- 3D Gaussian Splatting for underwater scene reconstruction via physical-based appearance-medium decoupling☆23Updated this week