Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.
☆155 · Jul 3, 2024 · Updated last year
Alternatives and similar repositories for Florence-2-Vision-Language-Model
Users who are interested in Florence-2-Vision-Language-Model compare it to the libraries listed below.
- Quick exploration into fine tuning florence 2 ☆338 · Sep 19, 2024 · Updated last year
- [ACMMM UAVM 2025] VICI: VLM-Instructed Cross-view Image-localisation ☆17 · Feb 4, 2026 · Updated last month
- Non-local Modeling for Image Quality Assessment ☆13 · Dec 20, 2023 · Updated 2 years ago
- Use Florence 2 to auto-label data for use in training fine-tuned object detection models. ☆69 · Aug 15, 2024 · Updated last year
- Mutual Information Predicts Hallucinations in Abstractive Summarization ☆13 · Nov 14, 2022 · Updated 3 years ago
- About Code release for "DeepLag: Discovering Deep Lagrangian Dynamics for Intuitive Fluid Prediction" (NeurIPS 2024), https://arxiv.org/a… ☆21 · Oct 31, 2024 · Updated last year
- ☆25 · Oct 28, 2024 · Updated last year
- ☆29 · Feb 20, 2026 · Updated 2 weeks ago
- ☆24 · May 6, 2025 · Updated 10 months ago
- Image/Instance Retrieval using CLIP, A self supervised Learning Model ☆29 · May 30, 2023 · Updated 2 years ago
- A user-friendly Command & Control (C&C) web platform for remote monitoring, management, and task automation across multiple devices. ☆14 · Dec 15, 2024 · Updated last year
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings ☆12 · Jun 28, 2022 · Updated 3 years ago
- Fine-tuning Qwen2.5-VL for vision-language tasks | Optimized for Vision understanding | LoRA & PEFT support. ☆152 · Feb 7, 2025 · Updated last year
- Implementation of related angular-margin-based classification loss functions for training (face) embedding models: SphereFace, CosFace, A… ☆26 · May 21, 2024 · Updated last year
- ☆32 · Jul 23, 2022 · Updated 3 years ago
- A music composer and player with MATLAB ☆11 · Mar 14, 2020 · Updated 5 years ago
- [AAAI 2026] Causal-Tune: Mining Causal Factors from Vision Foundation Models for Domain Generalized Semantic Segmentation ☆24 · Dec 28, 2025 · Updated 2 months ago
- GPT4-4V Histopathology In-Context-Learning ☆33 · May 12, 2024 · Updated last year
- Code for ChatRex: Taming Multimodal LLM for Joint Perception and Understanding ☆211 · Oct 15, 2025 · Updated 4 months ago
- This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction… ☆11 · Mar 20, 2023 · Updated 2 years ago
- [CVPR'2025] EntitySAM: Segment Everything in Video ☆59 · Jul 13, 2025 · Updated 7 months ago
- ☆10 · Jul 29, 2022 · Updated 3 years ago
- An R library for estimating causal effects ☆12 · Apr 25, 2025 · Updated 10 months ago
- ☆10 · Nov 15, 2015 · Updated 10 years ago
- Python package using GPU via CUDA for astronomical image reduction ☆11 · Jul 19, 2024 · Updated last year
- ☆11 · Oct 19, 2023 · Updated 2 years ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min… ☆10 · Updated this week
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022 ☆11 · Apr 13, 2025 · Updated 10 months ago
- image retrieval using metric learning ☆10 · Nov 22, 2022 · Updated 3 years ago
- gqrx-Hamlib interface with GUI ☆12 · Apr 29, 2018 · Updated 7 years ago
- Avionics software to be developed and passed down over multiple tours. ☆11 · May 25, 2020 · Updated 5 years ago
- Graph-Based Image Matching System ☆40 · Sep 4, 2025 · Updated 6 months ago
- ☆14 · Jun 10, 2025 · Updated 8 months ago
- Scraping LegiFrance naturalisation decrees for fun and OSINT profit ☆12 · May 27, 2023 · Updated 2 years ago
- WindTurbineHighSpeedBearingPrognosis-Data ☆10 · Aug 19, 2020 · Updated 5 years ago
- Header-only configuration file library for C++11 ☆12 · Nov 13, 2014 · Updated 11 years ago
- Human-like Controllable Image Captioning with Verb-specific Semantic Roles. ☆36 · Mar 11, 2022 · Updated 3 years ago
- A dehazing method for remote sensing images, based on CycleGAN. ☆12 · May 10, 2022 · Updated 3 years ago
- ☆25 · Aug 19, 2025 · Updated 6 months ago