libing64 / Qwen2.5-VL-Fine-TuningLinks
☆17Updated 3 months ago
Alternatives and similar repositories for Qwen2.5-VL-Fine-Tuning
Users that are interested in Qwen2.5-VL-Fine-Tuning are comparing it to the libraries listed below
Sorting:
- A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space☆85Updated 4 months ago
- ☆8Updated 5 months ago
- The official implement of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning"☆143Updated last week
- Official implementation and datasets of AddressCLIP☆60Updated 11 months ago
- Video Benchmark Suite: Rapid Evaluation of Video Foundation Models☆15Updated 4 months ago
- Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion☆326Updated 2 months ago
- Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".☆143Updated 5 months ago
- (CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La…☆213Updated this week
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆87Updated 2 months ago
- YOLO-UniOW: Efficient Universal Open-World Object Detection☆132Updated 4 months ago
- [TPAMI reviewing] Towards Visual Grounding: A Survey☆171Updated 2 months ago
- Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface"☆189Updated 2 months ago
- Precision Search through Multi-Style Inputs☆70Updated last month
- Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning☆192Updated last week
- ☆43Updated 3 months ago
- ☆93Updated last month
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆91Updated 7 months ago
- [CVPR 2024 Highlight] Official GraCo: Granularity-Controllable Interactive Segmentation.☆57Updated 2 months ago
- Multimodal Open-O1 (MO1) is designed to enhance the accuracy of inference models by utilizing a novel prompt-based approach. This tool wo…☆29Updated 8 months ago
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆60Updated 9 months ago
- New generation of CLIP with fine grained discrimination capability, ICML2025☆158Updated 2 weeks ago
- [CVPR'24] MESA: Matching Everything by Segmenting Anything☆139Updated 2 months ago
- ☆22Updated 3 weeks ago
- Awesome OVD-OVS - A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future☆181Updated 2 months ago
- ☆41Updated 4 months ago
- Fine-tuning Qwen2.5-VL for vision-language tasks | Optimized for Vision understanding | LoRA & PEFT support.☆78Updated 4 months ago
- ☆11Updated last month
- ☆25Updated 9 months ago
- The first attempt to replicate o3-like visual clue-tracking reasoning capabilities.☆46Updated last week
- Official repo of Griffon series including v1(ECCV 2024), v2, and G☆217Updated last week