libing64 / Qwen2.5-VL-Fine-Tuning
☆33Updated 8 months ago
Alternatives and similar repositories for Qwen2.5-VL-Fine-Tuning
Users who are interested in Qwen2.5-VL-Fine-Tuning are comparing it to the repositories listed below.
- X-SAM: From Segment Anything to Any Segmentation (AAAI2026)☆321Updated 2 weeks ago
- ☆76Updated 6 months ago
- Splices the vision head of SmolVLM2 onto the Qwen3-0.6B model and fine-tunes the combination☆438Updated 2 months ago
- OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion☆378Updated 8 months ago
- (CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La…☆491Updated 3 months ago
- Vision Manus: Your versatile Visual AI assistant☆297Updated last month
- New generation of CLIP with fine grained discrimination capability, ICML2025☆472Updated 3 weeks ago
- 🔮 UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)☆190Updated last month
- YOLO-UniOW: Efficient Universal Open-World Object Detection☆166Updated 10 months ago
- [TPAMI 2025] Towards Visual Grounding: A Survey☆260Updated last week
- Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)☆811Updated last week
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆93Updated 8 months ago
- [NeurIPS2025 Spotlight 🔥 ] Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Langu…☆248Updated 2 weeks ago
- Official code for "No time to train! Training-Free Reference-Based Instance Segmentation"☆254Updated this week
- NeurIPS 2024 Paper: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing☆577Updated last year
- Fine-tuning Grounding DINO☆149Updated 3 months ago
- Train a simple CLIP model hands-on to deepen your understanding of CLIP.☆21Updated 6 months ago
- The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".☆113Updated 11 months ago
- [NeurIPS2025 Workshop] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"☆52Updated 4 months ago
- Includes the VideoCount dataset and CountVid code for the paper Open-World Object Counting in Videos.☆79Updated 2 weeks ago
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"☆555Updated 3 months ago
- Multimodal (MM) + Chat collection☆278Updated 3 months ago
- VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs☆78Updated this week
- [CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".☆176Updated 11 months ago
- [NeurIPS 2025] Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning☆268Updated 4 months ago
- Video Benchmark Suite: Rapid Evaluation of Video Foundation Models☆15Updated 10 months ago
- SegDINO: An Efficient Design for Medical and Natural Image Segmentation with DINO-V3☆186Updated 3 weeks ago
- A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space☆91Updated 10 months ago
- ☆95Updated 3 months ago
- Problems from leetcode-hot100; a companion to Interview-code-practice-python and a handy aid for job hunting.☆28Updated last year