libing64 / Qwen2.5-VL-Fine-TuningLinks
☆33Updated 9 months ago
Alternatives and similar repositories for Qwen2.5-VL-Fine-Tuning
Users that are interested in Qwen2.5-VL-Fine-Tuning are comparing it to the libraries listed below
Sorting:
- X-SAM: From Segment Anything to Any Segmentation (AAAI2026)☆330Updated 2 weeks ago
- (CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La…☆511Updated 3 months ago
- New generation of CLIP with fine grained discrimination capability, ICML2025☆497Updated last month
- [NeurIPS-W 2025] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"☆54Updated 5 months ago
- 将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调☆462Updated 3 months ago
- Detect Anything via Next Point Prediction (Based on Qwen2.5-VL-3B)☆981Updated 3 weeks ago
- ☆78Updated 6 months ago
- 🔮 UniPixel: Unified Object Referring and Segmentation for Pixel-Level Visual Reasoning (NeurIPS 2025)☆201Updated last month
- [NeurIPS2025 Spotlight 🔥 ] Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Langu…☆257Updated last month
- YOLO-UniOW: Efficient Universal Open-World Object Detection☆170Updated 10 months ago
- Vision Manus: Your versatile Visual AI assistant☆302Updated 2 months ago
- OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion☆383Updated 9 months ago
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆93Updated 9 months ago
- 【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆128Updated 6 months ago
- SegDINO: An Efficient Design for Medical and Natural Image Segmentation with DINO-V3☆209Updated 2 weeks ago
- Fine tuning grounding Dino☆150Updated 4 months ago
- Video Benchmark Suite: Rapid Evaluation of Video Foundation Models☆15Updated 11 months ago
- ☆52Updated 5 months ago
- [CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".☆176Updated last year
- 这是一个不基于任何框架实现的从0到1的VLM finetune(包括Pre-train和SFT)☆35Updated 3 months ago
- Train InternViT-6B in MMSegmentation and MMDetection with DeepSpeed☆107Updated last year
- 多模态 MM +Chat 合集☆279Updated 3 months ago
- Official code for "No time to train! Training-Free Reference-Based Instance Segmentation"☆262Updated 2 weeks ago
- The source code of IEEE TPAMI 2025 "Hyper-YOLO: When Visual Object Detection Meets Hypergraph Computation".☆115Updated last year
- The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".☆386Updated 5 months ago
- [TPAMI 2025] Towards Visual Grounding: A Survey☆265Updated 3 weeks ago
- [NeurIPS 2025] Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning☆277Updated 5 months ago
- A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space☆95Updated last week
- [ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"☆248Updated last year
- leetcode-hot100的题目,和 Interview-code-practice-python互为一体,找工作的好帮手。☆28Updated last year