libing64 / Qwen2.5-VL-Fine-TuningLinks
☆20Updated 5 months ago
Alternatives and similar repositories for Qwen2.5-VL-Fine-Tuning
Users that are interested in Qwen2.5-VL-Fine-Tuning are comparing it to the libraries listed below
Sorting:
- 将SmolVLM2的视觉头与Qwen3-0.6B模型进行了拼接微调☆226Updated last week
- Video Benchmark Suite: Rapid Evaluation of Video Foundation Models☆15Updated 7 months ago
- (CVPR 2025 highlight✨) Official repository of paper "LLMDet: Learning Strong Open-Vocabulary Object Detectors under the Supervision of La…☆345Updated this week
- YOLO-UniOW: Efficient Universal Open-World Object Detection☆149Updated 6 months ago
- A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.☆223Updated 3 months ago
- Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion☆359Updated 5 months ago
- Vision Manus: Your versatile Visual AI assistant☆245Updated last week
- ☆47Updated 2 months ago
- Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface"☆213Updated 2 months ago
- NeurIPS 2024 Paper: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing☆562Updated 9 months ago
- [arXiv'25] Official Implementation of "Seg-R1: Segmentation Can Be Surprisingly Simple with Reinforcement Learning"☆32Updated last month
- New generation of CLIP with fine grained discrimination capability, ICML2025☆263Updated 2 weeks ago
- Official code for "No time to train! Training-Free Reference-Based Instance Segmentation"☆184Updated 2 weeks ago
- Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection☆91Updated 5 months ago
- ☆44Updated 6 months ago
- [CVPR2025] Project for "HyperSeg: Towards Universal Visual Segmentation with Large Language Model".☆162Updated 8 months ago
- ☆8Updated 7 months ago
- 【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling☆122Updated 2 months ago
- Official code implementation of Perception R1: Pioneering Perception Policy with Reinforcement Learning☆236Updated 3 weeks ago
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"☆484Updated last week
- ☆78Updated 2 months ago
- A Light-Weight Framework for Open-Set Object Detection with Decoupled Feature Alignment in Joint Space☆88Updated 6 months ago
- Includes the VideoCount dataset and CountVid code for the paper Open-World Object Counting in Videos.☆64Updated 3 weeks ago
- A cli program of image retrieval using dinov2☆75Updated 2 years ago
- The official implementation of [CVPR 2025] "5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks".☆349Updated last month
- ☆500Updated 2 weeks ago
- ☆127Updated 3 months ago
- leetcode-hot100的题目,和 Interview-code-practice-python互为一体,找工作的好帮手。☆16Updated 9 months ago
- Official repo of Griffon series including v1(ECCV 2024), v2, and G☆228Updated 2 months ago
- [TPAMI reviewing] Towards Visual Grounding: A Survey☆207Updated 2 months ago