sandy1990418 / Finetune-Qwen2.5-VL
View external linksLinks

Fine-tuning Qwen2.5-VL for vision-language tasks | Optimized for Vision understanding | LoRA & PEFT support.

☆152

Alternatives and similar repositories for Finetune-Qwen2.5-VL

Users that are interested in Finetune-Qwen2.5-VL are comparing it to the libraries listed below

Sorting:

2U1 / Qwen-VL-Series-Finetune
View on GitHub
An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.
☆1,658Jan 10, 2026Updated last month
zhangfaen / finetune-Qwen2.5-VL
View on GitHub
☆85Aug 13, 2025Updated 6 months ago
maomaoyuchengzi / MobileNetSSD-detect
View on GitHub
mobileNet SSD 基于caffe的前向检测
☆10Nov 30, 2018Updated 7 years ago
pingponglabs / FaceAnime
View on GitHub
☆10Apr 22, 2021Updated 4 years ago
ahaqu01 / mot
View on GitHub
基于Yolov5-Deepsort-Fastreid源码，重构了视频行人MOT和行人ReID特征提取代码、接口
☆13Mar 15, 2023Updated 2 years ago
tommyMessi / PerspectiveExample
View on GitHub
python opencv 文档照片与证件照片的仿射变换的矫正
☆11Nov 3, 2020Updated 5 years ago
dreamy-xay / TableCenterNet
View on GitHub
The source code repository for the paper.
☆21Sep 8, 2025Updated 5 months ago
Zeyi-Lin / Qwen2-VL-finetune-LatexOCR
View on GitHub
☆29Feb 27, 2025Updated 11 months ago
junhongmit / Person-Reidentification-System
View on GitHub
Person Re-identification System based on Deep Residual Network with GUI.
☆11Apr 3, 2018Updated 7 years ago
rkuo2000 / GenAI
View on GitHub
☆11Updated this week
PresentIDco / PresentIDFaceVerificationAPI
View on GitHub
Face Verification API
☆11Sep 27, 2021Updated 4 years ago
Dawars / DocMAE
View on GitHub
Unofficial implementation of DocMAE (WIP): Document Image Rectification via Self-supervised Representation Learning
☆20Dec 20, 2023Updated 2 years ago
NJ-SunJiawei / qwen2.5vl-vllm-vedio-camera
View on GitHub
基于vllm部署qwen2.5_vl实现视频流的实时识别
☆20Apr 1, 2025Updated 10 months ago
zhangfaen / finetune-Qwen2-VL
View on GitHub
☆385Feb 8, 2025Updated last year
TianzhongSong / text2image
View on GitHub
生成用于训练CRNN的图片数据
☆20Apr 13, 2018Updated 7 years ago
heroinlin / SlowFastTRT
View on GitHub
使用TensorRT部署SlowFast模型
☆24Mar 2, 2022Updated 3 years ago
HorizonParadox / DRCCBI
View on GitHub
☆25Jan 13, 2025Updated last year
waylandzhang / embedding_from_scratch
View on GitHub
训练自己的中文 Embedding 模型
☆28Jan 6, 2025Updated last year
Xiaolong-RRL / qwen2_5_vllm_fastapi
View on GitHub
使用FastAPI+vLLM部署Qwen2.5
☆25Sep 29, 2024Updated last year
dyl96 / ORFENet
View on GitHub
Tiny Object Detection in Remote Sensing Images Based on Object Reconstruction and Multiple Receptive Field Adaptive Feature Enhancement (…
☆29May 30, 2025Updated 8 months ago
anyantudre / Florence-2-Vision-Language-Model
View on GitHub
Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-lan…
☆152Jul 3, 2024Updated last year
aws-samples / fine-tune-qwen2-vl-with-llama-factory
View on GitHub
☆32Jul 2, 2025Updated 7 months ago
google-research-datasets / Video-Timeline-Tags-ViTT
View on GitHub
A collection of videos annotated with timelines where each video is divided into segments, and each segment is labelled with a short free…
☆29Jan 15, 2022Updated 4 years ago
sudarshan-koirala / langchain-chainlit-docker-deployment
View on GitHub
A template to run Lanchain Powered App using Chainlit Front UI
☆13Aug 1, 2023Updated 2 years ago
ZurichRain / HMCGR
View on GitHub
code for COLING paper "A Hybrid Model of Classification and Generation for Spatial Relation Extraction"
☆10Oct 20, 2022Updated 3 years ago
hanquansanren / DvD
View on GitHub
[SIGGRAPH Asia 2025] The official implementation of the paper "DvD: Unleashing a Generative Paradigm for Document Dewarping via Coordinat…
☆32Nov 22, 2025Updated 2 months ago
myeonghak / Transformer-product-categorization
View on GitHub
트랜스포머 블록을 활용한 상품명 자연어처리 기반 카테고리 분류 모델
☆10Dec 5, 2022Updated 3 years ago
liaochikon / Minimalistic-HRNet-Human-Pose-Estimation
View on GitHub
A lightweight pytorch implementation of HRNet human pose estimation
☆14Jun 13, 2024Updated last year
6zzhh6 / WeChat_Formatting_Tool
View on GitHub
A simple WeChat Official Account layout tool based on Dify
☆16Jun 27, 2025Updated 7 months ago
domingomery / Xdefects
View on GitHub
Automatic defect recognition in X-ray testing using computer vision
☆12Dec 8, 2018Updated 7 years ago
Caesar-xxx / Human_ReID
View on GitHub
使用yolov8自动标注，运用度量学习metric learning 的ReID算法，实现跨镜头人脸追踪
☆10May 15, 2024Updated last year
Liuhk123 / yolov5_deepsort_tensorrt
View on GitHub
☆31Mar 26, 2021Updated 4 years ago
Vancouver-Datajam / WasteNet
View on GitHub
Image classification for Recyclables
☆10Sep 14, 2020Updated 5 years ago
sun1638650145 / A2PI2
View on GitHub
机器学习使用过的API中文版及机器学习的理论知识
☆13Jun 8, 2025Updated 8 months ago
majinkai / dify-database-to-knowledge
View on GitHub
Write the database metadata into the dify knowledge
☆12Dec 30, 2025Updated last month
aws-samples / sample-data-analyst-bi
View on GitHub
A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…
☆25Jan 6, 2026Updated last month
HugoPalomares / design-intent-for-sdd
View on GitHub
☆28Dec 4, 2025Updated 2 months ago
KenKaiii / b0t
View on GitHub
Workflow automation, but you just describe what you want and it happens.
☆26Nov 22, 2025Updated 2 months ago
OneWave-AI / claude-skills
View on GitHub
100 Production-Ready Claude Code Skills - The most comprehensive collection of AI skills for sales, business automation, content creation…
☆35Oct 22, 2025Updated 3 months ago

sandy1990418 / Finetune-Qwen2.5-VLView external linksLinks

Alternatives and similar repositories for Finetune-Qwen2.5-VL

sandy1990418 / Finetune-Qwen2.5-VL
View external linksLinks