zhangfaen / finetune-Qwen2.5-VL
View external linksLinks

☆85

Alternatives and similar repositories for finetune-Qwen2.5-VL

Users that are interested in finetune-Qwen2.5-VL are comparing it to the libraries listed below

Sorting:

zhangfaen / finetune-Qwen2-VL
View on GitHub
☆385Feb 8, 2025Updated last year
2U1 / Qwen-VL-Series-Finetune
View on GitHub
An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.
☆1,658Jan 10, 2026Updated last month
clh124 / VQAThinker
View on GitHub
[AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning
☆19Nov 28, 2025Updated 2 months ago
ihdia / seamformer
View on GitHub
Official repository accompaying the ICDAR 2023 paper
☆13Oct 3, 2023Updated 2 years ago
sandy1990418 / Finetune-Qwen2.5-VL
View on GitHub
Fine-tuning Qwen2.5-VL for vision-language tasks | Optimized for Vision understanding | LoRA & PEFT support.
☆152Feb 7, 2025Updated last year
lqzxt / NGTR
View on GitHub
☆13May 26, 2025Updated 8 months ago
whlscut / DocLayLLM
View on GitHub
[CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding
☆26Dec 18, 2025Updated 2 months ago
GZHU-DVL / HVS-5M
View on GitHub
A Spatial–Temporal Video Quality Assessment Method via Comprehensive HVS Simulation
☆17Jan 13, 2024Updated 2 years ago
ThunderVVV / RCLSTR
View on GitHub
Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`
☆17Sep 22, 2023Updated 2 years ago
Maxfashko / NV_TRT_SSD
View on GitHub
nvidia TensorRT SSD implementation
☆16May 15, 2018Updated 7 years ago
csguoh / KD-LTR
View on GitHub
[MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"
☆16Nov 3, 2023Updated 2 years ago
zzyhlyoko / DCTC
View on GitHub
☆42Sep 2, 2023Updated 2 years ago
dros1986 / filter_removal
View on GitHub
CNN-based photo-filter removal
☆14Apr 1, 2019Updated 6 years ago
musyoku / face-alignment-at-3000fps
View on GitHub
Face Alignment at 3000 FPS via Regressing Local Binary Features
☆19Nov 28, 2017Updated 8 years ago
sandwichfish / VirFace
View on GitHub
This is a PyTorch implementation of "VirFace: Enhancing Face Recognition via Unlabeled Shallow Data" (CVPR 2021).
☆22Sep 30, 2022Updated 3 years ago
FBehrad / Charm
View on GitHub
The official PyTorch Implementation of Charm: The Missing Piece in ViT fine-tuning for Image Aesthetic Assessment
☆41Dec 4, 2025Updated 2 months ago
SHI-Labs / T2I-Copilot
View on GitHub
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)
☆42Oct 6, 2025Updated 4 months ago
CodeGoat24 / DreamText
View on GitHub
[CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.
☆79Mar 24, 2025Updated 10 months ago
zzc-1998 / Q-Eval
View on GitHub
Repo for "Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content"
☆40Jun 9, 2025Updated 8 months ago
mxin262 / Bridging-Text-Spotting
View on GitHub
(CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.
☆71Jun 11, 2024Updated last year
TongkunGuan / Text-Related-Papers
View on GitHub
Update the latest text-related papers from top conferences
☆27Mar 12, 2025Updated 11 months ago
zijianchen98 / GAIA
View on GitHub
[NeurIPS'24 Spotlight] GAIA: Rethinking Action Quality Assessment for AI-Generated Videos
☆39Apr 1, 2025Updated 10 months ago
xiangyu-mm / EasyGen
View on GitHub
The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"
☆73Nov 21, 2024Updated last year
OPPO-Mente-Lab / GlyphDraw
View on GitHub
Text-To-Image Generation with Chinese Characters
☆132Jul 20, 2023Updated 2 years ago
erwold / qwen2vl-flux
View on GitHub
☆572Nov 26, 2024Updated last year
shuyansy / Visual-Text-Processing-survey
View on GitHub
The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""
☆96Oct 20, 2025Updated 3 months ago
jwoerner42 / LCW-Fine-Tuning-ChemBERTa-2
View on GitHub
Demonstrator for the effectiveness of transformer models, specifically the newly released ChemBERTa-2, in predicting physical-chemical pr…
☆15May 7, 2023Updated 2 years ago
VoyageWang / VG-Refiner
View on GitHub
The repository of VG-Refiner paper
☆17Dec 9, 2025Updated 2 months ago
eileenrmartin / dissertation-reproducibility-passive-DAS
View on GitHub
This repo contains the code to reproduce figures in my dissertation "Passive Imaging and Characterization of the Subsurface With Distribu…
☆10Jun 14, 2018Updated 7 years ago
mxnuchim / SwiftUI-ChatGPT-App
View on GitHub
☆10Mar 21, 2023Updated 2 years ago
ruhig6 / JNMR
View on GitHub
☆11Mar 11, 2024Updated last year
chenhaoxing / DiffUTE
View on GitHub
This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).
☆144Apr 11, 2025Updated 10 months ago
ZYM-PKU / UDiffText
View on GitHub
[ECCV 2024] Official repo for UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diff…
☆234Feb 14, 2025Updated last year
Canjie-Luo / Real-300K
View on GitHub
The dataset used in the CVPR 2022 paper (SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Norm…
☆34Jun 21, 2022Updated 3 years ago
BeierZhu / GLA
View on GitHub
[NeurIPS 2023] Generalized Logit Adjustment
☆39Apr 21, 2024Updated last year
arnaudjudge / RL4Seg
View on GitHub
Domain adaptation framework for segmentation via reinforcement learning.
☆11Oct 13, 2025Updated 4 months ago
leimao / TensorRT-Docker-Image
View on GitHub
TensorRT In Docker
☆11Dec 7, 2024Updated last year
flowersteam / EAGER
View on GitHub
☆10Oct 11, 2022Updated 3 years ago
wzx99 / CLIPOCR
View on GitHub
☆38Oct 20, 2023Updated 2 years ago

zhangfaen / finetune-Qwen2.5-VLView external linksLinks

Alternatives and similar repositories for finetune-Qwen2.5-VL

zhangfaen / finetune-Qwen2.5-VL
View external linksLinks