zhangfaen / finetune-Qwen2.5-VLView external linksLinks
☆85Aug 13, 2025Updated 6 months ago
Alternatives and similar repositories for finetune-Qwen2.5-VL
Users that are interested in finetune-Qwen2.5-VL are comparing it to the libraries listed below
Sorting:
- ☆385Feb 8, 2025Updated last year
- An open-source implementaion for fine-tuning Qwen-VL series by Alibaba Cloud.☆1,658Jan 10, 2026Updated last month
- [AAAI 2026] Official Code for VQAThinker: Exploring Generalizable and Explainable Video Quality Assessment via Reinforcement Learning☆19Nov 28, 2025Updated 2 months ago
- Official repository accompaying the ICDAR 2023 paper☆13Oct 3, 2023Updated 2 years ago
- Fine-tuning Qwen2.5-VL for vision-language tasks | Optimized for Vision understanding | LoRA & PEFT support.☆152Feb 7, 2025Updated last year
- ☆13May 26, 2025Updated 8 months ago
- [CVPR 2025] DocLayLLM: An Efficient Multi-modal Extension of Large Language Models for Text-rich Document Understanding☆26Dec 18, 2025Updated 2 months ago
- A Spatial–Temporal Video Quality Assessment Method via Comprehensive HVS Simulation☆17Jan 13, 2024Updated 2 years ago
- Official PyTorch implementation of `[ACMMM 2023]Relational Contrastive Learning for Scene Text Recognition`☆17Sep 22, 2023Updated 2 years ago
- nvidia TensorRT SSD implementation☆16May 15, 2018Updated 7 years ago
- [MM2023] An official implement of the paper "One-stage Low-resolution Text Recognition with High-resolution Knowledge Transfer"☆16Nov 3, 2023Updated 2 years ago
- ☆42Sep 2, 2023Updated 2 years ago
- CNN-based photo-filter removal☆14Apr 1, 2019Updated 6 years ago
- Face Alignment at 3000 FPS via Regressing Local Binary Features☆19Nov 28, 2017Updated 8 years ago
- This is a PyTorch implementation of "VirFace: Enhancing Face Recognition via Unlabeled Shallow Data" (CVPR 2021).☆22Sep 30, 2022Updated 3 years ago
- The official PyTorch Implementation of Charm: The Missing Piece in ViT fine-tuning for Image Aesthetic Assessment☆41Dec 4, 2025Updated 2 months ago
- T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)☆42Oct 6, 2025Updated 4 months ago
- [CVPR2025] Official implementation of High Fidelity Scene Text Synthesis.☆79Mar 24, 2025Updated 10 months ago
- Repo for "Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content"☆40Jun 9, 2025Updated 8 months ago
- (CVPR 2024) Bridging the Gap Between End-to-End and Two-Step Text Spotting.☆71Jun 11, 2024Updated last year
- Update the latest text-related papers from top conferences☆27Mar 12, 2025Updated 11 months ago
- [NeurIPS'24 Spotlight] GAIA: Rethinking Action Quality Assessment for AI-Generated Videos☆39Apr 1, 2025Updated 10 months ago
- The official code for paper "EasyGen: Easing Multimodal Generation with a Bidirectional Conditional Diffusion Model and LLMs"☆73Nov 21, 2024Updated last year
- Text-To-Image Generation with Chinese Characters☆132Jul 20, 2023Updated 2 years ago
- ☆572Nov 26, 2024Updated last year
- The official project of paper "Visual Text Processing: A Comprehensive Review and Unified Evaluation""☆96Oct 20, 2025Updated 3 months ago
- Demonstrator for the effectiveness of transformer models, specifically the newly released ChemBERTa-2, in predicting physical-chemical pr…☆15May 7, 2023Updated 2 years ago
- The repository of VG-Refiner paper☆17Dec 9, 2025Updated 2 months ago
- This repo contains the code to reproduce figures in my dissertation "Passive Imaging and Characterization of the Subsurface With Distribu…☆10Jun 14, 2018Updated 7 years ago
- ☆10Mar 21, 2023Updated 2 years ago
- ☆11Mar 11, 2024Updated last year
- This repository is the code of our paper "DiffUTE: Universal Text Editing Diffusion Model" (NeurIPS'2023).☆144Apr 11, 2025Updated 10 months ago
- [ECCV 2024] Official repo for UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diff…☆234Feb 14, 2025Updated last year
- The dataset used in the CVPR 2022 paper (SimAN: Exploring Self-Supervised Representation Learning of Scene Text via Similarity-Aware Norm…☆34Jun 21, 2022Updated 3 years ago
- [NeurIPS 2023] Generalized Logit Adjustment☆39Apr 21, 2024Updated last year
- Domain adaptation framework for segmentation via reinforcement learning.☆11Oct 13, 2025Updated 4 months ago
- TensorRT In Docker☆11Dec 7, 2024Updated last year
- ☆10Oct 11, 2022Updated 3 years ago
- ☆38Oct 20, 2023Updated 2 years ago