This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.
☆77Jul 14, 2025Updated 7 months ago
Alternatives and similar repositories for qwen2-vl-finetune-huggingface
Users that are interested in qwen2-vl-finetune-huggingface are comparing it to the libraries listed below
Sorting:
- Table detection with Florence.☆15Jul 11, 2024Updated last year
- ☆388Feb 8, 2025Updated last year
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆19Apr 9, 2025Updated 11 months ago
- Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”☆18Dec 6, 2022Updated 3 years ago
- Demos of ChatGPT's function calling/structured data support.☆24Dec 21, 2023Updated 2 years ago
- Codes for Vision-Language Synthetic Data Enhances Echocardiography Downstream Tasks☆12May 8, 2024Updated last year
- The official code of Linguistic More: Taking a Further Step toward Efficient and Accurate Scene Text Recognition (IJCAI2023)☆27Sep 3, 2023Updated 2 years ago
- ☆16Jan 30, 2022Updated 4 years ago
- AI-powered browser extension to chat with any webpage☆10Aug 12, 2025Updated 6 months ago
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆57Sep 15, 2025Updated 5 months ago
- Automated bash script to set up a high-performance environment on Ubuntu Linux with RTX5090, including installations of PyTorch, Unsloth,…☆19Apr 1, 2025Updated 11 months ago
- ☆14May 26, 2023Updated 2 years ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Dec 4, 2021Updated 4 years ago
- Datably.ai☆17Jun 17, 2025Updated 8 months ago
- Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at SDU@AAAI-22☆14Aug 3, 2023Updated 2 years ago
- Awesome AI in Libraries☆17Jul 21, 2023Updated 2 years ago
- Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer☆15Oct 5, 2023Updated 2 years ago
- time-series row column classification☆14Jan 7, 2022Updated 4 years ago
- [ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation☆75Sep 12, 2024Updated last year
- SPRINT: Script-agnostic Structure Recognition in Tables☆16Mar 26, 2025Updated 11 months ago
- Introduction to AI for GLAM☆20Feb 6, 2026Updated last month
- Library for converting from RGB / GrayScale image to base64 and back.☆19Sep 19, 2022Updated 3 years ago
- Web-based tool to convert model into MyriadX blob☆16Dec 9, 2025Updated 3 months ago
- a single interface around speech-to-speech foundation models☆27Jun 27, 2025Updated 8 months ago
- nnanno is a collection of tools that sample, annotate and apply computer vision to the Newspaper Navigator dataset☆17Oct 16, 2024Updated last year
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Nov 4, 2025Updated 4 months ago
- Goldfish: Monolingual language models for 350 languages.☆23Updated this week
- Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.☆18Apr 23, 2023Updated 2 years ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Feb 24, 2026Updated 2 weeks ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆18Apr 5, 2025Updated 11 months ago
- Tensorflow port implementation of Single Headed Attention RNN☆16Feb 1, 2020Updated 6 years ago
- IIIF Examples and useful code☆20Sep 10, 2025Updated 6 months ago
- This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135☆19Mar 7, 2023Updated 3 years ago
- NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement☆51Aug 5, 2024Updated last year
- ☆17Jul 11, 2024Updated last year
- Implementation of BitNet-1.58 instruct tuning☆27Apr 14, 2024Updated last year
- Load any clip model with a standardized interface☆22Oct 20, 2025Updated 4 months ago
- ☆21Apr 29, 2024Updated last year
- ☆11Jun 13, 2025Updated 8 months ago