shaadclt/Qwen2-VL-OCR-VQA

shaadclt / Qwen2-VL-OCR-VQA

This project demonstrates how to use the Qwen2-VL model from Hugging Face for Optical Character Recognition (OCR) and Visual Question Answering (VQA). The model combines vision and language capabilities, enabling users to analyze images and generate context-based responses.

☆29

Alternatives and similar repositories for Qwen2-VL-OCR-VQA

Users who are interested in Qwen2-VL-OCR-VQA are comparing it to the libraries listed below.

arnab39 / POS-Tagging_HMM_vs_CRF
My implementation of Hidden Markov Model (HMM) and Conditional Random Field (CRF) for part-of-speech tagging in Python 3.6
☆11 · Nov 7, 2018 · Updated 7 years ago
nicknochnack / CustomObjectDetectionReactJSTensorflow
A complete app leveraging TensorFlow.js and React for real-time object detection.
☆15 · Nov 17, 2020 · Updated 5 years ago
Lwasinam / voicera
☆19 · Sep 9, 2024 · Updated last year
entbappy / SRGAN-Super-Resolution-GAN
☆13 · May 29, 2023 · Updated 3 years ago
aryan-0077 / Machine-Learning-Collection
A resource for learning about ML, DL, PyTorch and TensorFlow. Feedback always appreciated :)
☆22 · Mar 26, 2021 · Updated 5 years ago
moured / YOLOv11-Document-Layout-Analysis
YOLOv11 trained on DocLayNet dataset.
☆56 · Nov 4, 2024 · Updated last year
Mega-Barrel / Speed-Detection-Using-OpenCV
The camera captures the number plates of all cars and bikes and monitors their speed. If a car's speed is greater than the limit, it will stor…
☆21 · Oct 1, 2020 · Updated 5 years ago
adaptyvbio / ProtFill
Inpainting protein sequence and structure
☆12 · Nov 10, 2023 · Updated 2 years ago
sagnik3788 / FlavorFleet
🔥 FlavorFleet: Your Open-Source Culinary Adventure! Embark on a Delicious Journey with Our Open-Source Food Platform 🚀🚀🚀
☆16 · Sep 27, 2024 · Updated last year
theos-ai / license-plate-recognition
Automatic License Plate Recognition System built using YOLOv7 in Python
☆19 · Oct 15, 2022 · Updated 3 years ago
JerryYann / DPI
tmp DPI
☆14 · Dec 18, 2024 · Updated last year
jothepro / djinni-library-template
A template for a Djinni library that can be used in Java/Kotlin, ObjC/Swift and C#
☆11 · Oct 6, 2022 · Updated 3 years ago
Vatshayan / Bank-Smart-Contracts-Blockchain-Project
Smart Contracts Blockchain Project With Code and Report for Banking System. Solidity bank Dapp project
☆20 · Aug 12, 2022 · Updated 3 years ago
nicknochnack / Hugging-Face-Transformers-Summarization
A super fast walkthrough of NLP Text Summarization with Hugging Face Transformers.
☆27 · Jan 23, 2021 · Updated 5 years ago
rupeshs / machineye
Web service for image file/image URL classification without uploading.
☆16 · May 27, 2022 · Updated 4 years ago
andfoy / textobjdetection
Code for the Human-related Object Detection based on Natural Language Parsing of Image Query Expressions article
☆13 · Aug 8, 2017 · Updated 8 years ago
nhtlongcs / liveness-detection
A strong baseline for liveness detection. The source code could be used for similar tasks, such as face anti-spoofing or detecting fake v…
☆23 · Nov 29, 2022 · Updated 3 years ago
kastnerkyle / raw_voice_cleanup
Examples of cleaning up raw voices
☆18 · Mar 2, 2022 · Updated 4 years ago
TUE-EE-ES / HalideAutoGPU
☆11 · Sep 14, 2020 · Updated 5 years ago
ShmuelRonen / Stable-Diffusion-3.0-GUI
GUI for Stable Diffusion 3.0 beta image generation with automatic saving of images.
☆13 · Apr 18, 2024 · Updated 2 years ago
mbellitti / wikiart-classifier
Teaching a Convolutional Neural Network to recognize painting genre. Handcrafted dataset. Cool visualizations.
☆10 · Dec 19, 2018 · Updated 7 years ago
unnonouno / chainer-memnn
Now it is exported as an official example
☆13 · Jan 24, 2018 · Updated 8 years ago
verlab / SceneUnderstanding_CIARP_2017
☆12 · Apr 23, 2018 · Updated 8 years ago
lucidrains / lbm-training-framework
Training framework for Large Behavioral Models
☆28 · Sep 17, 2025 · Updated 9 months ago
visipedia / imet-fgvcx
☆13 · Apr 9, 2019 · Updated 7 years ago
divelab / dtn
☆57 · Nov 17, 2017 · Updated 8 years ago
catdad-experiments / libheif-emscripten
☆10 · Jun 11, 2025 · Updated last year
asanakoy / deep_unsupervised_posets
Deep Unsupervised Similarity Learning using Partially Ordered Sets (CVPR17)
☆20 · Dec 15, 2020 · Updated 5 years ago
superna9999 / meson_g12a_mali_bifrost
Amlogic G12A Mali support for Mali Bifrost based SoCs, for Mainline Linux only
☆12 · Jan 28, 2023 · Updated 3 years ago
ameya005 / Deep-Segmentation
Deep Learning methods for semantic segmentation with weakly labelled data
☆16 · Aug 12, 2016 · Updated 9 years ago
PINTO0309 / sam4onnx
A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier …
☆15 · Feb 6, 2026 · Updated 5 months ago
timvieira / learning-to-prune
Learning to Prune: Exploring the Frontier of Fast and Accurate Parsing
☆22 · Sep 24, 2024 · Updated last year
LynnHaDo / Checkbox-Detection
Checkbox Detection Model for Scanned Documents
☆94 · Mar 6, 2025 · Updated last year
octoml / public-tvm-docker
Build TVM docker image for production compilation deployments
☆12 · Sep 7, 2021 · Updated 4 years ago
Toxblh / Monic
DDC/CI utility for Linux that lives in your tray. Brightness, sound and input.
☆31 · Mar 25, 2026 · Updated 3 months ago
ai-forever / aggme
Aggregation framework for annotating datasets in computer vision tasks (detection, segmentation, video captioning etc.)
☆12 · Nov 6, 2024 · Updated last year
kcshum / pose-conditioned-NeRF-object-fusion
Official GitHub repository for paper "Language-driven Object Fusion into Neural Radiance Fields with Pose-Conditioned Dataset Updates"
☆14 · Mar 22, 2024 · Updated 2 years ago
JackKuo666 / bioRxiv-MCP-Server
🔍 Enable AI assistants to search and access bioRxiv papers through a simple MCP interface.
☆25 · Mar 18, 2025 · Updated last year
codeslake / blur_sharpness_assessment
TensorFlow implementation for scoring blur image sharpness
☆12 · Nov 29, 2017 · Updated 8 years ago