This project demonstrates how to use the Qwen2-VL model from Hugging Face for Optical Character Recognition (OCR) and Visual Question Answering (VQA). The model combines vision and language capabilities, enabling users to analyze images and generate context-based responses.
β27Oct 18, 2024Updated last year
Alternatives and similar repositories for Qwen2-VL-OCR-VQA
Users that are interested in Qwen2-VL-OCR-VQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π A React component for iOS that keeps your bottom-positioned elements fixed and safely visible, automatically adjusting their positionsβ¦β25Jun 22, 2025Updated 9 months ago
- Expert eyeglasses recommendation system with Generative Adversarial Networks written in Python, 2020.β10Aug 27, 2020Updated 5 years ago
- A swiper, carousel or slider built in React and TypeScript.β12Jun 17, 2022Updated 3 years ago
- Extract structured information from images with the AI SDKβ22Aug 14, 2024Updated last year
- Inpainting protein sequence and structureβ12Nov 10, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- β10Dec 9, 2018Updated 7 years ago
- tmp DPIβ14Dec 18, 2024Updated last year
- A course on Hugging Face landβ28Oct 9, 2025Updated 5 months ago
- A template for a Djinni library that can be used in Java/Kotlin, ObjC/Swift and C#β11Oct 6, 2022Updated 3 years ago
- A keyboard-avoiding view for Android and iOS with React Native and Expo.β25Apr 30, 2021Updated 4 years ago
- Web service for image file/image URL classification without uploading.β16May 27, 2022Updated 3 years ago
- A strong baseline for liveness detection. The source code could be used for similar tasks, such as face anti-spoofing or detecting fake vβ¦β23Nov 29, 2022Updated 3 years ago
- Scalar OpenAPI References in Laravelβ60Mar 23, 2026Updated last week
- Examples of cleaning up raw voicesβ18Mar 2, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways β’ AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Continuous quality evaluation of ML algorithms via CI/CD and GitHub Actions.β16Jan 15, 2020Updated 6 years ago
- Stable Diffusion 3.0 beta Generation GUI for image generation process and automatic save images.β14Apr 18, 2024Updated last year
- Packages and tools to enable EmBARDiment type of AI agents inside Android XR.β28Mar 9, 2026Updated 2 weeks ago
- Teaching a Convolutional Neural Network to recognize painting genre. Handcrafted dataset. Cool visualizations.β10Dec 19, 2018Updated 7 years ago
- Training framework for Large Behavioral Modelsβ27Sep 17, 2025Updated 6 months ago
- β13Apr 9, 2019Updated 6 years ago
- β57Nov 17, 2017Updated 8 years ago
- β10Jun 11, 2025Updated 9 months ago
- Port of gst-editor to Gtk+ 3 and GStreamer 1.0β13Dec 16, 2016Updated 9 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- frame extractor for OpenNI2 recordins .Oniβ12Jun 3, 2019Updated 6 years ago
- A simple Streamlit frontend for a pre-trained MobileNet CNN model + OpenCV for face mask detection in images.β10Mar 25, 2023Updated 3 years ago
- Created for this model trained by Gustavosta for Stable Diffusion to create a prompt from a few words. You can submit your own text or seβ¦β16Feb 13, 2023Updated 3 years ago
- Amlogic G12A Mali support for Mali Bifrost based SoCs, for Mainline Linux onlyβ11Jan 28, 2023Updated 3 years ago
- A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier β¦β15Feb 6, 2026Updated last month
- Build TVM docker image for production compilation deploymentsβ12Sep 7, 2021Updated 4 years ago
- ddc ci utility for linux which live in you tray. Brightnress, sound and input.β31Jul 3, 2025Updated 8 months ago
- Aggregation framework for annotating datasets in computer vision tasks (detection, segmentation, video captioning etc.)β11Nov 6, 2024Updated last year
- tensorflow implementation for scoring blur image sharpnessβ12Nov 29, 2017Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A simple Reminders app made with React Native (learning project)β28Jan 23, 2019Updated 7 years ago
- generative models for speechβ20Jul 4, 2016Updated 9 years ago
- Detect Credit card number using Mask RCNN and make task easier for OCR to retrive number from the cardβ11Oct 8, 2019Updated 6 years ago
- Searching the location of a template or a target image.β12Jul 23, 2019Updated 6 years ago
- Little image retouching application for Linux Desktop (Development)β14Feb 22, 2022Updated 4 years ago
- Easy to download and parse version of the Smartdoc 2015 - Challenge 1 dataset.β14Mar 5, 2018Updated 8 years ago
- β12May 22, 2023Updated 2 years ago