shaadclt / Qwen2-VL-OCR-VQAView external linksLinks
This project demonstrates how to use the Qwen2-VL model from Hugging Face for Optical Character Recognition (OCR) and Visual Question Answering (VQA). The model combines vision and language capabilities, enabling users to analyze images and generate context-based responses.
☆24Oct 18, 2024Updated last year
Alternatives and similar repositories for Qwen2-VL-OCR-VQA
Users that are interested in Qwen2-VL-OCR-VQA are comparing it to the libraries listed below
Sorting:
- A template for a Djinni library that can be used in Java/Kotlin, ObjC/Swift and C#☆11Oct 6, 2022Updated 3 years ago
- A simple Streamlit frontend for a pre-trained MobileNet CNN model + OpenCV for face mask detection in images.☆10Mar 25, 2023Updated 2 years ago
- 生成训练文本检测数据集☆12Jul 1, 2020Updated 5 years ago
- Continuous quality evaluation of ML algorithms via CI/CD and GitHub Actions.☆16Jan 15, 2020Updated 6 years ago
- Teaching a Convolutional Neural Network to recognize painting genre. Handcrafted dataset. Cool visualizations.☆10Dec 19, 2018Updated 7 years ago
- Amlogic G12A Mali support for Mali Bifrost based SoCs, for Mainline Linux only☆11Jan 28, 2023Updated 3 years ago
- Sentiment in the social media (facebook, twitter, instagram, linkedin etc.) plays a big role in managing the perception of an organizatio…☆11Nov 22, 2017Updated 8 years ago
- ☆10Dec 9, 2018Updated 7 years ago
- ☆12Apr 14, 2025Updated 10 months ago
- Aggregation framework for annotating datasets in computer vision tasks (detection, segmentation, video captioning etc.)☆11Nov 6, 2024Updated last year
- YOLOv10: Real-Time End-to-End Object Detection☆12May 24, 2024Updated last year
- VINS: Visual Search for Mobile User Interface Design☆49Jan 9, 2021Updated 5 years ago
- ☆12May 22, 2023Updated 2 years ago
- Detect Credit card number using Mask RCNN and make task easier for OCR to retrive number from the card☆11Oct 8, 2019Updated 6 years ago
- Python and JS tools to generate Printed LaTex formulas and images☆16Oct 26, 2023Updated 2 years ago
- ☆12Feb 23, 2023Updated 2 years ago
- Generation of handwritten cyrillic text using fonts☆12Mar 27, 2023Updated 2 years ago
- pytorch implementation of "pix2face" network for 3D face estimation from 2D images☆12Jan 14, 2021Updated 5 years ago
- 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.☆12Jan 17, 2021Updated 5 years ago
- Tutorial on Keras-OCR which is a packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.☆11Jun 5, 2021Updated 4 years ago
- Multimodal object tracking and scene analytics for highly actionable, real-world contextualized data☆36Updated this week
- ☆10Jun 11, 2025Updated 8 months ago
- Created for this model trained by Gustavosta for Stable Diffusion to create a prompt from a few words. You can submit your own text or se…☆14Feb 13, 2023Updated 3 years ago
- An automatic deep tabular learning package☆13Oct 30, 2023Updated 2 years ago
- C#, Unity, HoloLens 2, AR, MRTK2, UWP, HoloLens 2 Emulator☆12Jul 25, 2022Updated 3 years ago
- A Swift interface for XGBoost☆12Aug 3, 2020Updated 5 years ago
- LaTeXDataHub is an open-source platform dedicated to the sharing and contribution of real-world LaTeX image datasets and their annotation…☆12Aug 13, 2024Updated last year
- A modern Python library to work with Anoto dot patterns.☆16Aug 24, 2023Updated 2 years ago
- 🥤 RaspberryPi program for the Reverse Vending Machine project.☆15Jun 23, 2025Updated 7 months ago
- [ICML-2025] We introduce Lie group Relative position Encodings (LieRE) that goes beyond RoPE in supporting n-dimensional inputs.☆29Aug 13, 2025Updated 6 months ago
- ☆13Apr 9, 2019Updated 6 years ago
- Multi-platform, single executable HTTP proxy connecting through SSH tunnels☆10Jul 2, 2016Updated 9 years ago
- ☆10Aug 15, 2023Updated 2 years ago
- [ACL 2025] RealHiTBench: A Comprehensive Realistic Hierarchical Table Benchmark for Evaluating LLM-Based Table Analysis☆24Aug 8, 2025Updated 6 months ago
- tensorflow implementation for scoring blur image sharpness☆12Nov 29, 2017Updated 8 years ago
- ☆34Feb 10, 2026Updated last week
- A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier …☆15Feb 6, 2026Updated last week
- An open-source tool created by OctoML that converts TVM-optimized models to code runnable in ONNX Runtime.☆17Mar 30, 2023Updated 2 years ago
- Universal LLM Telegram chatbot in Python☆17Aug 16, 2024Updated last year