This project demonstrates how to use the Qwen2-VL model from Hugging Face for Optical Character Recognition (OCR) and Visual Question Answering (VQA). The model combines vision and language capabilities, enabling users to analyze images and generate context-based responses.
☆28 · Oct 18, 2024 · Updated last year
Alternatives and similar repositories for Qwen2-VL-OCR-VQA
Users who are interested in Qwen2-VL-OCR-VQA are comparing it to the libraries listed below.
- A complete app leveraging TensorFlow.js and React for real-time object detection. ☆15 · Nov 17, 2020 · Updated 5 years ago
- This notebook demonstrates how to implement TensorFlow & queries to forecasting for a very big dataset. ANN with 2 hidden layers has been… ☆10 · Nov 3, 2018 · Updated 7 years ago
- Spacy, HAC, pytesseract, easyocr, doctr, mmocr, layoutlm, paddleocr ☆22 · Mar 20, 2024 · Updated 2 years ago
- Final-year ImageChain blockchain project, an application of blockchain. The project includes code and documents with a video explanation. ☆17 · Aug 12, 2022 · Updated 3 years ago
- Face anti-spoofing model, Python/PyTorch ☆16 · Dec 19, 2023 · Updated 2 years ago
- Streamlit app to translate text to or between 50 languages with mBART-50 from Hugging Face and Facebook ☆25 · May 29, 2021 · Updated 5 years ago
- The Safari browser does not adjust the view layout size when activating the virtual keyboard on mobile phones. You can see the difference… ☆20 · Sep 28, 2023 · Updated 2 years ago
- Inpainting protein sequence and structure ☆12 · Nov 10, 2023 · Updated 2 years ago
- 🔥 FlavorFleet: Your Open-Source Culinary Adventure! Embark on a Delicious Journey with Our Open-Source Food Platform 🚀🚀🚀 ☆17 · Sep 27, 2024 · Updated last year
- tmp DPI ☆14 · Dec 18, 2024 · Updated last year
- A walkthrough of how to use Hugging Face summarization pipelines for long post summarization. ☆27 · Feb 3, 2021 · Updated 5 years ago
- A template for a Djinni library that can be used in Java/Kotlin, ObjC/Swift and C# ☆11 · Oct 6, 2022 · Updated 3 years ago
- YOLO models trained on DocLayNet - power your document intelligence with layout analysis ☆157 · Mar 10, 2026 · Updated 2 months ago
- A super fast walkthrough of NLP Text Summarization with Hugging Face Transformers. ☆27 · Jan 23, 2021 · Updated 5 years ago
- Code for the Human-related Object Detection based on Natural Language Parsing of Image Query Expressions article ☆13 · Aug 8, 2017 · Updated 8 years ago
- A strong baseline for liveness detection. The source code could be used for similar tasks, such as face anti-spoofing or detecting fake v… ☆23 · Nov 29, 2022 · Updated 3 years ago
- Generate a 1 million-sample warm-up dataset for neural machine translation from a 700 million-word Mongolian text corpus using the Google… ☆18 · Jun 27, 2025 · Updated 11 months ago
- Fast-Forward Video Based on Semantic Extraction @ 2016 IEEE International Conference on Image Processing (ICIP) ☆10 · Oct 28, 2019 · Updated 6 years ago
- 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly. ☆12 · Jan 17, 2021 · Updated 5 years ago
- Generate training datasets for text detection ☆12 · Jul 1, 2020 · Updated 5 years ago
- GUI for Stable Diffusion 3.0 beta image generation that automatically saves the generated images. ☆13 · Apr 18, 2024 · Updated 2 years ago
- Teaching a Convolutional Neural Network to recognize painting genre. Handcrafted dataset. Cool visualizations. ☆10 · Dec 19, 2018 · Updated 7 years ago
- Training framework for Large Behavioral Models ☆28 · Sep 17, 2025 · Updated 8 months ago
- Port of gst-editor to Gtk+ 3 and GStreamer 1.0 ☆13 · Dec 16, 2016 · Updated 9 years ago
- DSMP Capstone Project Website using Streamlit ☆31 · Sep 22, 2023 · Updated 2 years ago
- A simple Streamlit frontend for a pre-trained MobileNet CNN model + OpenCV for face mask detection in images. ☆10 · Mar 25, 2023 · Updated 3 years ago
- Amlogic G12A Mali support for Mali Bifrost based SoCs, for Mainline Linux only ☆11 · Jan 28, 2023 · Updated 3 years ago
- Deep Learning methods for semantic segmentation with weakly labelled data ☆16 · Aug 12, 2016 · Updated 9 years ago
- 🔍 Enable AI assistants to search and access bioRxiv papers through a simple MCP interface. ☆23 · Mar 18, 2025 · Updated last year
- DDC/CI utility for Linux that lives in your tray. Brightness, sound, and input. ☆31 · Mar 25, 2026 · Updated 2 months ago
- Aggregation framework for annotating datasets in computer vision tasks (detection, segmentation, video captioning etc.) ☆12 · Nov 6, 2024 · Updated last year