This project demonstrates how to use the Qwen2-VL model from Hugging Face for Optical Character Recognition (OCR) and Visual Question Answering (VQA). The model combines vision and language capabilities, enabling users to analyze images and generate context-based responses.
☆29Oct 18, 2024Updated last year
Alternatives and similar repositories for Qwen2-VL-OCR-VQA
Users that are interested in Qwen2-VL-OCR-VQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for the KVP10k dataset☆23Sep 18, 2025Updated 9 months ago
- Training a BERT model from scratch.☆11Oct 15, 2023Updated 2 years ago
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆13Jul 30, 2024Updated last year
- A complete app leveraging Tensorflow.JS and React for real time object detection.☆15Nov 17, 2020Updated 5 years ago
- Revision of previous Library Bridger application. Features much cleaner code and refined UI.☆13Feb 27, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 📐 A React component for iOS that keeps your bottom-positioned elements fixed and safely visible, automatically adjusting their positions…☆28Jun 22, 2025Updated 11 months ago
- Object Detection with Transformers : DETR, Conditional DETR, Deformable DETR, Dynamic Head☆12Jan 22, 2023Updated 3 years ago
- This repository contains a notebook to demonstrate the power of Document Text Recognition (DocTR) library☆15Aug 24, 2021Updated 4 years ago
- ☆14Sep 16, 2021Updated 4 years ago
- It has 4 major features which include Text Editor , Image Editor , File Compressor & Small snake game☆18Jul 28, 2020Updated 5 years ago
- Demo for scalable Elasticsearch setups with Frozen Indices, Index Lifecycle Management, and Rollups☆12Oct 17, 2020Updated 5 years ago
- This notebook demonstrates how to implement Tensorflow & queries to forecasting for a very big dataset. ANN with 2 hidden layers has been…☆10Nov 3, 2018Updated 7 years ago
- ☆19Aug 20, 2023Updated 2 years ago
- This project is the Develop a generalized algorithm to detect the brightness of any image. Your algorithm should take an image as input a…☆13Nov 21, 2019Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- personal portfolio☆22Jun 12, 2026Updated last week
- A swiper, carousel or slider built in React and TypeScript.☆12Jun 17, 2022Updated 4 years ago
- Extract structured information from images with the AI SDK☆21Aug 14, 2024Updated last year
- ☆11Mar 16, 2021Updated 5 years ago
- Face anti-spoofing model, python/pytorch☆16Dec 19, 2023Updated 2 years ago
- A GSAP Text Reveal Component for Nuxt 3. Animation on GSAP ScrollTrigger & GSAP SmoothScroller☆13Dec 15, 2023Updated 2 years ago
- YOLOv11 trained on DocLayNet dataset.☆56Nov 4, 2024Updated last year
- The Safari browser does not adjust the view layout size when activating the virtual keyboard on mobile phones. You can see the difference…☆20Sep 28, 2023Updated 2 years ago
- This is an experimental of anomalies detection by applying different approach to the problem. PCA component regularization method, K-Mean…☆21Feb 24, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Text summarization with BERT using bert-extractive-summarizer☆25May 29, 2021Updated 5 years ago
- pytorch implementation of "pix2face" network for 3D face estimation from 2D images☆12Jan 14, 2021Updated 5 years ago
- object tracker for VOT☆10Jun 22, 2016Updated 9 years ago
- DFX API☆18Updated this week
- A collection of useful Google Apps Scripts.☆21Updated this week
- Trivia game in the browser using websockets and asyncio.☆16Nov 14, 2021Updated 4 years ago
- Automatic License Plate Recognition System built using YOLOv7 in Python☆19Oct 15, 2022Updated 3 years ago
- ☆12Feb 23, 2023Updated 3 years ago
- ☆10Dec 9, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 🔗 A curated list of awesome url shortener☆23Jan 22, 2024Updated 2 years ago
- tmp DPI☆14Dec 18, 2024Updated last year
- Decentrilized Blockchain Blog System Project with code and Documents☆31Apr 7, 2024Updated 2 years ago
- A template for a Djinni library that can be used in Java/Kotlin, ObjC/Swift and C#☆11Oct 6, 2022Updated 3 years ago
- Smart Contracts Blockchain Project With Code and Report for Banking System. Solidity bank Dapp project☆20Aug 12, 2022Updated 3 years ago
- find landmark from dog face☆11Jun 6, 2022Updated 4 years ago
- Web service for image file/image URL classification without uploading.☆16May 27, 2022Updated 4 years ago