qhnhynmm/ViOCRVQA-Dataset

The largest Vietnamese VQA dataset, focused on questions about the text content in images.

☆19

Alternatives and similar repositories for ViOCRVQA-Dataset

Users who are interested in ViOCRVQA-Dataset are comparing it to the repositories listed below.

tannd-ds / realtime-reid
Realtime Person Re-Identification using Kafka, Spark and Deep Learning
☆15 · Feb 20, 2024 · Updated 2 years ago
hyeonss0417 / nestjs-airbnb
Airbnb Backend with NestJS + TypeORM
☆13 · Mar 20, 2021 · Updated 5 years ago
harrytea / TGDoc
"Towards Improving Document Understanding: An Exploration on Text-Grounding via MLLMs" 2023
☆16 · Nov 28, 2024 · Updated last year
phuctan214 / FineTunning-FeedBack-Transformer
☆10 · Apr 3, 2021 · Updated 5 years ago
kyegomez / VisionLLaMA
Implementation of VisionLLaMA from the paper: "VisionLLaMA: A Unified LLaMA Interface for Vision Tasks" in PyTorch and Zeta
☆15 · Nov 11, 2024 · Updated last year
nghiangh / OpenViVQA
This is an open-source repository for constructing and researching fusion-style deep learning methods combined with pretrained vision mod…
☆15 · Dec 31, 2024 · Updated last year
text2motion / blender-integration
Integrate Blender with the Text2Motion platform to generate 3D animations from text prompts using Generative AI.
☆22 · Mar 27, 2025 · Updated last year
saifullah3396 / docxclassifier
☆17 · Jul 11, 2024 · Updated 2 years ago
hllj / Vistral-V
Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.
☆23 · Jul 1, 2024 · Updated 2 years ago
ispamm / NAF-DPM
NAF-DPM: A Nonlinear Activation-Free Diffusion Probabilistic Model for Document Enhancement
☆54 · Aug 5, 2024 · Updated last year
MapariAbdullah / Llama2-Custom-document-QA
Using open-source LLM Llama2 by Meta on local CPU inference for document question-and-answer
☆15 · Oct 5, 2023 · Updated 2 years ago
RisabBiswas / T2T-BinFormer
SOTA Document Image Enhancement - T2T-BinFormer: Effective Document Image Enhancement Using tokens-to-token Transformer Network
☆24 · Dec 9, 2023 · Updated 2 years ago
davisyoshida / jax-gptq
JAX implementation of GPTQ quantization algorithm
☆10 · Jul 19, 2023 · Updated 3 years ago
kh4nh12 / ViVQA
☆16 · Jul 10, 2022 · Updated 4 years ago
pstage-ocr-team6 / ocr-teamcode
✨ Beautiful OCR Project Team Code by Team DKT
☆12 · Jun 23, 2021 · Updated 5 years ago
dali92002 / SSL-OCR
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoders for Text Recognition and Document Enhancement - AAAI 2023
☆30 · Jul 12, 2023 · Updated 3 years ago
DM2-ND / EDMem
Code for EMNLP 2022 paper "A Unified Encoder-Decoder Framework with Entity Memory"
☆15 · Apr 24, 2023 · Updated 3 years ago
mrmarufpro / Delivery-management-system
An open source Basic Delivery Management System. Built with Nest Js framework, Remix Js full stack JavaScript React framework, Tailwind CS…
☆55 · Jun 20, 2024 · Updated 2 years ago
tanminhtran168 / Vi-MTVQA
☆15 · Oct 3, 2024 · Updated last year
L597383845 / row-col-table-recognition
time-series row column classification
☆14 · Jan 7, 2022 · Updated 4 years ago
noorkhokhar99 / Fire-Detection-using-YOLOv8
Fire-Detection-using-YOLOv8
☆56 · Feb 2, 2023 · Updated 3 years ago
ayanban011 / SwinDocSegmenter
[ICDAR 2023] (Oral) An End-to-End Unified Domain Adaptive Transformer for Document Instance Segmentation
☆74 · Sep 12, 2024 · Updated last year
DS3Lab / TableParser
Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at SDU@AAAI-22
☆15 · Aug 3, 2023 · Updated 2 years ago
zhangyifei01 / LMIM
Linguistics-aware Masked Image Modeling for Self-supervised Scene Text Recognition
☆15 · Oct 26, 2025 · Updated 9 months ago
mxin262 / ESTextSpotter
(ICCV 2023) ESTextSpotter: Towards Better Scene Text Spotting with Explicit Synergy in Transformer
☆78 · Apr 9, 2024 · Updated 2 years ago
ZZR8066 / SEMv2
☆71 · Jun 26, 2024 · Updated 2 years ago
h2oai / doctr
docTR by Mindee (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Lear…
☆11 · May 19, 2026 · Updated 2 months ago
uakarsh / TiLT-Implementation
Implementation of the paper: Going Full-TILT Boogie on Document Understanding with Text-Image-Layout Transformer.
☆18 · Apr 23, 2023 · Updated 3 years ago
wjbmattingly / qwen2-vl-finetune-huggingface
This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.
☆78 · Jul 14, 2025 · Updated last year
ayumiymk / DiG
Official PyTorch implementation of `Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition`
☆74 · Feb 27, 2023 · Updated 3 years ago
CuteBoiz / Ubuntu_Installation
Things To Do After Installing Ubuntu
☆12 · Sep 9, 2023 · Updated 2 years ago
facebookresearch / MultiplexedOCR
Code for CVPR21 paper A Multiplexed Network for End-to-End, Multilingual OCR
☆80 · Dec 2, 2022 · Updated 3 years ago
ThreeRiversAINexus / sample-agents
☆21 · May 14, 2025 · Updated last year
Hxyz-123 / ReasoningOCR
☆18 · Jul 24, 2025 · Updated last year
Khang-9966 / Vietnamese-LLM-instruction-datasets
☆29 · Mar 29, 2024 · Updated 2 years ago
hilmansw / PDF-Summarizer
PDF Summarizer using Streamlit, LangChain, and OpenAI frameworks.
☆23 · Oct 18, 2023 · Updated 2 years ago
Taha0229 / self-reflective-RAG
Exploring SOTA Advanced RAG techniques: This project implements a self reflective RAG, seamlessly integrating multiple knowledge sources …
☆20 · Jul 8, 2024 · Updated 2 years ago
fishmingyu / qrv2-gpu-mode
Batched square compact-Householder QR factorization.
☆14 · Jul 2, 2026 · Updated 3 weeks ago
ToolBrain / TraceBrain
TraceBrain is an open-source trace management platform for observing, governing, and improving LLM-based agent workflows.
☆69 · Jun 2, 2026 · Updated last month