2bgm/KIE-HVQA

☆13

Alternatives and similar repositories for KIE-HVQA

Users who are interested in KIE-HVQA are comparing it to the libraries listed below.

Dedsec-Xu / DatasetImgLabel-ICDAR2015
DatasetImgLabeler is an image annotation tool for researchers to prepare datasets in ICDAR2015 format
☆12 · Dec 7, 2019 · Updated 6 years ago
Toseic / LLM-inference-arxiv-daily
🎓 Automatically updates LLM inference systems papers daily using GitHub Actions (updates every 12 hours)
☆12 · Updated this week
sunxm2357 / DIME-FM
Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"
☆15 · Oct 12, 2023 · Updated 2 years ago
keillernogueira / dynamic-rs-segmentation
Dynamic Multi-Context Segmentation of Remote Sensing Images based on Convolutional Networks
☆13 · May 16, 2019 · Updated 7 years ago
d-li14 / efficientnet-lite.pytorch
PyTorch implementation of EfficientNet-lite and a spectrum of pre-trained models on ImageNet
☆10 · Mar 20, 2020 · Updated 6 years ago
ami66 / DbnetAddClassify
DBNet text detection with added text-box classification
☆14 · Jul 27, 2022 · Updated 4 years ago
jiangnanboy / doc_ai
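Converts OCR and other models from Paddle to ONNX format, then loads these ONNX models with DJL, the Java deep learning framework, to try out inference and prediction.
☆14 · Nov 15, 2022 · Updated 3 years ago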
ShunqiM / PM
☆14 · Apr 9, 2026 · Updated 3 months ago
CoolDawnAnt / InfoChartQA
☆41 · Apr 19, 2026 · Updated 3 months ago
JerryXu0129 / HHF
☆12 · Sep 8, 2022 · Updated 3 years ago
brendanartley / GenPlot
Increasing the scale and diversity of chart de-rendering data.
☆12 · Mar 13, 2024 · Updated 2 years ago
FelixHertlein / inv3d
Project page for the ICDAR 2023 Paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document unwarping".
☆13 · Dec 21, 2023 · Updated 2 years ago
SII-sc22mc / DocFusion
A Unified Framework for Document Parsing Tasks (Including Document Layout Analysis, OCR, Formula Recognition, and Table Recognition)
☆15 · Jul 1, 2025 · Updated last year
plutoyuxie / toonify-game-character
☆18 · Mar 19, 2021 · Updated 5 years ago
HackerHyper / TMCN
Trusted Mamba Contrastive Network for Multi-View Clustering
☆16 · Dec 10, 2025 · Updated 7 months ago
Walkerlikesfish / HSNRS
Hourglass shape network for remote sensing imagery semantic segmentation
☆20 · Jun 4, 2018 · Updated 8 years ago
VIM-Bench / VIM_TOOL
☆12 · Jun 12, 2024 · Updated 2 years ago
DataArcTech / ChartBench
☆16 · May 15, 2025 · Updated last year
Dawars / DocMAE
Unofficial implementation of DocMAE (WIP): Document Image Rectification via Self-supervised Representation Learning
☆20 · Dec 20, 2023 · Updated 2 years ago
SimpleVQA / SimpleVQA
SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models
☆15 · Feb 20, 2025 · Updated last year
EmbraceLife / fastai_courses_translation_EN2CN
A community effort to translate fastai video lessons from English to Chinese
☆14 · May 2, 2019 · Updated 7 years ago
lzk9508 / DaFIR
The official code for "DaFIR: Distortion-Aware Representation Learning for Fisheye Image Rectification", TCSVT, 2023.
☆13 · May 30, 2025 · Updated last year
fortitude94deng / huawei_remote-sensing
☆15 · Jul 3, 2019 · Updated 7 years ago
JianqiangWan / VLPT-STD
Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)
☆12 · Mar 21, 2022 · Updated 4 years ago
LQNew / Dockerfiles
Dockerfile for RL research, including MuJoCo / DMC / PyTorch / TensorFlow / Atari support.
☆16 · Jan 5, 2022 · Updated 4 years ago
JuneTse / ReInceptionE
☆13 · Mar 16, 2021 · Updated 5 years ago
saicoco / webdataset
PyTorch dataset for large-scale data loading
☆13 · May 30, 2022 · Updated 4 years ago
Z1zs / MMNeuron
Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co…
☆26 · Dec 20, 2024 · Updated last year
jiangnanboy / layout_analysis
Chinese layout detection: YOLOv8 is used to detect the layout of Chinese document images.
☆60 · Apr 28, 2023 · Updated 3 years ago
zmzhang2000 / MIGCN
Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos
☆16 · May 23, 2023 · Updated 3 years ago
emasa / BalPoE-CalibratedLT
This repository is the official PyTorch implementation of Balanced Product of Calibrated Experts for Long-Tailed Recognition (CVPR 2023).
☆17 · Mar 13, 2025 · Updated last year
Su-my / TRAPO
The official repository for Trust-Region Adaptive Policy Optimization (TRAPO) – a novel hybrid framework designed to enhance large langua…
☆16 · Mar 2, 2026 · Updated 4 months ago
Hxyz-123 / ReasoningOCR
☆18 · Jul 24, 2025 · Updated last year
XuZhengzhuo / Prior-LT
Implementation code for UniMix and Bayias Compensated Loss
☆19 · Mar 7, 2023 · Updated 3 years ago
starriver030515 / ChartVerse
☆19 · Feb 9, 2026 · Updated 5 months ago
rahulsrma26 / streamlit-mnist-drawable
A drawable MNIST demo using streamlit.
☆11 · Nov 27, 2020 · Updated 5 years ago
bytedance / WildDoc
The official repo for "WildDoc: How Far Are We from Achieving Comprehensive and Robust Document Understanding in the Wild?"
☆74 · May 19, 2025 · Updated last year
jinga-lala / DAMEX
Code for "DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets", accepted at NeurIPS 2023 (Main confer…
☆28 · Mar 29, 2024 · Updated 2 years ago
fourierer / ffmpeg
Just for learning ffmpeg
☆13 · Jul 11, 2022 · Updated 4 years ago