☆14Jun 10, 2025Updated 9 months ago
Alternatives and similar repositories for KIE-HVQA
Users that are interested in KIE-HVQA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DatasetImgLabeler is a image annotation tool for researchers to prepare datasets in ICDAR2015 format☆12Dec 7, 2019Updated 6 years ago
- 🎓Automatically Update LLM inference systems Papers Daily using Github Actions (Update Every 12th hours)☆12Updated this week
- Implementation of "DIME-FM: DIstilling Multimodal and Efficient Foundation Models"☆15Oct 12, 2023Updated 2 years ago
- ☆20Nov 21, 2025Updated 4 months ago
- 这里将paddle中的ocr等模型转为onnx格式,并利用java版深度框架djl加载这些onnx模型进行推理预测尝试。☆13Nov 15, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- PyTorch implementation of EfficientNet-lite and a spectrum of pre-trained models on ImageNet☆11Mar 20, 2020Updated 6 years ago
- Dynamic Multi-Context Segmentation of Remote Sensing Images based on Convolutional Networks☆13May 16, 2019Updated 6 years ago
- dbnet文字检测,添加文本框分类☆14Jul 27, 2022Updated 3 years ago
- ☆12Aug 20, 2025Updated 7 months ago
- ☆12Sep 8, 2022Updated 3 years ago
- Increasing the scale and diversity of chart de-rendering data.☆12Mar 13, 2024Updated 2 years ago
- Project page for the ICDAR 2023 Paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document unwarping".☆13Dec 21, 2023Updated 2 years ago
- ☆18Mar 19, 2021Updated 5 years ago
- ☆15May 15, 2025Updated 10 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- VisuRiddles: Fine-grained Perception is a important thing for Multimodal Large Models in Riddles Solving☆18Oct 22, 2025Updated 5 months ago
- Trusted Mamba Contrastive Network for Multi-View Clustering☆16Dec 10, 2025Updated 3 months ago
- Hourglass shape network for remote sensing imagery semantic segmentation☆20Jun 4, 2018Updated 7 years ago
- ☆12Jun 12, 2024Updated last year
- Vision-Language Pre-Training for Boosting Scene Text Detectors (CVPR2022)☆12Mar 21, 2022Updated 4 years ago
- Unofficial implementation of DocMAE (WIP): Document Image Rectification via Self-supervised Representation Learning☆20Dec 20, 2023Updated 2 years ago
- ☆13Mar 16, 2021Updated 5 years ago
- pytorch大规模数据读取dataset☆13May 30, 2022Updated 3 years ago
- A community effort to translate fastai video lessons from English to Chinese☆14May 2, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [NAACL 2025] Beyond End-to-End VLMs: Leveraging Intermediate Text Representations for Superior Flowchart Understanding☆20Aug 23, 2025Updated 7 months ago
- The official code for "DaFIR: Distortion-Aware Representation Learning for Fisheye Image Rectification", TCSVT, 2023.☆13May 30, 2025Updated 10 months ago
- 中文版面检测(Chinese layout detection),yolov8 is used to detect the layout of Chinese document images。☆58Apr 28, 2023Updated 2 years ago
- ☆15Jul 3, 2019Updated 6 years ago
- Dockerfile for RL research. Including MuJoCo / DMC / PyTorch / Tensoflow / Atari support.☆16Jan 5, 2022Updated 4 years ago
- Code for "DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets", accepted at Neurips 2023 (Main confer…☆27Mar 29, 2024Updated 2 years ago
- The code for "AttentionPredictor: Temporal Pattern Matters for Efficient LLM Inference", Qingyue Yang, Jie Wang, Xing Li, Zhihai Wang, Ch…☆28Jul 15, 2025Updated 8 months ago
- [NeurIPS 2025 🔥] Official implementation for "Don't Just Chase “Highlighted Tokens” in MLLMs: Revisiting Visual Holistic Context Retenti…☆61Mar 5, 2026Updated 3 weeks ago
- Official implementation of "MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model". Our co…☆25Dec 20, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [NeurIPS 2025 Spotlight] SparseMVC: Probing Cross-view Sparsity Variations for Multi-view Clustering [Pytorch repository]☆42Jan 7, 2026Updated 2 months ago
- 文档图像处理工具(Document image processing tool),包括漂白 / 文字方向矫正 / 清晰增强 / 笔记去噪美化 / 去阴影 / 扭曲矫正 / 切边增强(DocBleach / TextOrientationCorrection / DocSha…☆126Aug 27, 2024Updated last year
- Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos☆16May 23, 2023Updated 2 years ago
- Implement Code for UniMix and Bayias Compensated Loss☆19Mar 7, 2023Updated 3 years ago
- A drawable MNIST demo using streamlit.☆11Nov 27, 2020Updated 5 years ago
- Just for learning ffmpeg☆13Jul 11, 2022Updated 3 years ago
- Official code implementation of " TextDiff: Mask-Guided Residual Diffusion Models for Scene Text Image " in Pattern Recognition☆24Apr 24, 2024Updated last year