SonTG / pp2024
☆14Updated last year
Alternatives and similar repositories for pp2024
Users that are interested in pp2024 are comparing it to the libraries listed below
Sorting:
- UniTable: Towards a Unified Table Foundation Model☆467Updated 11 months ago
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating…☆181Updated 8 months ago
- Scene text vietnamese☆14Updated 3 years ago
- Dictionary-guided Scene Text Recognition (CVPR-2021)☆147Updated 9 months ago
- This repo provides Geometric LayoutLM for Vietnamese document and code for export to ONNX☆13Updated last year
- A curated list of resources dedicated to table recognition☆401Updated 5 months ago
- Small application for Vietnamese scenetext detection and recognition☆18Updated last year
- A large scale camera-taken table detection and recognition dataset.☆128Updated last year
- BED-AIO team code for AIChallenge2023☆41Updated 9 months ago
- https://dl.acm.org/doi/10.1145/3657281☆96Updated last year
- ☆10Updated last year
- ☆59Updated 10 months ago
- A toolbox for Vietnamese Optical Character Recognition.☆116Updated 2 years ago
- ☆20Updated 3 years ago
- A Vietnamese handwriting recognition project☆10Updated last year
- ☆62Updated 9 months ago
- Handwriting OCR for Vietnamese Address using state-of-the-art CRNN model implemented with Tensorflow. This was a challenge proposed by th…☆74Updated last week
- Transformer OCR☆660Updated 3 months ago
- ☆30Updated 6 months ago
- This is our solution dealing with BKAI challenge☆63Updated 2 years ago
- ☆86Updated 3 months ago
- ☆15Updated 2 years ago
- My personal implementation of SVTR model for handwritten OCR☆13Updated last year
- ☆24Updated 10 months ago
- RAG for Vietnamese Wikipedia corpus.☆33Updated last year
- ICIP 2022: Adaptive Radial Projection on Fourier Magnitude Spectrum for Document Image Skew Estimation☆136Updated this week
- MTL-TabNet: Multi-task Learning based Model for Image-based Table Recognition☆100Updated 11 months ago
- EraX-VL-7B-V1 is the multimodal large language model developed by EraX team, base on Qwen2-VL.☆11Updated 4 months ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated last year
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆70Updated last year