BengaliAI / BADLAD
BADLAD: Bengali Document Layout Analysis Dataset
☆14 · Updated last year
Alternatives and similar repositories for BADLAD
Users who are interested in BADLAD are comparing it to the repositories listed below.
- Bangla Unicode Normalization ☆21 · Updated last year
- This Python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batc… ☆35 · Updated last year
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages ☆14 · Updated last year
- Automatic Context Sensitive Spelling Correction for Bangla Text Using BERT and Levenshtein Distance ☆21 · Updated last year
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models ☆28 · Updated 2 years ago
- ☆60 · Updated 5 months ago
- This is the official implementation of the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume… ☆28 · Updated last year
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas… ☆80 · Updated 2 years ago
- Resources and tools for Bangla language computation ☆14 · Updated 2 years ago
- SemEval 2024 Task 1: Textual Semantic Relatedness ☆27 · Updated last year
- This repository contains the official release of the model "BanglaT5" and associated downstream finetuning code and datasets introduced i… ☆85 · Updated 2 years ago
- Machine Reading Comprehension specifically for the Vietnamese language ☆41 · Updated 3 years ago
- ☆41 · Updated 2 years ago
- This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Da… ☆152 · Updated last year
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs ☆23 · Updated last year
- Trained Detectron2 object detection models for document layout analysis based on the PubLayNet dataset ☆50 · Updated 2 years ago
- Solutions provided to Chip Huyen's Machine Learning Interview Book with GPT ☆42 · Updated 2 years ago
- Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023. ☆17 · Updated last year
- Fast whitespace correction with Transformers ☆17 · Updated 3 months ago
- Aiming to achieve the ultimate multilingual TTS pipeline, with the main focus on releasing COQUI🐸TTS (Text-to-Speech) based high performing neural… ☆42 · Updated 2 years ago
- A Java toolkit to generate multi-font Arabic text images ☆11 · Updated 4 years ago
- ☆18 · Updated last year
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023 ☆58 · Updated 2 years ago
- ☆49 · Updated 3 years ago
- ☆141 · Updated last year
- Handwritten text recognition using transformers. ☆158 · Updated last year
- Arabic nested named entity recognition ☆42 · Updated 9 months ago
- Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task… ☆286 · Updated 2 years ago
- AIN - The First Arabic Inclusive Large Multimodal Model. It is a versatile bilingual LMM excelling in visual and contextual understanding… ☆49 · Updated 9 months ago
- ☆18 · Updated last year