BengaliAI / BADLAD
BADLAD: Bengali Document Layout Analysis Dataset
☆12Updated last year
Alternatives and similar repositories for BADLAD
Users that are interested in BADLAD are comparing it to the libraries listed below
Sorting:
- Bangla Unicode Normalization☆19Updated 11 months ago
- This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batc…☆35Updated last year
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆12Updated 9 months ago
- Automatic Context Sensitive Spelling Correction for Bangla Text Using Bert and Levenstein Distance☆20Updated 5 months ago
- Resources and Tool for Bangla language computation☆14Updated 2 years ago
- Line and Word Segmentation for Bangla Handwritten Text Recognition☆14Updated last year
- Classification Benchmarks for Under-resourced Bengali Language based on Multichannel Convolutional-LSTM Network☆20Updated 3 years ago
- Dataset for Bangla named entity recognition☆7Updated 3 years ago
- Pytorch implementation for paper 'BANNER: A Cost-Sensitive Contextualized Model for Bangla Named Entity Recognition'☆14Updated 5 years ago
- ☆15Updated 6 years ago
- State of the Art Language models and Classifier for Bengali, which is primarily spoken by the Bengalis in South Asia.☆32Updated 4 years ago
- Bangla-Bert is a pretrained bert model for Bengali language☆78Updated 2 weeks ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆80Updated 2 years ago
- Sample implementation of OCR metrics (CER, WER) calculation with TesseractOCR and fastwer☆28Updated 3 years ago
- Into the depths of some concepts of Artificial Intelligence and Machine Learning☆10Updated last month
- The largest VQA dataset for Vietnamese. Related to the text content in the image.☆16Updated last month
- A curated list of Bangla NLP Corpus☆13Updated last year
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI🐸TTS(Text-to-Speech) based high performing neural…☆40Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆95Updated last year
- Context-Sensitive Neural Spelling Checker☆20Updated 7 months ago
- This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Summarization for…☆50Updated last year
- ☆17Updated 10 months ago
- CTC handwriting transcription model written in pytorch☆38Updated 3 years ago
- Machine Reading Comprehension special for the Vietnamese language☆40Updated 3 years ago
- Bengali transformer using transformers☆21Updated 2 weeks ago
- Repository for Multilingual-VQA task created during HuggingFace JAX/Flax community week.☆34Updated 3 years ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14Updated 3 years ago
- Minimalist BERT implementation assignment for CS11-711☆83Updated 2 years ago
- ☆17Updated last year
- This is the official implementation to the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume…☆24Updated 6 months ago