BengaliAI / BADLADLinks
BADLAD: Bengali Document Layout Analysis Dataset
☆15Updated last year
Alternatives and similar repositories for BADLAD
Users that are interested in BADLAD are comparing it to the libraries listed below
Sorting:
- Bangla Unicode Normalization☆21Updated last year
- This python module is an easy-to-use port of the text normalization used in the paper "Not low-resource anymore: Aligner ensembling, batc…☆36Updated last year
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆14Updated last year
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆23Updated last year
- Transformer based Bangla Speech Recognition | Encoder Decoder Architecture☆57Updated 2 years ago
- ☆63Updated 6 months ago
- Resources and Tool for Bangla language computation☆14Updated 2 years ago
- Automatic Context Sensitive Spelling Correction for Bangla Text Using Bert and Levenstein Distance☆21Updated last year
- Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023.☆17Updated last year
- Bangla TTS Inference pipeline using Vit TTS☆13Updated last year
- This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Da…☆152Updated last year
- SemEval 2024 Task 1 : Textual Semantic Relatedness☆27Updated last year
- Context-Sensitive Neural Spelling Checker☆20Updated last year
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆98Updated 8 months ago
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆29Updated 2 years ago
- A PyPI package for fast word/character error rate (WER/CER) calculation☆71Updated 2 years ago
- Official Repository of the Deep Diacritization Paper☆17Updated 5 years ago
- ☆50Updated 3 years ago
- ☆11Updated 3 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆90Updated 6 months ago
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUI🐸TTS(Text-to-Speech) based high performing neural…☆42Updated 2 years ago
- This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 4…☆277Updated last year
- Python intefrace for evaluation on chatgpt models☆19Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆37Updated 2 years ago
- Arabic nested named entity recognition☆45Updated 10 months ago
- BNLP is a natural language processing toolkit for Bengali Language.☆308Updated 2 weeks ago
- ☆12Updated 5 years ago
- Pytorch implementation for paper 'BANNER: A Cost-Sensitive Contextualized Model for Bangla Named Entity Recognition'☆13Updated 5 years ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…☆14Updated 3 years ago