mehedihasanbijoy / DPCSpell
[Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages
☆10 Updated 5 months ago
Alternatives and similar repositories for DPCSpell:
Users who are interested in DPCSpell are comparing it to the libraries listed below.
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated 11 months ago
- asr2k ☆48 Updated 7 months ago
- Repository containing the open source code of works published at the FBK MT unit. ☆42 Updated last week
- Code for the method proposed in the paper: ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres… ☆19 Updated 10 months ago
- Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023. ☆15 Updated 4 months ago
- LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models ☆23 Updated 5 months ago
- A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation (INTERSPEECH 2022) ☆20 Updated 5 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to… ☆42 Updated 3 years ago
- Unsupervised spoken sentence embeddings ☆14 Updated 2 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp… ☆12 Updated last year
- Final training script from the HuggingFace Whisper fine-tuning event, to get the best results on a fine-tuned model. ☆12 Updated 2 years ago
- Automatic Context Sensitive Spelling Correction for Bangla Text Using BERT and Levenshtein Distance ☆20 Updated 2 months ago
- Library for pruning experts per language pair in NLLB-200 ☆31 Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages ☆13 Updated 2 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 Updated 4 years ago
- Repository with the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini… ☆11 Updated 10 months ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent ☆71 Updated 3 years ago
- Explore different ways to mix speech models (wav2vec2, HuBERT) and NLP models (BART, T5, GPT) together ☆44 Updated last year
- ASCEND Chinese-English code-switching dataset ☆23 Updated 2 years ago
- NTREX -- News Test References for MT Evaluation ☆80 Updated 7 months ago