ye-kyaw-thu / toolsLinks
preprocessing and postediting tools especially for NLP (bash, perl, python)
☆17Updated 5 months ago
Alternatives and similar repositories for tools
Users that are interested in tools are comparing it to the libraries listed below
Sorting:
- myPOS (Myanmar Part-of-Speech) Corpus for Myanmar NLP Research and Developments☆78Updated 3 months ago
- ☆14Updated 6 years ago
- Some lecture materials of NLP Class at UTYCC☆31Updated 2 years ago
- Word segmentation using Conditional Random Fields (CRF) for Khmer document☆32Updated 3 months ago
- Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).☆57Updated 4 years ago
- Laphet: A tiny neural network language modeling library designed for students and educators.☆11Updated 11 months ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆104Updated 4 years ago
- A curated list of research papers and resources on Indonesian languages☆40Updated last year
- Syllable segmentation tool for Myanmar language (Burmese) by Ye.☆63Updated last year
- This is a repository for PALM students at Royal University of Phnom Penh (2024). The materials for this course are adopted from https://i…☆18Updated 8 months ago
- Myanmar Word Segmentation Tool☆32Updated 7 years ago
- This repo builds an end-to-end deep learning application that supports speech recognition system. It's simple to use and understand☆38Updated 2 years ago
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)☆62Updated last year
- Awesome Myanmar Projects and Resources☆119Updated 2 years ago
- Universal Dependency Tree for Myanmar Language☆10Updated 11 months ago
- Automatic Speech Recognition for Indonesian☆18Updated 4 years ago
- Various experimental NLP tasks for Khmer language☆34Updated 5 years ago
- A PyPI package for fast word/character error rate (WER/CER) calculation☆71Updated 2 years ago
- Official Repository of the Deep Diacritization Paper☆17Updated 5 years ago
- Quora Paraphrasing Dataset Bahasa Indonesia Version☆11Updated 4 years ago
- Solution for Zalo AI Challenge 2022 - Lyrics Alignment☆68Updated 3 years ago
- Automatic Context Sensitive Spelling Correction for Bangla Text Using Bert and Levenstein Distance☆21Updated last year
- Transformer OCR is a Optical Character Recognition tookit built for researchers working on both OCR for both Vietnamese and English. This…☆10Updated 4 years ago
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆97Updated 7 months ago
- Cross-lingual learning in scene text recognition (ICASSP2024)☆18Updated last year
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- Pre-trained Mongolian BERT models☆49Updated 4 years ago
- Vietnamese Text Dataset - Wikipedia vi 2018☆14Updated 6 years ago
- syllable, word and phrase segmenter for Burmese (Myanmar language)☆63Updated 4 years ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆79Updated 2 years ago