ye-kyaw-thu / toolsLinks
preprocessing and postediting tools especially for NLP (bash, perl, python)
☆17Updated 4 months ago
Alternatives and similar repositories for tools
Users that are interested in tools are comparing it to the libraries listed below
Sorting:
- myPOS (Myanmar Part-of-Speech) Corpus for Myanmar NLP Research and Developments☆78Updated 2 months ago
- Khmer language processing toolkit☆78Updated 2 years ago
- Word segmentation using Conditional Random Fields (CRF) for Khmer document☆32Updated last month
- ☆14Updated 6 years ago
- Some lecture materials of NLP Class at UTYCC☆31Updated 2 years ago
- Syllable segmentation tool for Myanmar language (Burmese) by Ye.☆62Updated last year
- Myanmar (Burmese) Language Grapheme to Phoneme (myG2P) Conversion Dictionary for speech recognition (ASR) and speech synthesis (TTS).☆55Updated 4 years ago
- Laphet: A tiny neural network language modeling library designed for students and educators.☆11Updated 9 months ago
- A PyPI package for fast word/character error rate (WER/CER) calculation☆72Updated 2 years ago
- This is a repository for PALM students at Royal University of Phnom Penh (2024). The materials for this course are adopted from https://i…☆17Updated 7 months ago
- မြန်မာစကားလုံးများ (Myanmar Words / Burmese Words).☆48Updated 2 years ago
- Automatic Context Sensitive Spelling Correction for Bangla Text Using Bert and Levenstein Distance☆21Updated last year
- Universal Dependency Tree for Myanmar Language☆10Updated 9 months ago
- A collection of preprocessed datasets and pretrained models for generating paraphrases.☆31Updated 4 years ago
- Myanmar Word Segmentation Tool☆32Updated 7 years ago
- Awesome Myanmar Projects and Resources☆119Updated 2 years ago
- khPOS (Khmer Part-of-Speech) Corpus for Khmer NLP Research and Developments☆34Updated last year
- A large collection of Khmer language resources. Khmer is a language used by Cambodia.☆154Updated last month
- Code for extracting parallel corpora from pmindia☆16Updated 5 years ago
- Official Repository of the Deep Diacritization Paper☆16Updated 4 years ago
- scipts for working with open.bible data☆26Updated 3 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆32Updated 4 years ago
- Automatic Speech Recognition for Indonesian☆18Updated 4 years ago
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆14Updated last year
- This is the official implementation to the EMNLP 2024 paper: Modeling Layout Reading Order as Ordering Relations for Visually-rich Docume…☆28Updated last year
- UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)☆63Updated last year
- Python library for Myanmar text processing☆73Updated 3 months ago
- An implementation of Tiling and Corruption (TACo) Augmentations for OCR/HTR☆15Updated 3 years ago
- Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION☆79Updated 2 years ago
- End-to-End Vietnamese Speech Recognition using wav2vec 2.0☆104Updated 4 years ago