azmat21 / UyghurWebsiteCrawler
simple crawler for some uyghur website such as uy.ts.cn,bbs.bagdax.cn,www.bagdax.cn(using python and scrapy)
☆12Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for UyghurWebsiteCrawler
- Make N-Gram for Uyghur language☆14Updated 3 years ago
- Uyghur Single Speaker Speech Dataset. ウイグル語音声データセット☆22Updated 2 years ago
- uyghur text resource crawled from website☆12Updated 8 years ago
- ☆11Updated 9 years ago
- This converter converts multiple Uyghur scripts: ULS(Uyghur Latin Script), UAS(Uyghur Arabic Script), CTS(Common Turkick Scritp), UCS(Uyg…☆48Updated last month
- Bu Uyghur yéziqini Pythonning gensim ambiridiki word2vec algorizimida sinap baqqan misal.☆16Updated 2 years ago
- simple uyghur tts with pyaudio☆13Updated 6 years ago
- Collection of resources for Uyghur linguistics.☆15Updated 8 years ago
- Awesome Open Source Projects in Uyghur Language☆36Updated 6 years ago
- Speech Recognition for Uyghur using deep learning☆30Updated 3 years ago
- ☆22Updated 2 years ago
- Speech Recognition for Uyghur using Speech transformer☆20Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆15Updated last year
- phone inventory library☆15Updated last year
- Simple Kaldi recipe for forced alignment☆10Updated last year
- ☆9Updated 3 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Updated 5 years ago
- Simple Uyghur OCR with Tesseract☆25Updated 6 months ago
- Pronounce Arabic words☆18Updated 5 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 3 months ago
- Easier analysis of large speech corpora☆21Updated 3 years ago
- An Uyghur spell checking library written in .NET☆15Updated 10 years ago
- ☆32Updated 2 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆37Updated last year
- Data preparation code for building Kaldi ASR system☆14Updated 7 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆33Updated 2 years ago
- ☆16Updated 4 years ago
- Read, write, and manipulate Praat TextGrid files with Python☆126Updated 11 months ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆35Updated 3 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 2 months ago