azmat21 / UyghurWebsiteCrawler
Simple crawler for some Uyghur websites, such as uy.ts.cn, bbs.bagdax.cn, and www.bagdax.cn (using Python and Scrapy)
☆12Updated 4 years ago
Alternatives and similar repositories for UyghurWebsiteCrawler:
Users who are interested in UyghurWebsiteCrawler are comparing it to the repositories listed below.
- Make n-grams for the Uyghur language☆14Updated 4 years ago
- ☆11Updated 9 years ago
- Uyghur Single Speaker Speech Dataset. ウイグル語音声データセット☆22Updated 2 years ago
- Uyghur text resources crawled from websites☆12Updated 9 years ago
- This converter converts multiple Uyghur scripts: ULS (Uyghur Latin Script), UAS (Uyghur Arabic Script), CTS (Common Turkic Script), UCS (Uyg…☆47Updated 3 months ago
- An example of trying out Uyghur script with the word2vec algorithm in Python's gensim library.☆16Updated 3 years ago
- Chrome extension for the Uyghur language☆9Updated 4 months ago
- Awesome Open Source Projects in Uyghur Language☆36Updated 6 years ago
- An Uyghur spell checking library written in .NET☆15Updated 11 years ago
- Simple Uyghur TTS with PyAudio☆14Updated 7 years ago
- Simple Uyghur OCR with Tesseract☆25Updated 8 months ago
- Speech Recognition for Uyghur using deep learning☆32Updated 3 years ago
- Uyghur Word List☆42Updated 8 years ago
- Collection of resources for Uyghur linguistics.☆15Updated 9 years ago
- Collection of fonts for Uyghur Arabic script☆11Updated 5 years ago
- Speech Recognition for Uyghur using Speech transformer☆23Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆16Updated 2 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 4 months ago
- Multilingual grapheme-to-phoneme conversion☆20Updated 6 years ago
- ☆9Updated 3 years ago
- A semi-supervised sequence-to-sequence ASR☆10Updated 2 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15Updated 4 years ago
- Open-source Mandarin biased word dataset☆11Updated last year
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 3 years ago
- Mirror of SRILM☆54Updated 4 years ago
- A simple word-level tokenizing library and tool for the Uyghur language | ئۇيغۇرچە سۆز سۈزۈش كودى ۋە قۇرالى☆20Updated 10 years ago
- An audio and transcribed corpus of contemporary Hong Kong Cantonese☆35Updated 4 years ago
- Pronounce Arabic words☆18Updated 5 years ago
- Text Editor with Spell Check Ability for Uyghur☆55Updated 3 weeks ago