MarsPanther / crawl-for-parallel-corporaLinks
simple bs4 based web crawl for a corpus in need of statistical machine translation
☆13Updated 4 years ago
Alternatives and similar repositories for crawl-for-parallel-corpora
Users that are interested in crawl-for-parallel-corpora are comparing it to the libraries listed below
Sorting:
- Morphological analysis and generation of Amharic, Oromo, and Tigrinya☆11Updated 8 years ago
- A JavaScript-based converter for transliterating Amharic text into Latin characters☆19Updated 4 years ago
- The set of files used for the development of the Amharic Corpus.☆11Updated 8 years ago
- Best Practices in Translation Memory Management☆47Updated 7 years ago
- Benchmark Arabic text diacritization dataset☆77Updated 6 years ago
- Morphological processing for languages of the Horn of Africa☆54Updated last month
- ElixirFM Functional Arabic Morphology☆45Updated 2 years ago
- Automatic categorization of documents, consists in assigning a category to a text based on the information it contains. We'll follow diff…☆94Updated 7 years ago
- A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be…☆26Updated 11 years ago
- Morphological analysis for Udmurt.☆12Updated 2 months ago
- Arabic Stop Word List☆36Updated 2 years ago
- ☆30Updated 6 years ago
- Pre-process arabic text (remove diacritics, punctuations and repeating characters)☆107Updated 8 years ago
- Arabic edition of BERT pretrained language models☆132Updated 5 years ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.☆11Updated 5 years ago
- hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic☆47Updated 5 years ago
- ☆14Updated 3 years ago
- HORNMORPHO is a Python program that analyzes Amharic, Oromo, and Tigrinya words into their constituent morphemes (meaningful parts) and g…☆20Updated 8 years ago
- Arabic Open Domain Question Answering System using Neural Reading Comprehension☆164Updated 2 years ago
- Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic) using word2vec☆94Updated last year
- Rasa's retail starter pack☆42Updated 3 years ago
- Arabic support for textblob☆86Updated 4 years ago
- Youtube comments topics modeling and sentiment analyzer☆16Updated 3 years ago
- This is a diacritization model for Arabic language. This model was built/trained using the Tashkeela: the Arabic diacritization corpus on…☆45Updated 2 years ago
- Arabic NLP tool used to perform Text Search, POS tagging, Translation, auto-diacritization, etc..☆90Updated 4 years ago
- ☆40Updated 6 years ago
- TEAD : Large Scale Arabic Dataset for Sentiment Analysis☆12Updated 7 years ago
- AraT5: Text-to-Text Transformers for Arabic Language Understanding☆93Updated last year
- Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary t…☆35Updated 8 years ago
- Diacritization of Arabic texts☆11Updated 9 years ago