urduhack / awesome-urdu
š A curated list of resources dedicated to Urdu language.
ā60Updated 3 years ago
Related projects: ā
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.ā67Updated last month
- An NLP library for the Urdu language. It comes with a lot of battery included features to help you process Urdu data in the easiest way pā¦ā279Updated 8 months ago
- Compilation of Manually Tagged Roman Urdu Dataset (Urdu written in Latin/Roman Script), along with other helpful Roman Urdu NLP resourcesā31Updated 3 years ago
- šA text file containing 150,000 Urdu words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion.ā42Updated 3 years ago
- š Complete collection of Urdu language characters & unicode code points.ā39Updated last year
- ā12Updated 4 years ago
- Large scale font independent printed Urdu text data setā49Updated 4 years ago
- hULMonA (ŲŁŁ ŁŲ§)ā: tHe first Universal Language MOdel iN Arabicā46Updated 3 years ago
- BRAD: Books Reviews in Arabic Datasetā12Updated 6 years ago
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNsā15Updated 2 months ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).ā51Updated last year
- Arabic edition of BERT pretrained language modelsā126Updated 3 years ago
- Arabic Open Domain Question Answering System using Neural Reading Comprehensionā159Updated last year
- AraT5: Text-to-Text Transformers for Arabic Language Understandingā83Updated 4 months ago
- ā50Updated 2 years ago
- ā28Updated 4 years ago
- Arabic nested named entity recognitionā31Updated 4 months ago
- Arabic named entity recognition using AnerCorp corpus (location , organisation, person, Miscellaneous Word)ā37Updated 7 years ago
- An Urdu text corpusā59Updated 9 months ago
- A deep learning model to classify the Arabic letters and digits easily.ā56Updated 4 years ago
- Pre-process arabic text (remove diacritics, punctuations and repeating characters)ā105Updated 7 years ago
- Description Describes the IndicNLP corpus and associated datasetsā150Updated last year
- Benchmark Arabic text diacritization datasetā70Updated 5 years ago
- Neural Arabic text diacritizationā82Updated last year
- Arabic Tokenization Library. It provides many tokenization algorithms.ā85Updated 8 months ago
- Hotels Arabic-Reviews Datasetā31Updated 5 years ago
- A Python implementation of Farasa toolkitā110Updated last week
- Urdu Text Line OCRā25Updated last year
- UBC ARBERT and MARBERT Deep Bidirectional Transformers for Arabicā100Updated 3 years ago
- This repo contains Arabic OCR Appā52Updated 2 years ago