zeerakahmed / makhzan
An Urdu text corpus
β59Updated 9 months ago
Related projects: β
- πA text file containing 150,000 Urdu words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion.β42Updated 3 years ago
- An NLP library for the Urdu language. It comes with a lot of battery included features to help you process Urdu data in the easiest way pβ¦β279Updated 8 months ago
- π A curated list of resources dedicated to Urdu language.β60Updated 3 years ago
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.β67Updated last month
- Compilation of Manually Tagged Roman Urdu Dataset (Urdu written in Latin/Roman Script), along with other helpful Roman Urdu NLP resourcesβ31Updated 3 years ago
- π Complete collection of Urdu language characters & unicode code points.β39Updated last year
- Pre-processing and training scripts for the Tarteel Datasetβ181Updated 2 years ago
- Urdu Text Line OCRβ25Updated last year
- The Quranic Arabic Corpus, an invaluable linguistic resource, is due for a revamp. We're calling on Linguistics, AI, and Tech volunteersβ¦β75Updated last year
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the dataseβ¦β186Updated 4 years ago
- Dataset for Urdu Ghazalsβ12Updated last year
- Implementation of many Arabic NLP and CV projects. Providing real time experience using many interfaces like web, command line and noteboβ¦β387Updated 6 months ago
- Benchmark Arabic text diacritization datasetβ70Updated 5 years ago
- A TensorFlow implementation of Baidu's DeepSpeech architectureβ84Updated last month
- A dataset for online Arabic calligraphy. A collection of 2500 annotated calligraphic styles.β144Updated 2 months ago
- β14Updated this week
- Large scale font independent printed Urdu text data setβ49Updated 4 years ago
- A curated list of awesome projects and dev/design resources for supporting Arabic computational needs.β486Updated 2 weeks ago
- Machine Translation for Africaβ272Updated 2 years ago
- A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.β403Updated 3 weeks ago
- β28Updated this week
- Madina OpenType variable fontβ14Updated 2 weeks ago
- BRAD: Books Reviews in Arabic Datasetβ12Updated 6 years ago
- Arabic cleaning, normalization and segmentation library.β58Updated 11 months ago
- Arabic Open Domain Question Answering System using Neural Reading Comprehensionβ159Updated last year
- Neural Arabic text diacritizationβ82Updated last year
- Arabic Tokenization Library. It provides many tokenization algorithms.β85Updated 8 months ago
- Leeds University and King Saud University (LK) Hadith Corpusβ57Updated last year
- Wordlists for Arabicβ51Updated 6 years ago
- Our submission for quran QA shared-task. Fortunately, this work achieved the first place among accepted papers.β17Updated 6 months ago