urduhack / awesome-urduLinks
π A curated list of resources dedicated to Urdu language.
β74Updated 4 years ago
Alternatives and similar repositories for awesome-urdu
Users that are interested in awesome-urdu are comparing it to the libraries listed below
Sorting:
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.β73Updated last year
- An NLP library for the Urdu language. It comes with a lot of battery included features to help you process Urdu data in the easiest way pβ¦β306Updated 2 years ago
- This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Daβ¦β152Updated last year
- BNLP is a natural language processing toolkit for Bengali Language.β308Updated 2 weeks ago
- β64Updated 4 years ago
- Arabic edition of BERT pretrained language modelsβ132Updated 5 years ago
- Arabic Tokenization Library. It provides many tokenization algorithms.β110Updated 2 years ago
- AraT5: Text-to-Text Transformers for Arabic Language Understandingβ93Updated last year
- The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.β194Updated last week
- πA text file containing 150,000 Urdu words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion.β55Updated 5 years ago
- Arabic Open Domain Question Answering System using Neural Reading Comprehensionβ164Updated 2 years ago
- Arabic nested named entity recognitionβ45Updated 10 months ago
- Bangla Machine Translator based on seq2seq Architectureβ44Updated 3 years ago
- Compilation of Manually Tagged Roman Urdu Dataset (Urdu written in Latin/Roman Script), along with other helpful Roman Urdu NLP resourcesβ34Updated 5 years ago
- A collection of Bangla newspaper and blog crawlers. Can be used to mine bangla text data for Natural Language Processing tasks.β18Updated 3 years ago
- β30Updated 6 years ago
- A collaborative catalog of NLP resources for Indic languagesβ628Updated last year
- Sentiment Analysis in Arabic tweetsβ75Updated last month
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).β57Updated 2 years ago
- This repo contains Arabic OCR Appβ62Updated 3 years ago
- Pre-process arabic text (remove diacritics, punctuations and repeating characters)β107Updated 8 years ago
- π Complete collection of Urdu language characters & unicode code points.β40Updated 2 years ago
- A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.β522Updated 3 months ago
- Bangla-Bert is a pretrained bert model for Bengali languageβ82Updated 9 months ago
- β41Updated 4 years ago
- Pre-trained Transformers for Arabic Language Understanding and Generation (Arabic BERT, Arabic GPT2, Arabic ELECTRA)β709Updated 3 years ago
- Code and models for "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". EACL 2021, WANLP.β55Updated last year
- Arabic cleaning, normalization and segmentation library.β73Updated 2 years ago
- AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP researcβ¦β415Updated 4 years ago
- Transliteration models for 21 Indic languagesβ111Updated 2 years ago