amir9ume / urdu_ghazals_rekhta
Dataset for Urdu Ghazals
β14Updated last year
Alternatives and similar repositories for urdu_ghazals_rekhta
Users that are interested in urdu_ghazals_rekhta are comparing it to the libraries listed below
Sorting:
- π A curated list of resources dedicated to Urdu language.β68Updated 4 years ago
- Compilation of Manually Tagged Roman Urdu Dataset (Urdu written in Latin/Roman Script), along with other helpful Roman Urdu NLP resourcesβ32Updated 4 years ago
- π Complete collection of Urdu language characters & unicode code points.β40Updated 2 years ago
- Collection of Urdu datasets for POS, NER, Sentiment, Summarization and NLP tasks.β72Updated 9 months ago
- πA text file containing 150,000 Urdu words for all your dictionary/word-based projects e.g: auto-completion / autosuggestion.β46Updated 4 years ago
- An NLP library for the Urdu language. It comes with a lot of battery included features to help you process Urdu data in the easiest way pβ¦β298Updated last year
- State of the Art Language models and Classifier for Bengali, which is primarily spoken by the Bengalis in South Asia.β32Updated 4 years ago
- β49Updated 5 years ago
- Large scale font independent printed Urdu text data setβ51Updated 5 years ago
- Bengali NLPβ32Updated 6 years ago
- Quran, Hadith, Translations, Tafaseer, Corpus Linguistics. Everything for NLPβ95Updated last year
- Repository dedicated to a collection of resources and helping material for Urdu language Processing related tasksβ20Updated 5 years ago
- An Urdu text corpusβ71Updated last year
- A blueprint for creating Pretraining and Fine-Tuning datasets for Indic languagesβ106Updated 7 months ago
- State of the Art Language models and Classifier for Malayalam, which is spoken by the Malayali people in the Indian state of Kerala and tβ¦β38Updated 4 years ago
- A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanagaβ¦β36Updated last year
- Codebase for Indic-Transliteration using Seq2Seq RNN. For latest repo with Transformer-based models, check: https://github.com/AI4Bharat/β¦β60Updated 3 years ago
- Pretraining, fine-tuning and evaluation scripts for IndicBERT-v2 and IndicXTREMEβ94Updated last month
- Data Inspector is an open-source python library that brings 15++ types of different functions to make EDA, data cleaning easier.β39Updated last month
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the dataseβ¦β195Updated 4 years ago
- Context-Based-Question-Answeringβ43Updated 9 months ago
- β30Updated last year
- This app is built using Python 3.9+, Flask 2.0+, and Pinecone. It performs a similarity search using the Pinecone SDK to find articles thβ¦β23Updated 3 years ago
- Repository for Synthetic datasets I'm creatingβ37Updated 4 years ago
- Bangla-Bert is a pretrained bert model for Bengali languageβ78Updated 2 weeks ago
- Description Describes the IndicNLP corpus and associated datasetsβ172Updated 2 years ago
- State of the Art Language models and Classifier for Tamil language (spoken in India, and few other South Asian countries)β53Updated 4 years ago
- Data Science, Machine Learning, and Deep Learning. Projects, Tutorials and Cheatsheets.β176Updated 3 months ago
- AraT5: Text-to-Text Transformers for Arabic Language Understandingβ90Updated last year
- Tutorial on creating a spelling correction Python application using Gingerit and Streamlitβ16Updated 3 years ago