nainiayoub / moroccan-darija-datasets
A list of Moroccan Darija Datasets grouped by name, data source, region and size.
☆55Updated last year
Alternatives and similar repositories for moroccan-darija-datasets
Users who are interested in moroccan-darija-datasets are comparing it to the libraries listed below.
- A curated collection of resources and repositories for Natural Language Processing (NLP) tasks specific to Darija, the Moroccan Arabic di…☆95Updated 2 years ago
- Darija <-> English dataset☆354Updated 2 months ago
- ☆74Updated 2 years ago
- Dvoice is a speech recognition tool for dialects and under-represented languages.☆34Updated 3 years ago
- ☆19Updated 4 months ago
- TODa: Tamazight Open Dataset☆16Updated 11 months ago
- Awesome Darija Arabic NLP Resources☆19Updated 7 months ago
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Updated last year
- ☆186Updated last week
- ☆56Updated last year
- A Python library to easily visualize geospatial data of Morocco.☆32Updated 2 years ago
- ☆26Updated last year
- A Discord bot that pulls the latest or most relevant research papers from arxiv.org☆19Updated 3 years ago
- 4-day AI hackathon in 1337 Benguerir, Morocco☆51Updated 6 months ago
- The largest public catalogue for Arabic NLP and speech datasets. There are more than 500 datasets, annotated with more than 25 attributes.☆189Updated last week
- A semantic search app for searching over Quran tafsir books☆24Updated last year
- Moroccan housing data pipeline using Scrapy, MongoDB, Zyte and DigitalOcean cloud☆11Updated 3 years ago
- ☆15Updated 3 years ago
- Python interface for evaluation on ChatGPT models☆19Updated last year
- Arabic Tokenization Library. It provides many tokenization algorithms.☆110Updated last year
- Code for Arabic Nougat☆49Updated last year
- ☆24Updated 3 years ago
- AraT5: Text-to-Text Transformers for Arabic Language Understanding☆93Updated last year
- Practical LangChain tutorials for LLM applications development☆200Updated 2 months ago
- Quran, Hadith, Translations, Tafaseer, Corpus Linguistics. Everything for NLP☆112Updated last year
- Public repository for the datasets used in the Explore Data Science Academy☆45Updated 2 weeks ago
- Arabic nested named entity recognition☆42Updated 9 months ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆57Updated 2 years ago
- An AI-based solution to help people self-diagnose their health issues. Based on the GPT-3 language model☆18Updated 2 years ago
- Large Language Models: this repository introduces language models, covering both theoretical and practical aspects.☆392Updated 2 months ago