A curated collection of resources and repositories for Natural Language Processing (NLP) tasks specific to Darija, the Moroccan Arabic dialect. This repository aims to provide students and researchers with a comprehensive collection of tools, datasets, models, and code examples to facilitate Darija processing and analysis.
☆102Sep 27, 2023Updated 2 years ago
Alternatives and similar repositories for Arabic-Darija-NLP-Resources
Users who are interested in Arabic-Darija-NLP-Resources are comparing it to the libraries listed below.
- A list of Moroccan Darija Datasets grouped by name, data source, region and size.☆60May 10, 2024Updated last year
- ☆20Dec 28, 2025Updated 4 months ago
- Dive into the world of Arabic NLP with this extensive collection of resources, tools, datasets, and best practices tailored for the Arabi…☆60Oct 30, 2023Updated 2 years ago
- TODa: Tamazight Open Dataset☆19Jan 13, 2025Updated last year
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.☆12Feb 5, 2020Updated 6 years ago
- Certified robustness of deep neural networks☆19Aug 20, 2024Updated last year
- Dvoice is a speech recognition tool for dialects and under-represented languages.☆34Mar 19, 2022Updated 4 years ago
- Official code for PLoP☆20Mar 6, 2026Updated 2 months ago
- Python interface for evaluation on ChatGPT models☆19Feb 13, 2024Updated 2 years ago
- Resk is a robust Python library designed to enhance security and manage context when interacting with LLMs. It provides a protective …☆19Apr 13, 2026Updated 3 weeks ago
- 4-day AI hackathon in 1337 Benguerir, Morocco☆57Jun 13, 2025Updated 10 months ago
- An AI based solution to help people self diagnose their health issues. Based on GPT-3 Language Model☆18Oct 10, 2023Updated 2 years ago
- Machine Learning in Darija☆24Jul 10, 2020Updated 5 years ago
- Curated list of Moroccans publishing in the most prestigious AI conferences☆11Oct 14, 2024Updated last year
- Python package for Arabic natural language processing☆28Jun 12, 2019Updated 6 years ago
- Code and models for "The Interplay of Variant, Size, and Task Type in Arabic Pre-trained Language Models". EACL 2021, WANLP.☆57Jun 21, 2024Updated last year
- Synthetic Data Generation for Evaluation☆14Feb 21, 2025Updated last year
- ☆13Nov 22, 2022Updated 3 years ago
- A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.☆548Mar 5, 2026Updated 2 months ago
- A tool written in Go that helps you monitor a collection of websites using various metrics.☆11Nov 9, 2021Updated 4 years ago
- Mobily.ws SMS channel for Laravel notification system☆11Sep 6, 2018Updated 7 years ago
- ☆32Feb 3, 2026Updated 3 months ago
- Ray-casting game for wasting productive time.☆11May 23, 2021Updated 4 years ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆57Apr 9, 2023Updated 3 years ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆46Apr 3, 2025Updated last year
- Solutions to problems from the INSEA Game of Codes 2022 contest☆19May 26, 2022Updated 3 years ago
- A Moroccan utility library for working with CIN, phone numbers, currency, addresses, dates, and more.☆183Jul 10, 2025Updated 9 months ago
- A Notion changelog Next.js boilerplate☆14Jun 28, 2021Updated 4 years ago
- 👩💻 🇲🇦List of awesome Moroccan things for developers 🇲🇦👨🏻💻☆733Nov 6, 2025Updated 6 months ago
- Arabic-to-English machine translation with Transformers and PyTorch☆27Apr 1, 2026Updated last month
- Benchmarking Large Language Models☆105Jun 20, 2025Updated 10 months ago
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Dec 9, 2024Updated last year
- graphpatch is a library for activation patching on PyTorch neural network models.☆21Feb 11, 2025Updated last year
- End-to-End Arabic ASR using DeepSpeech engine☆14Nov 2, 2021Updated 4 years ago
- Jupyter notebook that contains the workflow for cleaning scraped HTML sites for NLP in Python☆10Sep 3, 2020Updated 5 years ago
- Awesome Darija Arabic NLP Resources☆20Apr 22, 2025Updated last year
- stateofdev.ma source code☆162Mar 1, 2026Updated 2 months ago
- A Next.js boilerplate for authentication☆18Mar 31, 2021Updated 5 years ago
- BoDmagh dataset is a Supervised Fine-Tuning (SFT) dataset for the Darija language☆21May 4, 2025Updated last year