Maha is a text processing library specially developed to deal with Arabic text.
☆216May 18, 2026Updated last week
Alternatives and similar repositories for Maha
Users that are interested in Maha are comparing it to the libraries listed below.
- Pre-trained Transformers for Arabic Language Understanding and Generation (Arabic BERT, Arabic GPT2, Arabic ELECTRA)☆719Oct 17, 2022Updated 3 years ago
- The largest public catalogue for Arabic NLP and speech datasets. It lists 500+ datasets annotated with more than 25 attributes.☆196Jan 30, 2026Updated 3 months ago
- Arabic NLP tool for text search, POS tagging, translation, auto-diacritization, etc.☆91Feb 7, 2021Updated 5 years ago
- Deep learning for Arabic text vocalization (automatic diacritization of Arabic texts)☆354Mar 25, 2023Updated 3 years ago
- Implementation of many Arabic NLP and CV projects. Providing real time experience using many interfaces like web, command line and notebo…☆421Mar 1, 2024Updated 2 years ago
- Qutuf (قُطُوْف): An Arabic Morphological analyzer and Part-Of-Speech tagger as an Expert System.☆137Dec 12, 2022Updated 3 years ago
- Arabic Tokenization Library. It provides many tokenization algorithms.☆111Jan 4, 2024Updated 2 years ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA).☆57Apr 9, 2023Updated 3 years ago
- Arabic Open Domain Question Answering System using Neural Reading Comprehension☆167Aug 4, 2023Updated 2 years ago
- Arabic NLP tools List inventory☆92Dec 17, 2022Updated 3 years ago
- Largest list of Arabic stop words on GitHub.☆333Mar 27, 2024Updated 2 years ago
- A French-Arabic dictionary aimed at first-year students in university technical tracks☆33Dec 10, 2014Updated 11 years ago
- Explore the content of Arabic text datasets.☆18May 23, 2022Updated 4 years ago
- A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.☆551Mar 5, 2026Updated 2 months ago
- Al-Faraheedy Project☆23Jun 20, 2024Updated last year
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆46Apr 3, 2025Updated last year
- Neural Arabic text diacritization☆97Mar 24, 2023Updated 3 years ago
- Annotated corpus of Arabic tweets that mention an act of violence.☆10Jun 6, 2018Updated 7 years ago
- A Python implementation of Farasa toolkit☆141Sep 11, 2025Updated 8 months ago
- Arabic speech recognition, classification and text-to-speech.☆428Sep 30, 2023Updated 2 years ago
- Arabic inflectional morphology generator☆35Aug 28, 2024Updated last year
- Naftawayh: Arabic word tagger☆13Aug 27, 2020Updated 5 years ago
- pyarabic☆485Jan 16, 2026Updated 4 months ago
- All 62,169 Hadith from the Nine Books, with and without tashkil (diacritics).☆323Apr 17, 2022Updated 4 years ago
- ☆16Aug 22, 2023Updated 2 years ago
- Arabic cleaning, normalization and segmentation library.☆76Sep 28, 2023Updated 2 years ago
- Arabic Stop Word List☆38Jan 11, 2024Updated 2 years ago
- A JavaScript library that replaces Latin characters with Arabic characters while typing (and vice versa), with a flexible API☆41Oct 22, 2019Updated 6 years ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Jun 23, 2024Updated last year
- Python interface for evaluation on ChatGPT models☆19Feb 13, 2024Updated 2 years ago
- This is the official repository for Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks.☆26Dec 9, 2024Updated last year
- ArWordVec is a collection of pre-trained word embedding models built from a huge repository of Arabic tweets on different topics. The aim of…☆19Jul 9, 2020Updated 5 years ago
- A curated list of awesome projects and dev/design resources for supporting Arabic computational needs.☆551Apr 25, 2026Updated last month
- hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic☆47Nov 16, 2020Updated 5 years ago
- A collection of projects, especially open-source ones☆331Apr 24, 2024Updated 2 years ago
- Platform for Arabic Poetry Analysis using knowledge-based and deep learning approaches.☆37Jan 3, 2023Updated 3 years ago
- Arabic nested named entity recognition☆46Mar 10, 2025Updated last year
- This model detects Arabic fonts (Naskh نسخ, Ruq'ah رقعة) given a picture of the text. Live demo: https://calbot.hawzen.me/☆18May 27, 2023Updated 2 years ago
- Generate an Arabic gold-standard corpus for morphology and stemming☆12Jan 12, 2023Updated 3 years ago