Arabic cleaning, normalization and segmentation library.
☆74 · Sep 28, 2023 · Updated 2 years ago
Alternatives and similar repositories for tnkeeh
Users who are interested in tnkeeh also compare it to the libraries listed below.
- Arabic Tokenization Library. It provides many tokenization algorithms. ☆110 · Jan 4, 2024 · Updated 2 years ago
- A simple strategy for training and finetuning NLP models for Arabic. Specify the parameters and just wait for the results. A simple desig… ☆21 · Jan 27, 2024 · Updated 2 years ago
- Implementation of many Arabic NLP and CV projects. Providing real time experience using many interfaces like web, command line and notebo… ☆421 · Mar 1, 2024 · Updated 2 years ago
- Python interface for evaluation on ChatGPT models ☆19 · Feb 13, 2024 · Updated 2 years ago
- TURJUMAN, a neural toolkit for translating from 20 languages into Modern Standard Arabic (MSA). ☆56 · Apr 9, 2023 · Updated 2 years ago
- Time Series Explanation in Arabic ☆13 · May 24, 2022 · Updated 3 years ago
- AraT5: Text-to-Text Transformers for Arabic Language Understanding ☆93 · May 16, 2024 · Updated last year
- Pre-trained Transformers for Arabic Language Understanding and Generation (Arabic BERT, Arabic GPT2, Arabic ELECTRA) ☆711 · Oct 17, 2022 · Updated 3 years ago
- pyarabic ☆478 · Jan 16, 2026 · Updated last month
- A Python implementation of Farasa toolkit ☆137 · Sep 11, 2025 · Updated 5 months ago
- Arabic edition of BERT pretrained language models ☆133 · Dec 5, 2020 · Updated 5 years ago
- Arabic speech recognition, classification and text-to-speech. ☆424 · Sep 30, 2023 · Updated 2 years ago
- A French-Arabic dictionary aimed at first-year students in university technical programs ☆33 · Dec 10, 2014 · Updated 11 years ago
- Maha is a text processing library specially developed to deal with Arabic text. ☆213 · Feb 23, 2026 · Updated last week
- Manim Library Tutorial for Math, Physics & Chemistry. ☆10 · Oct 7, 2021 · Updated 4 years ago
- This is a diacritization model for Arabic language. This model was built/trained using the Tashkeela: the Arabic diacritization corpus on… ☆45 · Sep 10, 2023 · Updated 2 years ago
- Ideas for a project to digitize books through distributed effort ☆20 · Dec 18, 2021 · Updated 4 years ago
- Al-Faraheedy Project ☆23 · Jun 20, 2024 · Updated last year
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions. ☆45 · Apr 3, 2025 · Updated 11 months ago
- Deep learning for Arabic text vocalization (automatic diacritization of Arabic texts) ☆349 · Mar 25, 2023 · Updated 2 years ago
- Neural Arabic text diacritization ☆94 · Mar 24, 2023 · Updated 2 years ago
- AyaSpell Arabic Dictionary for Hunspell Spellchecker ☆45 · Aug 27, 2020 · Updated 5 years ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams. ☆11 · Feb 5, 2020 · Updated 6 years ago
- A tiny wrapper for Arabic WordCloud plots ☆10 · May 24, 2020 · Updated 5 years ago
- A python client for Belqis System ☆43 · Feb 28, 2023 · Updated 3 years ago
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation. ☆11 · Oct 28, 2022 · Updated 3 years ago
- Material for the Text Analysis of Arabic course taught at the NYU Abu Dhabi Winter Institute in Digital Humanities 2020. ☆15 · Jan 30, 2020 · Updated 6 years ago
- A dataset for online Arabic calligraphy. A collection of 2500 annotated calligraphic styles. ☆154 · Jun 24, 2024 · Updated last year
- Benchmark Arabic text diacritization dataset ☆77 · Jul 26, 2019 · Updated 6 years ago
- Pre-process Arabic text (remove diacritics, punctuation and repeating characters) ☆107 · Apr 8, 2017 · Updated 8 years ago
- Arabic deep-learning based diacritization models (Shakkala, Shakkelha) ported to PyTorch ☆14 · May 30, 2023 · Updated 2 years ago
- Content writing style guide for websites, mobile applications and dashboards ☆17 · Jan 31, 2026 · Updated last month
- A Python package that performs stemming, tokenization, sentence breaking, segmentation, normalization and POS tagging for the Arabic language. ☆28 · Mar 20, 2021 · Updated 4 years ago
- Python library used for Arabic NLP to process, prepare and clean the Arabic text ☆16 · Jul 6, 2024 · Updated last year
- Adawat: Arabic Text tools ☆54 · Aug 27, 2020 · Updated 5 years ago
- ☆30 · Feb 1, 2020 · Updated 6 years ago
- Seq2Seq-based open domain empathetic conversational model for Arabic: Dataset & Model ☆59 · Feb 25, 2025 · Updated last year
- This model detects Arabic fonts (Naskh, Ruq'ah) given a picture of the text. Live: https://calbot.hawzen.me/ ☆16 · May 27, 2023 · Updated 2 years ago
- Saudi National Address API PHP Library with Laravel support. ☆18 · Oct 16, 2021 · Updated 4 years ago