☆55 · Jul 21, 2024 · Updated last year
Alternatives and similar repositories for Arabic_nlp_preprocessing
Users who are interested in Arabic_nlp_preprocessing compare it with the libraries listed below.
- ☆49 · Jul 21, 2024 · Updated last year
- ☆17 · Jan 27, 2025 · Updated last year
- ☆231 · Jul 13, 2024 · Updated last year
- ☆43 · Jul 27, 2024 · Updated last year
- Aranizer: A Custom Tokenizer based on SentencePiece and BPE tailored for Arabic Language Modeling ☆21 · Aug 4, 2024 · Updated last year
- ☆12 · May 11, 2024 · Updated last year
- ☆11 · May 24, 2024 · Updated last year
- A neural and statistical engine for accurately adding diacritics (Tashkeel) to Arabic text. First-place winner on Kaggle 🥇 ☆18 · May 29, 2025 · Updated 9 months ago
- ☆39 · Feb 1, 2025 · Updated last year
- LLM server: decentralized (local) LLMs that run privately on your computer; host all your AI locally ☆18 · Jul 16, 2024 · Updated last year
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions. ☆45 · Apr 3, 2025 · Updated 10 months ago
- Named Entity Recognition System for Arabic ☆20 · Nov 29, 2022 · Updated 3 years ago
- Explore the content of Arabic text datasets. ☆19 · May 23, 2022 · Updated 3 years ago
- ☆32 · Aug 8, 2025 · Updated 6 months ago
- A guide to help those interested in learning text processing for the Arabic language ☆50 · Apr 9, 2025 · Updated 10 months ago
- A central hub for translating stuff into Arabic (Join our Discord Server, if you want to help) ☆48 · Jan 12, 2026 · Updated last month
- ☆24 · Jun 21, 2024 · Updated last year
- Arabic Tokenization Library. It provides many tokenization algorithms. ☆110 · Jan 4, 2024 · Updated 2 years ago
- ☆42 · Aug 2, 2025 · Updated 7 months ago
- ☆32 · Oct 2, 2025 · Updated 5 months ago
- Zero-shot Transfer Learning from English to Arabic ☆30 · Jun 22, 2022 · Updated 3 years ago
- A production-ready FastAPI boilerplate application with a comprehensive set of features for modern web backend development. ☆146 · May 17, 2025 · Updated 9 months ago
- An open-source AI system enabling real time Quran recitation tracking, word-level alignment, error detection, and adaptive feedback. Desi… ☆103 · Feb 4, 2026 · Updated 3 weeks ago
- AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP researc… ☆417 · Apr 4, 2021 · Updated 4 years ago
- ☆12 · Sep 21, 2023 · Updated 2 years ago
- Arabic cleaning, normalization and segmentation library. ☆74 · Sep 28, 2023 · Updated 2 years ago
- Deep learning for AR text Vocalization - automatic diacritization of Arabic texts ☆349 · Mar 25, 2023 · Updated 2 years ago
- ☆80 · Aug 24, 2023 · Updated 2 years ago
- Arabic Stop Word List ☆36 · Jan 11, 2024 · Updated 2 years ago
- Repository of survey papers in Arabic language processing (أسبر). A Repository for survey and review papers in Arabic Natural Language processing (AN… ☆85 · Feb 22, 2026 · Updated last week
- Islandora Solr Search module ☆24 · Jul 28, 2025 · Updated 7 months ago
- LLM Building Blocks for Python Course ☆15 · Nov 17, 2025 · Updated 3 months ago
- A sample implementation of login/registration with Cognito in React ☆12 · Jun 23, 2023 · Updated 2 years ago
- Python script demonstrating the process of recovering text from embeddings, highlighting the associated privacy risks and mitigation stra… ☆19 · Nov 19, 2024 · Updated last year
- a blog starter project ☆11 · Oct 29, 2018 · Updated 7 years ago
- Quranic Lexical/Semantic Search ☆50 · Feb 28, 2025 · Updated last year
- ☆18 · Jun 25, 2025 · Updated 8 months ago
- This is the repo for CROssBARv2 Knowledge Graph data. CROssBARv2 is a heterogeneous general-purpose biomedical KG-based system. ☆11 · Feb 4, 2026 · Updated 3 weeks ago
- AraT5: Text-to-Text Transformers for Arabic Language Understanding ☆93 · May 16, 2024 · Updated last year