Automatic categorization of documents, consists in assigning a category to a text based on the information it contains. We'll follow different approach of Supervised Machine Learning.
☆94Jan 1, 2019Updated 7 years ago
Alternatives and similar repositories for Arabic-News-Article-Classification
Users that are interested in Arabic-News-Article-Classification are comparing it to the libraries listed below
Sorting:
- Automatic Text Summarization (English/Arabic).☆39Jul 1, 2018Updated 7 years ago
- Arabic text documents classified using SVM, k-nn and Naive bayes classifers.☆12May 10, 2020Updated 5 years ago
- Applied Data Science training course (for updates and resources, read the ReadMe file below)☆15Sep 9, 2023Updated 2 years ago
- Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic) using word2vec☆95Aug 20, 2024Updated last year
- Largest list of Arabic stop words on Github. أكبر قائمة لمستبعدات الفهرسة العربية على جيت هاب☆329Mar 27, 2024Updated last year
- Arabic Open Domain Question Answering System using Neural Reading Comprehension☆165Aug 4, 2023Updated 2 years ago
- Arabic NLP tool used to perform Text Search, POS tagging, Translation, auto-diacritization, etc..☆90Feb 7, 2021Updated 5 years ago
- " Le but de ce projet est de vous permettre d’apprécier de façon pratique la puissance des techniques de Traitement Automatique du Langag…☆11Jun 16, 2019Updated 6 years ago
- Arabic named entity recognition using AnerCorp corpus (location , organisation, person, Miscellaneous Word)☆37Jul 28, 2017Updated 8 years ago
- Deep learning for AR text Vocalization - التشكيل الالي للنصوص العربية☆349Mar 25, 2023Updated 2 years ago
- Arabic Dialectal Offensive Language dataset from social media comments on news post from facebook, twitter and youtube platforms☆18Sep 25, 2020Updated 5 years ago
- This repository contains my attempt to use two famous speech recognition frameworks (Kaldi, CMU Sphinx4) for Arabic Language using the pu…☆30Jan 15, 2020Updated 6 years ago
- A Python implementation of Farasa toolkit☆138Sep 11, 2025Updated 5 months ago
- Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary t…☆35Apr 24, 2017Updated 8 years ago
- AQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training☆43Aug 28, 2013Updated 12 years ago
- All resources created and used in Arabic Sentiment Analysis of Arabic Tweets. Includes Sentiment lexicon generated from Arabic tweets and…☆14Dec 21, 2021Updated 4 years ago
- AraVec is a pre-trained distributed word representation (word embedding) open source project which aims to provide the Arabic NLP researc…☆417Apr 4, 2021Updated 4 years ago
- Youtube comments topics modeling and sentiment analyzer☆16Oct 25, 2022Updated 3 years ago
- A suite of Arabic natural language processing tools developed by the CAMeL Lab at New York University Abu Dhabi.☆532Feb 11, 2026Updated 3 weeks ago
- ☆32Aug 27, 2018Updated 7 years ago
- Several deep learning models for restoring Arabic diacritics using Pytorch.☆36Apr 14, 2022Updated 3 years ago
- Arabic edition of BERT pretrained language models☆133Dec 5, 2020Updated 5 years ago
- ☆40Apr 20, 2019Updated 6 years ago
- Arabic Word-Embedding (Word2vec) model training from Wikipedia articles☆11Dec 13, 2018Updated 7 years ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆45Apr 3, 2025Updated 11 months ago
- Arabic poetry analysis and generation.☆23Jul 23, 2023Updated 2 years ago
- Radix Primitives Cheatsheet☆12Mar 11, 2022Updated 3 years ago
- This is a repository of the Multi-dialect Arabic BERT model.☆38Jul 14, 2020Updated 5 years ago
- Pre-trained Transformers for Arabic Language Understanding and Generation (Arabic BERT, Arabic GPT2, Arabic ELECTRA)☆711Oct 17, 2022Updated 3 years ago
- A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be…☆26Apr 3, 2014Updated 11 years ago
- ☆43Aug 7, 2015Updated 10 years ago
- pyarabic☆478Jan 16, 2026Updated last month
- Reconstruct Arabic sentences to be used in applications that don't support Arabic☆432May 12, 2025Updated 9 months ago
- Extract plain text from Arabic Wikipedia dumps.☆13Jun 15, 2014Updated 11 years ago
- Generate arabic golden standard corpus for morphology and stemming☆12Jan 12, 2023Updated 3 years ago
- Experimenting with Sentiment Analysis in Arabic☆10Aug 31, 2014Updated 11 years ago
- hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic☆47Nov 16, 2020Updated 5 years ago
- Arabic Tokenization Library. It provides many tokenization algorithms.☆110Jan 4, 2024Updated 2 years ago
- Maha is a text processing library specially developed to deal with Arabic text.☆213Updated this week