Georeactor / alif-toolkit
Tools for splitting, normalizing, and text-shaping Arabic script
☆12Updated last year
Alternatives and similar repositories for alif-toolkit
Users who are interested in alif-toolkit compare it to the libraries listed below
- Arabic NLP tool used to perform text search, POS tagging, translation, auto-diacritization, etc.☆90Updated 5 years ago
- Yaziji : Arabic phrase generator☆17Updated last year
- ☆40Updated 6 years ago
- Lexical data at Unicode☆70Updated last year
- AQMAR Arabic Tagger: Sequence tagger with cost-augmented structured perceptron training☆43Updated 12 years ago
- Arabic Transliteration in Python☆36Updated 12 years ago
- A python library to extract basic shapes of font glyphs in a fundamenta☆14Updated 7 years ago
- Collection of various Arabic NLP and Text Processing Scripts and Utilities☆59Updated 12 years ago
- Qutuf (قُطُوْف): An Arabic Morphological analyzer and Part-Of-Speech tagger as an Expert System.☆132Updated 3 years ago
- Tashaphyne: Arabic Light Stemmer☆103Updated last year
- Arabic inflectional morphology generator☆35Updated last year
- A dataset for online Arabic calligraphy. A collection of 2500 annotated calligraphic styles.☆153Updated last year
- Dictionary app that allows you to look up Arabic words in transliteration☆61Updated 3 years ago
- Arabic Art using GANs☆17Updated 3 years ago
- YaraSpell is a simplified Arabic spell checker☆45Updated 8 years ago
- ElixirFM Functional Arabic Morphology☆45Updated 2 years ago
- This repo contains a set of Arabic newspaper articles along with metadata, extracted from various Saudi newspapers.☆71Updated 7 years ago
- Crawler for linguistic corpora☆213Updated 5 months ago
- Assem's Arabic Light Stemmer is a snowball-based stemming algorithm for Arabic aimed mainly to improve search.☆149Updated 3 years ago
- Deep learning for Arabic text vocalization (automatic diacritization of Arabic texts)☆347Updated 2 years ago
- Reconstruct Arabic sentences to be used in applications that don't support Arabic☆431Updated 8 months ago
- Helsinki Finite-State Technology (library and application suite)☆136Updated last month
- Arabic vocalized text corpus☆14Updated 11 years ago
- Arabic Stop Word List☆36Updated 2 years ago
- Instruction dataset for Arabic with 10,000 instruction and output pairs. CIDAR can be used to fine-tune LLMs to follow instructions.☆43Updated 10 months ago
- Neural Arabic text diacritization☆93Updated 2 years ago
- Open Source tool for Arabic text readability☆23Updated 7 months ago
- Arabic named entity recognition using AnerCorp corpus (location, organisation, person, miscellaneous)☆37Updated 8 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kannada, Malayalam, Urdu, Bengali)☆42Updated 3 years ago
- Shami Dialect Corpus (SDC)☆29Updated 7 years ago