A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be made around whether or not to include all types of diacritics and characters or ignore them. Useful for NLP experiments where you may want to normalize text.
☆26Apr 3, 2014Updated 11 years ago
Alternatives and similar repositories for Buckwalter
Users that are interested in Buckwalter are comparing it to the libraries listed below
Sorting:
- ☆30Feb 1, 2020Updated 6 years ago
- Arabic flexionnal morphology generator☆35Aug 28, 2024Updated last year
- Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary t…☆35Apr 24, 2017Updated 8 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Convert Arabic diacritised text to a sequence of phonemes and create a pronunciation dictionary from them for alignment using HTK☆63Jun 14, 2017Updated 8 years ago
- ElixirFM Functional Arabic Morphology☆45Mar 15, 2023Updated 2 years ago
- TEAD : Large Scale Arabic Dataset for Sentiment Analysis☆12Oct 16, 2018Updated 7 years ago
- Experimenting with Sentiment Analysis in Arabic☆10Aug 31, 2014Updated 11 years ago
- This buckwalter2unicode script is designed to convert Arabic text that has been transliterated to ASCII symbols using the Buckwalter Tran…☆13Sep 30, 2012Updated 13 years ago
- The Arabic NLP Python Library (Archived in favor of Matn library)☆11Apr 28, 2017Updated 8 years ago
- Benchmark Arabic text diacritization dataset☆77Jul 26, 2019Updated 6 years ago
- Jabalín is an application for generating verbs in Modern Standard Arabic. The application is implemented in python language version 3. Th…☆12Jul 12, 2015Updated 10 years ago
- Dictionary app that allows you to look up Arabic words in transliteration☆63Feb 17, 2026Updated 2 weeks ago
- YaraSpell is an simplified arabic spell checker☆46Feb 20, 2017Updated 9 years ago
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications☆12Oct 27, 2021Updated 4 years ago
- ☆12May 21, 2020Updated 5 years ago
- repository for the project of building large arabic multidomain lexicon for sentiment analysis using feature selection from multiple reso…☆16Jan 21, 2015Updated 11 years ago
- Arabic support for textblob☆86Oct 21, 2021Updated 4 years ago
- This repository☆30Nov 13, 2022Updated 3 years ago
- WAZEN is an Arabic NLP text utility to find word variation pattern.☆15Sep 18, 2021Updated 4 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆15Apr 3, 2022Updated 3 years ago
- A platform to organize the work of charity in Algiers City☆16Oct 29, 2019Updated 6 years ago
- This is a diacritization model for Arabic language. This model was built/trained using the Tashkeela: the Arabic diacritization corpus on…☆45Sep 10, 2023Updated 2 years ago
- Extract dates from text☆66Jan 27, 2021Updated 5 years ago
- Nile University's Arabic sentiment Lexicon☆17Nov 24, 2016Updated 9 years ago
- Python transliteration library (mostly from non-latin scripts, such as Arabic, Japanese, etc.)☆20Dec 31, 2018Updated 7 years ago
- A community-driven Algerian index of reusable assets and libraries.☆30Nov 22, 2022Updated 3 years ago
- This is a repository of the Multi-dialect Arabic BERT model.☆38Jul 14, 2020Updated 5 years ago
- A command line version of Koja Stemmer (An Arabic rooting algorithm)☆21Apr 8, 2017Updated 8 years ago
- ☆43Aug 7, 2015Updated 10 years ago
- Arabic Dialect Identification on AOC data.☆24Mar 2, 2019Updated 7 years ago
- LABR: Large Scale Arabic Book Reviews Dataset☆46Nov 14, 2014Updated 11 years ago
- Sentiment Analysis for Arabic Text (tweets, reviews, and standard Arabic) using word2vec☆95Aug 20, 2024Updated last year
- hULMonA (حلمنا): tHe first Universal Language MOdel iN Arabic☆47Nov 16, 2020Updated 5 years ago
- Alfanous ( الفانوس ) is an Arabic search engine API provide the simple and advanced search in Quran , more features and many interfaces..…☆51Feb 16, 2025Updated last year
- JavaScript Arabic Stemmer☆26Dec 1, 2012Updated 13 years ago
- Collect tweets (tweets corpus) using Twitter API. Collection can be based on hashtags, keywords, geographical location☆25Nov 4, 2019Updated 6 years ago
- Assem's Arabic Light Stemmer is a snowball-based stemming algorithm for Arabic aimed mainly to improve search.☆149Feb 16, 2026Updated 2 weeks ago
- Mono-width companion to Amiri font family☆31Jul 29, 2025Updated 7 months ago