A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be made around whether or not to include all types of diacritics and characters or ignore them. Useful for NLP experiments where you may want to normalize text.
☆26Apr 3, 2014Updated 12 years ago
Alternatives and similar repositories for Buckwalter
Users that are interested in Buckwalter are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Arabic vocalized text corpus☆14Jan 2, 2015Updated 11 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary t…☆35Apr 24, 2017Updated 9 years ago
- All resources created and used in Arabic Sentiment Analysis of Arabic Tweets. Includes Sentiment lexicon generated from Arabic tweets and…☆14Dec 21, 2021Updated 4 years ago
- Arabic flexionnal morphology generator☆35Aug 28, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆30Feb 1, 2020Updated 6 years ago
- A desktop version of Edward Lane's Arabic-English Lexicon☆21Apr 21, 2018Updated 8 years ago
- repository for the project of building large arabic multidomain lexicon for sentiment analysis using feature selection from multiple reso…☆16Jan 21, 2015Updated 11 years ago
- Experimenting with Sentiment Analysis in Arabic☆10Aug 31, 2014Updated 11 years ago
- TEAD : Large Scale Arabic Dataset for Sentiment Analysis☆12Oct 16, 2018Updated 7 years ago
- Dictionary app that allows you to look up Arabic words in transliteration☆62Feb 17, 2026Updated 3 months ago
- ElixirFM Functional Arabic Morphology☆47Mar 15, 2023Updated 3 years ago
- Nile University's Arabic sentiment Lexicon☆17Nov 24, 2016Updated 9 years ago
- Arabic Dialect Identification on AOC data.☆24Mar 2, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- YaraSpell is an simplified arabic spell checker☆47Feb 20, 2017Updated 9 years ago
- WAZEN is an Arabic NLP text utility to find word variation pattern.☆15Sep 18, 2021Updated 4 years ago
- This buckwalter2unicode script is designed to convert Arabic text that has been transliterated to ASCII symbols using the Buckwalter Tran…☆13Sep 30, 2012Updated 13 years ago
- Benchmark Arabic text diacritization dataset☆79Apr 7, 2026Updated 2 months ago
- The Arabic NLP Python Library (Archived in favor of Matn library)☆11Apr 28, 2017Updated 9 years ago
- ☆12May 21, 2020Updated 6 years ago
- Extract dates from text☆66Jan 27, 2021Updated 5 years ago
- Arabic support for textblob☆87Oct 21, 2021Updated 4 years ago
- Jabalín is an application for generating verbs in Modern Standard Arabic. The application is implemented in python language version 3. Th…☆12Jul 12, 2015Updated 10 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Python transliteration library (mostly from non-latin scripts, such as Arabic, Japanese, etc.)☆20Dec 31, 2018Updated 7 years ago
- A command line version of Koja Stemmer (An Arabic rooting algorithm)☆21Apr 8, 2017Updated 9 years ago
- Arabic NLP tool used to perform Text Search, POS tagging, Translation, auto-diacritization, etc..☆91Feb 7, 2021Updated 5 years ago
- JavaScript Arabic Stemmer☆26Dec 1, 2012Updated 13 years ago
- This repository☆32Nov 13, 2022Updated 3 years ago
- This is a diacritization model for Arabic language. This model was built/trained using the Tashkeela: the Arabic diacritization corpus on…☆47Sep 10, 2023Updated 2 years ago
- Mono-width companion to Amiri font family☆35Jul 29, 2025Updated 10 months ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆15Apr 3, 2022Updated 4 years ago
- Contextualised Word Representations for Lexical Semantic Change Analysis☆33Jul 17, 2020Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Pronounce Arabic words☆19May 27, 2019Updated 7 years ago
- Arabic Parser Using Stanford API☆12Nov 11, 2017Updated 8 years ago
- A Javascript library that extends the native String object with methods to help when dealing with Arabic strings for node and the browser…☆56Sep 12, 2018Updated 7 years ago
- Youtube comments topics modeling and sentiment analyzer☆16Oct 25, 2022Updated 3 years ago
- LABR: Large Scale Arabic Book Reviews Dataset☆46Nov 14, 2014Updated 11 years ago
- This is a repository of the Multi-dialect Arabic BERT model.☆38Jul 14, 2020Updated 5 years ago
- ☆44Aug 7, 2015Updated 10 years ago