A small Python script that transliterates Arabic text using the Buckwalter transliteration scheme. Options control whether each class of diacritics and characters is transliterated or ignored. Useful for NLP experiments where you may want to normalize text.
☆26 · Apr 3, 2014 · Updated 12 years ago
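As a rough illustration of what such a script does, the sketch below maps Arabic code points to their standard Buckwalter ASCII symbols and optionally drops diacritics. The function name `transliterate` and the `keep_diacritics` flag are hypothetical and chosen for this example; they are not taken from the repository, whose actual options may differ.

```python
# Illustrative sketch only: names are hypothetical, not the repository's API.

# Arabic letters (plus tatweel and alif wasla) -> Buckwalter ASCII symbols.
LETTERS = {
    "\u0621": "'", "\u0622": "|", "\u0623": ">", "\u0624": "&",
    "\u0625": "<", "\u0626": "}", "\u0627": "A", "\u0628": "b",
    "\u0629": "p", "\u062A": "t", "\u062B": "v", "\u062C": "j",
    "\u062D": "H", "\u062E": "x", "\u062F": "d", "\u0630": "*",
    "\u0631": "r", "\u0632": "z", "\u0633": "s", "\u0634": "$",
    "\u0635": "S", "\u0636": "D", "\u0637": "T", "\u0638": "Z",
    "\u0639": "E", "\u063A": "g", "\u0640": "_", "\u0641": "f",
    "\u0642": "q", "\u0643": "k", "\u0644": "l", "\u0645": "m",
    "\u0646": "n", "\u0647": "h", "\u0648": "w", "\u0649": "Y",
    "\u064A": "y", "\u0671": "{",
}

# Diacritics: tanween, short vowels, shadda, sukun, dagger alif.
DIACRITICS = {
    "\u064B": "F", "\u064C": "N", "\u064D": "K", "\u064E": "a",
    "\u064F": "u", "\u0650": "i", "\u0651": "~", "\u0652": "o",
    "\u0670": "`",
}


def transliterate(text, keep_diacritics=True):
    """Return the Buckwalter transliteration of `text`.

    Characters outside both tables (spaces, digits, Latin text)
    pass through unchanged. With keep_diacritics=False, diacritics
    are dropped instead of transliterated.
    """
    out = []
    for ch in text:
        if ch in LETTERS:
            out.append(LETTERS[ch])
        elif ch in DIACRITICS:
            if keep_diacritics:
                out.append(DIACRITICS[ch])
        else:
            out.append(ch)
    return "".join(out)


# The word "kitab" (book) with short vowels: kaf, kasra, ta, fatha, alif, ba.
word = "\u0643\u0650\u062A\u064E\u0627\u0628"
print(transliterate(word))                         # kitaAb
print(transliterate(word, keep_diacritics=False))  # ktAb
```

Dropping diacritics before transliteration is one common normalization step, since most real-world Arabic text is written without them.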
Alternatives and similar repositories for Buckwalter
Users that are interested in Buckwalter are comparing it to the libraries listed below.
- Arabic News Stance Corpus ☆11 · Feb 5, 2021 · Updated 5 years ago
- Comparable documents miner: Arabic-English morphological analysis, text processing, n-gram features extraction, POS tagging, dictionary t… ☆35 · Apr 24, 2017 · Updated 9 years ago
- All resources created and used in Arabic Sentiment Analysis of Arabic Tweets. Includes Sentiment lexicon generated from Arabic tweets and… ☆14 · Dec 21, 2021 · Updated 4 years ago
- A Ruby gem that contains Natural Language Processing tools for Arabic. ☆11 · May 11, 2015 · Updated 11 years ago
- Arabic flexional morphology generator ☆35 · Aug 28, 2024 · Updated last year
- Convert Arabic diacritised text to a sequence of phonemes and create a pronunciation dictionary from them for alignment using HTK ☆63 · Jun 14, 2017 · Updated 9 years ago
- ☆30 · Feb 1, 2020 · Updated 6 years ago
- A desktop version of Edward Lane's Arabic-English Lexicon ☆21 · Apr 21, 2018 · Updated 8 years ago
- Repository for the project of building a large Arabic multidomain lexicon for sentiment analysis using feature selection from multiple reso… ☆16 · Jan 21, 2015 · Updated 11 years ago
- Python (Cython) binding for HarfBuzz, an OpenType text shaping engine. ☆19 · Aug 24, 2018 · Updated 7 years ago
- Experimenting with Sentiment Analysis in Arabic ☆10 · Aug 31, 2014 · Updated 11 years ago
- TEAD: Large Scale Arabic Dataset for Sentiment Analysis ☆12 · Oct 16, 2018 · Updated 7 years ago
- Dictionary app that allows you to look up Arabic words in transliteration ☆63 · Feb 17, 2026 · Updated 4 months ago
- ElixirFM Functional Arabic Morphology ☆48 · Mar 15, 2023 · Updated 3 years ago
- Nile University's Arabic sentiment Lexicon ☆17 · Nov 24, 2016 · Updated 9 years ago
- Arabic Dialect Identification on AOC data. ☆24 · Mar 2, 2019 · Updated 7 years ago
- YaraSpell is a simplified Arabic spell checker ☆46 · Feb 20, 2017 · Updated 9 years ago
- WAZEN is an Arabic NLP text utility to find word variation pattern. ☆14 · Sep 18, 2021 · Updated 4 years ago
- This buckwalter2unicode script is designed to convert Arabic text that has been transliterated to ASCII symbols using the Buckwalter Tran… ☆13 · Sep 30, 2012 · Updated 13 years ago
- Benchmark Arabic text diacritization dataset ☆78 · Apr 7, 2026 · Updated 2 months ago
- The Arabic NLP Python Library (Archived in favor of Matn library) ☆10 · Apr 28, 2017 · Updated 9 years ago
- Extract dates from text ☆66 · Jan 27, 2021 · Updated 5 years ago
- Arabic support for textblob ☆87 · Oct 21, 2021 · Updated 4 years ago
- Jabalín is an application for generating verbs in Modern Standard Arabic. The application is implemented in Python 3. Th… ☆12 · Jul 12, 2015 · Updated 10 years ago
- Python transliteration library (mostly from non-Latin scripts, such as Arabic, Japanese, etc.) ☆20 · Dec 31, 2018 · Updated 7 years ago
- A command line version of Koja Stemmer (An Arabic rooting algorithm) ☆21 · Apr 8, 2017 · Updated 9 years ago
- Arabic NLP tool used to perform Text Search, POS tagging, Translation, auto-diacritization, etc. ☆91 · Feb 7, 2021 · Updated 5 years ago
- This repository ☆32 · Nov 13, 2022 · Updated 3 years ago
- This is a diacritization model for the Arabic language. This model was built/trained using the Tashkeela: the Arabic diacritization corpus on… ☆47 · Sep 10, 2023 · Updated 2 years ago
- Mono-width companion to Amiri font family ☆36 · Jul 29, 2025 · Updated 11 months ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis… ☆15 · Apr 3, 2022 · Updated 4 years ago
- Pronounce Arabic words ☆19 · May 27, 2019 · Updated 7 years ago
- A platform to organize the work of charity in Algiers City ☆16 · Oct 29, 2019 · Updated 6 years ago
- Arabic Parser Using Stanford API ☆12 · Nov 11, 2017 · Updated 8 years ago
- A JavaScript library that extends the native String object with methods to help when dealing with Arabic strings for node and the browser… ☆56 · Sep 12, 2018 · Updated 7 years ago
- YouTube comment topic modeling and sentiment analyzer ☆16 · Oct 25, 2022 · Updated 3 years ago
- LABR: Large Scale Arabic Book Reviews Dataset ☆46 · Nov 14, 2014 · Updated 11 years ago
- This is a repository of the Multi-dialect Arabic BERT model. ☆38 · Jul 14, 2020 · Updated 5 years ago
- ☆44 · Aug 7, 2015 · Updated 10 years ago