A character-wise tokenizer for morphologically rich languages
☆31Sep 28, 2025Updated 8 months ago
Alternatives and similar repositories for RFTokenizer
Users that are interested in RFTokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An NLP pipeline for Hebrew☆41Jun 16, 2025Updated 11 months ago
- Named Entity (NER) annotations of the Hebrew Treebank (Haaretz newspaper) corpus, including: morpheme and token level NER labels, nested …☆11Dec 27, 2021Updated 4 years ago
- A simple configurable tool for manipulating dependency trees.☆14Dec 25, 2024Updated last year
- Arabic flexionnal morphology generator☆35Aug 28, 2024Updated last year
- A fork of languagetool to maintain Arabic☆18Mar 22, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies☆17Mar 4, 2020Updated 6 years ago
- Training files for Greek cursive script (in early print)☆15May 26, 2021Updated 5 years ago
- This is the development repository for The Oxford-BYU Syriac Corpus project.☆15Mar 10, 2026Updated 2 months ago
- A very simple python tokenizer for Hebrew text.☆26Nov 13, 2021Updated 4 years ago
- Tools for splitting, normalizing, text-shaping Arabic script☆12Jun 23, 2024Updated last year
- Arabic named entity recognition using AnerCorp corpus (location , organisation, person, Miscellaneous Word)☆37Jul 28, 2017Updated 8 years ago
- A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be…☆26Apr 3, 2014Updated 12 years ago
- sentiment analysis models for Arabic tweets to analyze Twitter comments as having positive, negative or neutral sentiments.☆13Mar 17, 2018Updated 8 years ago
- The dictionary comprised of the Coptic lexicon created by the BBAW and interface by Coptic SCRIPTORIUM. Currently deployed at https://co…☆34Jan 9, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Yaziji : Arabic phrase generator☆17Jan 2, 2025Updated last year
- Mushaf in xml format, Styling with XSLT and CSS☆18Apr 24, 2021Updated 5 years ago
- Debian, Fedora, Windows, macOS packaging scripts for Apertium, HFST, CG-3, and related techs.☆13Updated this week
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆19Apr 18, 2026Updated last month
- A memory-based morphological parser for Python☆16Oct 12, 2012Updated 13 years ago
- Dead Sea Scrolls in TF format based on Abegg's data☆29Apr 22, 2026Updated last month
- مكتبة جافاسكريبت تقوم باستبدال الأحرف اللاتنية عند الكتابة بأحرف عربية (والعكس) مع واجهة برمجة مرنة☆41Oct 22, 2019Updated 6 years ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.☆12Feb 5, 2020Updated 6 years ago
- dynamic-pass note-calculator☆11May 16, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Firefox and Chrome compatible extension that acts as annotation tool for websites (Named Entity Recognition)☆10Feb 17, 2019Updated 7 years ago
- A RegEx GUI☆14Jan 13, 2021Updated 5 years ago
- Ya (ي) programming language is an open-source programming language where you can write python code in the Arabic language.☆43Jan 31, 2019Updated 7 years ago
- Repository for GitDOX, a GitHub Data-storage Online XML editor☆16Feb 1, 2026Updated 3 months ago
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 5 months ago
- ☆13Dec 28, 2022Updated 3 years ago
- ☆30Feb 1, 2020Updated 6 years ago
- collection of code for helping me get things done☆16Feb 21, 2022Updated 4 years ago
- This dataset contains naturally-occurring English sentences that feature non-trivial noun-verb ambiguity.☆38Apr 26, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An application to display the text of the Hebrew Bible (Leningrad codex) along with an English translation (1917 JPS) and an audio record…☆14Jul 17, 2015Updated 10 years ago
- TEI-encoded contents of the Egyptian Gazette☆15Jun 11, 2024Updated last year
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Jun 27, 2023Updated 2 years ago
- Arabic support for textblob☆87Oct 21, 2021Updated 4 years ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Feb 13, 2020Updated 6 years ago
- Metrical position in Greek hexameter.☆13May 19, 2026Updated last week
- This project deals with hierarchical classification of web pages based on dmoz dataset.☆14Apr 10, 2014Updated 12 years ago