amir-zeldes/RFTokenizer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/amir-zeldes/RFTokenizer)

amir-zeldes / RFTokenizer

A character-wise tokenizer for morphologically rich languages

☆32

Alternatives and similar repositories for RFTokenizer

Users that are interested in RFTokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

amir-zeldes / DepEdit
View on GitHub
A simple configurable tool for manipulating dependency trees.
☆14Dec 25, 2024Updated last year
linuxscout / alyahmor
View on GitHub
Arabic flexionnal morphology generator
☆35Aug 28, 2024Updated last year
disrpt / sharedtask2023
View on GitHub
Repository for DISRPT2023 shared task
☆17Jul 26, 2024Updated 2 years ago
CopticScriptorium / corpora
View on GitHub
Public repository for Coptic SCRIPTORIUM Corpora Releases
☆49Updated this week
ftyers / ud-scripts
View on GitHub
Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies
☆17Mar 4, 2020Updated 6 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
UniversalDependencies / UD_Hebrew-IAHLTwiki
View on GitHub
☆10May 6, 2026Updated 2 months ago
DT-UCPH / sp
View on GitHub
Dataset of the Samaritan Pentateuch
☆14Jul 17, 2026Updated last week
pharos-alexandria / ocr-greek_cursive
View on GitHub
Training files for Greek cursive script (in early print)
☆15May 26, 2021Updated 5 years ago
mansayk / fastmorph
View on GitHub
Fast corpus search engine originally made for the Corpus of Written Tatar language
☆17Nov 9, 2019Updated 6 years ago
yassersaidi / Mushaf-XML-XSL-CSS-DTD
View on GitHub
Mushaf in xml format, Styling with XSLT and CSS
☆18Apr 24, 2021Updated 5 years ago
YontiLevin / Hebrew-Tokenizer
View on GitHub
A very simple python tokenizer for Hebrew text.
☆26Nov 13, 2021Updated 4 years ago
CAMeL-Lab / Gumar-Ngrams
View on GitHub
The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.
☆12Feb 5, 2020Updated 6 years ago
bashartalafha / Arabizi-Transliteration
View on GitHub
☆19Jan 13, 2021Updated 5 years ago
nickmasster / codernitydb3
View on GitHub
Pure python, embedded, fast, schema-less, NoSQL database
☆12Aug 1, 2020Updated 5 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
KentonMurray / Buckwalter
View on GitHub
A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be…
☆26Apr 3, 2014Updated 12 years ago
Mohabyoussef09 / Arabic-Sentiment-Analysis
View on GitHub
sentiment analysis models for Arabic tweets to analyze Twitter comments as having positive, negative or neutral sentiments.
☆13Mar 17, 2018Updated 8 years ago
mariananeves / annotation-tools
View on GitHub
☆64Feb 2, 2023Updated 3 years ago
linuxscout / yaziji
View on GitHub
Yaziji : Arabic phrase generator
☆17Jan 2, 2025Updated last year
amir-zeldes / rstWeb
View on GitHub
Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory
☆47Aug 15, 2025Updated 11 months ago
ETCBC / dss
View on GitHub
Dead Sea Scrolls in TF format based on Abegg's data
☆31Jul 3, 2026Updated 3 weeks ago
amir-zeldes / xrenner
View on GitHub
eXternally configurable REference and Non Named Entity Recognizer
☆17Jun 17, 2024Updated 2 years ago
Fcmam5 / oktob.js
View on GitHub
مكتبة جافاسكريبت تقوم باستبدال الأحرف اللاتنية عند الكتابة بأحرف عربية (والعكس) مع واجهة برمجة مرنة
☆41Oct 22, 2019Updated 6 years ago
amir-zeldes / gum
View on GitHub
Repository for the Georgetown University Multilayer Corpus (GUM)
☆109Jun 8, 2026Updated last month
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
mcallester / MetaC
View on GitHub
MetaC provides a read-eval-print loop (a REPL) and notebook interactive development environment (a NIDE) for C programming. MetaC also …
☆12Jul 11, 2026Updated 2 weeks ago
madjsmail / moadaly
View on GitHub
dynamic-pass note-calculator
☆11May 16, 2026Updated 2 months ago
KELLIA / dictionary
View on GitHub
The dictionary comprised of the Coptic lexicon created by the BBAW and interface by Coptic SCRIPTORIUM. Currently deployed at https://co…
☆34Jan 9, 2025Updated last year
sleeptillseven / LXX-Swete
View on GitHub
Swete's LXX Text from 1KY Greek with Corrections Against Manuscripts
☆10Oct 11, 2020Updated 5 years ago
gucorpling / gitdox
View on GitHub
Repository for GitDOX, a GitHub Data-storage Online XML editor
☆16Updated this week
yalang / ya
View on GitHub
Ya (ي) programming language is an open-source programming language where you can write python code in the Arabic language.
☆43Jan 31, 2019Updated 7 years ago
wassim31 / orsh
View on GitHub
ORSH - is an Oranios simple shell written in order to understand how shells work .
☆12Jun 12, 2024Updated 2 years ago
wjbmattingly / vulgata-spacy
View on GitHub
☆14Dec 28, 2022Updated 3 years ago
HassanAzzam / Arabic-NER
View on GitHub
☆30Feb 1, 2020Updated 6 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
alekkeersmaekers / glaux-nlp
View on GitHub
☆10Mar 2, 2026Updated 4 months ago
gsi-upm / scaner
View on GitHub
Social Context Analysis aNd Emotion Recognition
☆12Jul 11, 2017Updated 9 years ago
dvingo / my-clj-utils
View on GitHub
collection of code for helping me get things done
☆16Feb 21, 2022Updated 4 years ago
linuxscout / i3rab-quiz-data
View on GitHub
☆15Feb 16, 2024Updated 2 years ago
dig-eg-gaz / content
View on GitHub
TEI-encoded contents of the Egyptian Gazette
☆15Jun 6, 2026Updated last month
adhaamehab / textblob-ar
View on GitHub
Arabic support for textblob
☆87Oct 21, 2021Updated 4 years ago
google-research-datasets / noun-verb
View on GitHub
This dataset contains naturally-occurring English sentences that feature non-trivial noun-verb ambiguity.
☆38Apr 26, 2019Updated 7 years ago