A character-wise tokenizer for morphologically rich languages
☆31Sep 28, 2025Updated 6 months ago
Alternatives and similar repositories for RFTokenizer
Users that are interested in RFTokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple configurable tool for manipulating dependency trees.☆14Dec 25, 2024Updated last year
- Arabic flexionnal morphology generator☆36Aug 28, 2024Updated last year
- A fork of languagetool to maintain Arabic☆18Mar 22, 2025Updated last year
- Repository for DISRPT2023 shared task☆17Jul 26, 2024Updated last year
- Public repository for Coptic SCRIPTORIUM Corpora Releases☆40Dec 12, 2025Updated 4 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies☆17Mar 4, 2020Updated 6 years ago
- Dataset of the Samaritan Pentateuch☆12Updated this week
- Fast corpus search engine originally made for the Corpus of Written Tatar language☆17Nov 9, 2019Updated 6 years ago
- Repository for DISRPT2019 shared task☆12Sep 5, 2022Updated 3 years ago
- Tools for splitting, normalizing, text-shaping Arabic script☆12Jun 23, 2024Updated last year
- Arabic named entity recognition using AnerCorp corpus (location , organisation, person, Miscellaneous Word)☆37Jul 28, 2017Updated 8 years ago
- A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be…☆26Apr 3, 2014Updated 12 years ago
- Pure python, embedded, fast, schema-less, NoSQL database☆12Aug 1, 2020Updated 5 years ago
- sentiment analysis models for Arabic tweets to analyze Twitter comments as having positive, negative or neutral sentiments.☆13Mar 17, 2018Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The dictionary comprised of the Coptic lexicon created by the BBAW and interface by Coptic SCRIPTORIUM. Currently deployed at https://co…☆33Jan 9, 2025Updated last year
- Yaziji : Arabic phrase generator☆17Jan 2, 2025Updated last year
- ☆64Feb 2, 2023Updated 3 years ago
- Mushaf in xml format, Styling with XSLT and CSS☆18Apr 24, 2021Updated 4 years ago
- Swete's LXX Text from 1KY Greek with Corrections Against Manuscripts☆10Oct 11, 2020Updated 5 years ago
- A memory-based morphological parser for Python☆16Oct 12, 2012Updated 13 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory☆48Aug 15, 2025Updated 8 months ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Jun 17, 2024Updated last year
- Dead Sea Scrolls in TF format based on Abegg's data☆28Updated this week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- مكتبة جافاسكريبت تقوم باستبدال الأحرف اللاتنية عند الكتابة بأحرف عربية (والعكس) مع واجهة برمجة مرنة☆41Oct 22, 2019Updated 6 years ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.☆12Feb 5, 2020Updated 6 years ago
- A field-tested Hebrew tokenizer for dirty texts (ben-yehuda project, bible, cc100, mc4, opensubs, oscar, twitter) focused on multi-word e…☆22Aug 13, 2022Updated 3 years ago
- MetaC provides a read-eval-print loop (a REPL) and notebook interactive development environment (a NIDE) for C programming. MetaC also …☆12Mar 29, 2026Updated 2 weeks ago
- Repository for the Georgetown University Multilayer Corpus (GUM)☆107Updated this week
- dynamic-pass note-calculator☆10Feb 5, 2026Updated 2 months ago
- Do you even science, bro? Using RNN's to predict scientific titles.☆14Jun 5, 2017Updated 8 years ago
- A RegEx GUI☆14Jan 13, 2021Updated 5 years ago
- Ya (ي) programming language is an open-source programming language where you can write python code in the Arabic language.☆43Jan 31, 2019Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Repository for GitDOX, a GitHub Data-storage Online XML editor☆16Feb 1, 2026Updated 2 months ago
- ORSH - is an Oranios simple shell written in order to understand how shells work .☆12Jun 12, 2024Updated last year
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 4 months ago
- ☆13Dec 28, 2022Updated 3 years ago
- ☆30Feb 1, 2020Updated 6 years ago
- Social Context Analysis aNd Emotion Recognition☆12Jul 11, 2017Updated 8 years ago
- yet another art bot☆13Jan 22, 2025Updated last year