A character-wise tokenizer for morphologically rich languages
☆31Sep 28, 2025Updated 6 months ago
Alternatives and similar repositories for RFTokenizer
Users that are interested in RFTokenizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An NLP pipeline for Hebrew☆41Jun 16, 2025Updated 9 months ago
- Named Entity (NER) annotations of the Hebrew Treebank (Haaretz newspaper) corpus, including: morpheme and token level NER labels, nested …☆11Dec 27, 2021Updated 4 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11May 4, 2022Updated 3 years ago
- A simple configurable tool for manipulating dependency trees.☆14Dec 25, 2024Updated last year
- Arabic flexionnal morphology generator☆35Aug 28, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- A fork of languagetool to maintain Arabic☆18Mar 22, 2025Updated last year
- Public repository for Coptic SCRIPTORIUM Corpora Releases☆40Dec 12, 2025Updated 3 months ago
- Dataset of the Samaritan Pentateuch☆11Mar 17, 2026Updated last week
- Fast corpus search engine originally made for the Corpus of Written Tatar language☆17Nov 9, 2019Updated 6 years ago
- Repository for DISRPT2019 shared task☆12Sep 5, 2022Updated 3 years ago
- A small python script that transliterates Arabic text using the Buckwalter Transliteration Scheme. It allows for multiple decisions to be…☆26Apr 3, 2014Updated 11 years ago
- Pure python, embedded, fast, schema-less, NoSQL database☆12Aug 1, 2020Updated 5 years ago
- sentiment analysis models for Arabic tweets to analyze Twitter comments as having positive, negative or neutral sentiments.☆13Mar 17, 2018Updated 8 years ago
- The dictionary comprised of the Coptic lexicon created by the BBAW and interface by Coptic SCRIPTORIUM. Currently deployed at https://co…☆32Jan 9, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Mushaf in xml format, Styling with XSLT and CSS☆18Apr 24, 2021Updated 4 years ago
- Swete's LXX Text from 1KY Greek with Corrections Against Manuscripts☆10Oct 11, 2020Updated 5 years ago
- Debian, Fedora, Windows, macOS packaging scripts for Apertium, HFST, CG-3, and related techs.☆13Feb 17, 2026Updated last month
- This is a new backend implementation of the ANNIS linguistic search and visualization system.☆18Jan 13, 2026Updated 2 months ago
- A memory-based morphological parser for Python☆16Oct 12, 2012Updated 13 years ago
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory☆48Aug 15, 2025Updated 7 months ago
- eXternally configurable REference and Non Named Entity Recognizer☆17Jun 17, 2024Updated last year
- مكتبة جافاسكريبت تقوم باستبدال الأحرف اللاتنية عند الكتابة بأحرف عربية (والعكس) مع واجهة برمجة مرنة☆41Oct 22, 2019Updated 6 years ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams.☆12Feb 5, 2020Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A field-tested Hebrew tokenizer for dirty texts (ben-yehuda project, bible, cc100, mc4, opensubs, oscar, twitter) focused on multi-word e…☆23Aug 13, 2022Updated 3 years ago
- Repository for the Georgetown University Multilayer Corpus (GUM)☆107Mar 13, 2026Updated 2 weeks ago
- Firefox and Chrome compatible extension that acts as annotation tool for websites (Named Entity Recognition)☆10Feb 17, 2019Updated 7 years ago
- Do you even science, bro? Using RNN's to predict scientific titles.☆14Jun 5, 2017Updated 8 years ago
- A RegEx GUI☆14Jan 13, 2021Updated 5 years ago
- Ya (ي) programming language is an open-source programming language where you can write python code in the Arabic language.☆43Jan 31, 2019Updated 7 years ago
- Repository for GitDOX, a GitHub Data-storage Online XML editor☆16Feb 1, 2026Updated last month
- Convert Transkribus PAGE-XML to standard PAGE-XML☆12Dec 10, 2025Updated 3 months ago
- Support for linguistics-style examples in Org mode☆10Dec 9, 2022Updated 3 years ago
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- ☆13Dec 28, 2022Updated 3 years ago
- Social Context Analysis aNd Emotion Recognition☆12Jul 11, 2017Updated 8 years ago
- collection of code for helping me get things done☆16Feb 21, 2022Updated 4 years ago
- ☆24Jan 27, 2026Updated 2 months ago
- Arabic support for textblob☆86Oct 21, 2021Updated 4 years ago
- Sequence to sequence model for Arabic punctuation prediction.☆12Feb 13, 2020Updated 6 years ago
- Word embeddings from PPMI-weighted and dirichlet-smoothed co-occurrence matrices☆10Aug 3, 2020Updated 5 years ago