A character-wise tokenizer for morphologically rich languages
☆31 Sep 28, 2025 Updated 7 months ago
Alternatives and similar repositories for RFTokenizer
Users that are interested in RFTokenizer are comparing it to the libraries listed below.
- Named Entity (NER) annotations of the Hebrew Treebank (Haaretz newspaper) corpus, including: morpheme and token level NER labels, nested … ☆11 Dec 27, 2021 Updated 4 years ago
- A simple configurable tool for manipulating dependency trees. ☆14 Dec 25, 2024 Updated last year
- Arabic inflectional morphology generator ☆35 Aug 28, 2024 Updated last year
- Repository for DISRPT2023 shared task ☆17 Jul 26, 2024 Updated last year
- Public repository for Coptic SCRIPTORIUM Corpora Releases ☆40 Dec 12, 2025 Updated 4 months ago
- Scripts for compatibilitising between VISL-CG3, Apertium, CoNLL-X and Universal Dependencies ☆17 Mar 4, 2020 Updated 6 years ago
- Dataset of the Samaritan Pentateuch ☆12 Apr 15, 2026 Updated 3 weeks ago
- Training files for Greek cursive script (in early print) ☆15 May 26, 2021 Updated 4 years ago
- Fast corpus search engine originally made for the Corpus of Written Tatar language ☆17 Nov 9, 2019 Updated 6 years ago
- A very simple Python tokenizer for Hebrew text. ☆26 Nov 13, 2021 Updated 4 years ago
- Arabic named entity recognition using AnerCorp corpus (location, organisation, person, miscellaneous) ☆37 Jul 28, 2017 Updated 8 years ago
- Sentiment analysis models for Arabic tweets that classify Twitter comments as positive, negative, or neutral. ☆13 Mar 17, 2018 Updated 8 years ago
- ☆64 Feb 2, 2023 Updated 3 years ago
- Mushaf in XML format, styled with XSLT and CSS ☆18 Apr 24, 2021 Updated 5 years ago
- Swete's LXX Text from 1KY Greek with Corrections Against Manuscripts ☆10 Oct 11, 2020 Updated 5 years ago
- Debian, Fedora, Windows, macOS packaging scripts for Apertium, HFST, CG-3, and related techs. ☆13 Mar 25, 2026 Updated last month
- Repository for rstWeb, a browser based annotation interface for Rhetorical Structure Theory ☆48 Aug 15, 2025 Updated 8 months ago
- Dead Sea Scrolls in TF format based on Abegg's data ☆28 Apr 22, 2026 Updated 2 weeks ago
- A JavaScript library that replaces Latin characters with Arabic characters as you type (and vice versa), with a flexible API ☆41 Oct 22, 2019 Updated 6 years ago
- The complete [1 to 5]-gram Gumar Corpus in the style of Google n-grams. ☆12 Feb 5, 2020 Updated 6 years ago
- MetaC provides a read-eval-print loop (a REPL) and notebook interactive development environment (a NIDE) for C programming. MetaC also … ☆12 Mar 29, 2026 Updated last month
- dynamic-pass note-calculator ☆10 Feb 5, 2026 Updated 3 months ago
- Firefox and Chrome compatible extension that acts as annotation tool for websites (Named Entity Recognition) ☆10 Feb 17, 2019 Updated 7 years ago
- Do you even science, bro? Using RNNs to predict scientific titles. ☆14 Jun 5, 2017 Updated 8 years ago
- A RegEx GUI ☆14 Jan 13, 2021 Updated 5 years ago
- Ya (ي) programming language is an open-source programming language where you can write Python code in the Arabic language. ☆43 Jan 31, 2019 Updated 7 years ago
- Repository for GitDOX, a GitHub Data-storage Online XML editor ☆16 Feb 1, 2026 Updated 3 months ago
- Support for linguistics-style examples in Org mode ☆10 Dec 9, 2022 Updated 3 years ago
- Convert Transkribus PAGE-XML to standard PAGE-XML ☆12 Dec 10, 2025 Updated 4 months ago
- ☆13 Dec 28, 2022 Updated 3 years ago
- ☆30 Feb 1, 2020 Updated 6 years ago
- Social Context Analysis aNd Emotion Recognition ☆12 Jul 11, 2017 Updated 8 years ago
- Collection of code for helping me get things done ☆16 Feb 21, 2022 Updated 4 years ago
- An application to display the text of the Hebrew Bible (Leningrad codex) along with an English translation (1917 JPS) and an audio record… ☆14 Jul 17, 2015 Updated 10 years ago
- ☆25 Apr 1, 2026 Updated last month
- TEI-encoded contents of the Egyptian Gazette ☆15 Jun 11, 2024 Updated last year
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training. ☆14 Jun 27, 2023 Updated 2 years ago
- Arabic support for textblob ☆87 Oct 21, 2021 Updated 4 years ago
- Sequence to sequence model for Arabic punctuation prediction. ☆12 Feb 13, 2020 Updated 6 years ago