A neural network that jointly part-of-speech tags and lemmatizes sentences, boosting accuracy for morphologically rich languages (Czech, Arabic, etc.).
☆34Apr 5, 2019Updated 7 years ago
Alternatives and similar repositories for LemmaTag
Users that are interested in LemmaTag are comparing it to the libraries listed below.
- Morphological analysis for Udmurt.☆12May 23, 2026Updated 2 weeks ago
- Sentence generation system for evaluating composition, described in Ettinger et al. (2018) "Assessing Composition in Sentence Vector Repr…☆16Apr 25, 2020Updated 6 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆40Jul 11, 2019Updated 6 years ago
- ☆49Dec 23, 2018Updated 7 years ago
- Turkish Morphology Datasets☆37Sep 25, 2023Updated 2 years ago
- This repo contains the software that was used to conduct the experiments reported in our article titled "Improving Named Entity Recogniti…☆20Dec 22, 2022Updated 3 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Dec 15, 2023Updated 2 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 3 years ago
- Several algorithms for converting dependency structures into constituency structures.☆10Feb 7, 2022Updated 4 years ago
- ☆11Jun 23, 2022Updated 3 years ago
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/…☆29Apr 17, 2024Updated 2 years ago
- Semeval-2021 Multilingual and Cross-lingual Word-in-Context Task☆18May 27, 2021Updated 5 years ago
- [NeurIPS 2024] 🕸 GlotCC Dataset and Pipeline☆20Apr 6, 2025Updated last year
- Featurize words into orthographic and phonological vectors.☆42May 20, 2023Updated 3 years ago
- Curriculum training☆22Jun 25, 2025Updated 11 months ago
- Multilingual character-based named entity recognizer☆24Apr 8, 2018Updated 8 years ago
- Specifications for the DTS API☆33May 18, 2026Updated 3 weeks ago
- The SETimes.HR+ Croatian dependency treebank☆16Dec 27, 2016Updated 9 years ago
- Persists RDF triples to both Neo4j (graph database) and Redis (key/value database), and decides which one to query.☆10Oct 31, 2018Updated 7 years ago
- Content Negotiation for Caddy.☆15May 16, 2026Updated 3 weeks ago
- PyTorch implementation of NAACL 2021 paper "Multi-view Subword Regularization"☆26Jun 2, 2021Updated 5 years ago
- ☆29Dec 23, 2019Updated 6 years ago
- ☆10Dec 28, 2017Updated 8 years ago
- A Python database interface for eXist-db☆15May 2, 2026Updated last month
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆225Dec 20, 2022Updated 3 years ago
- This repo contains the code for the paper Neural Factor Graph Models for Cross-lingual Morphological Tagging.☆52Nov 2, 2018Updated 7 years ago
- A full and updated Turkish stop words list, which should be filtered out prior to, or after, processing of natural language data, full te…☆21Mar 22, 2014Updated 12 years ago
- ☆11Nov 14, 2021Updated 4 years ago
- ☆10Jul 21, 2017Updated 8 years ago
- Arabic News Stance Corpus☆11Feb 5, 2021Updated 5 years ago
- Web application to build XML stand-off markup☆15Mar 18, 2021Updated 5 years ago
- Collect, discuss and manage feedback on OntoME☆12Dec 7, 2023Updated 2 years ago
- COLING 2018 Tutorial on Multilingual FrameNet: Automatic semantic role labeling for FrameNet☆25Aug 29, 2018Updated 7 years ago
- This dataset contains naturally-occurring English sentences that feature non-trivial noun-verb ambiguity.☆38Apr 26, 2019Updated 7 years ago
- TEI Transviewer is an interface intended for the exploration of primary and secondary sources, at the document level, in historical or oth…☆14Jul 17, 2021Updated 4 years ago
- Text preprocessing library for Turkish texts; lowercasing, replacing circumflexed characters with their plain equivalents, removing stopwords…☆22Sep 19, 2017Updated 8 years ago
- Annotated corpus of Arabic tweets that mention an act of violence.☆10Jun 6, 2018Updated 8 years ago
- This repository is about how to build an SQLite version of the Arabic WordNet database.☆11Mar 19, 2019Updated 7 years ago
- Experiment on metadata extraction using large language models such as GPT-3☆12Feb 1, 2023Updated 3 years ago