A part-of-speech tagger with support for domain adaptation and external resources.
☆24Oct 26, 2022Updated 3 years ago
Alternatives and similar repositories for SoMeWeTa
Users that are interested in SoMeWeTa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A tokenizer and sentence splitter for German and English web and social media texts.☆152Dec 9, 2024Updated last year
- ☆11Nov 14, 2021Updated 4 years ago
- Deutschsprachige Einführung in die automatisierte Inhaltsanalyse mit R.☆18Sep 11, 2020Updated 5 years ago
- Python wrapper for the CWB to extract concordances and score frequency lists☆22May 11, 2026Updated last month
- Compound splitter for German☆112Apr 5, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆61Apr 25, 2026Updated last month
- Create and analyze argument graphs and serialize them via Protobuf☆10Updated this week
- A lemmatizer for German language text☆95Feb 7, 2023Updated 3 years ago
- ☆11Feb 13, 2026Updated 4 months ago
- Polish data.☆13May 6, 2026Updated last month
- CoNLL 2018 Shared Task Team UDPipe-Future☆39Sep 29, 2020Updated 5 years ago
- German lemmatization with IWNLP as extension for spaCy☆27Apr 13, 2026Updated 2 months ago
- An invisible-XML processor for XQuery and XSLT☆14Jun 11, 2024Updated 2 years ago
- A step-by-step tutorial for publishing data and an ontology as Linked Data on your machine.☆14May 9, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Parallel Universal Dependencies.☆13Updated this week
- Stand-off Text Annotation Model (STAM) is a data model for stand-off-text annotation where any information on a text is represented as an…☆19Mar 23, 2026Updated 2 months ago
- Code for the paper on t-SNE with variable degree of freedom☆11Jun 27, 2019Updated 6 years ago
- Software for multi-level annotation of linguistic corpora☆17Jan 15, 2020Updated 6 years ago
- ☆10Jul 21, 2017Updated 8 years ago
- Web application to build XML stand-off markup☆15Mar 18, 2021Updated 5 years ago
- A library for language transfer methods and algorithms.☆16Feb 6, 2026Updated 4 months ago
- suffix array construction and searching algorithms for in-memory binary data.☆12Sep 10, 2022Updated 3 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Aug 15, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks☆15Feb 23, 2023Updated 3 years ago
- Downloading, Processing and Visualization of Digital Elevation Model (DEM) Data☆14Dec 12, 2016Updated 9 years ago
- Blazing fast language detection using fastText model☆24Dec 18, 2022Updated 3 years ago
- A visualization tool to support reviewing the scientific literature☆14Jun 2, 2018Updated 8 years ago
- Python client to the INCEpTION annotation tool☆17Jun 10, 2025Updated last year
- Python module to remove wiki markup text.☆10Jan 15, 2016Updated 10 years ago
- Web Service wrapper for accessing the AmbiverseNLU KG stored in Neo4j☆12Nov 16, 2022Updated 3 years ago
- convert DataFrame to libffm data format in parallel☆30Apr 12, 2018Updated 8 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This repository contains code and data download instructions for the workshop paper "Improving Hierarchical Product Classification using …☆16Apr 30, 2021Updated 5 years ago
- Wed is a web-based editor that assists users in editing XML documents according to a schema.☆24Dec 12, 2018Updated 7 years ago
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.☆10Jan 4, 2021Updated 5 years ago
- A multi-label classification plugin for AllenNLP.☆11Jan 13, 2023Updated 3 years ago
- Morphological analysis for Udmurt.☆12May 23, 2026Updated 3 weeks ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆15Apr 6, 2017Updated 9 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 6 years ago