A part-of-speech tagger with support for domain adaptation and external resources.
☆24Oct 26, 2022Updated 3 years ago
Alternatives and similar repositories for SoMeWeTa
Users that are interested in SoMeWeTa are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A tokenizer and sentence splitter for German and English web and social media texts.☆153Dec 9, 2024Updated last year
- Deutschsprachige Einführung in die automatisierte Inhaltsanalyse mit R.☆18Sep 11, 2020Updated 5 years ago
- Compound splitter for German☆113Apr 5, 2020Updated 6 years ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆59Apr 1, 2026Updated last week
- Create and analyze argument graphs and serialize them via Protobuf☆10Mar 29, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- GermaParl: Corpus of Plenary Protocols of the German Bundestag (TEI Format)☆38Jun 1, 2023Updated 2 years ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- Polish data.☆13Nov 12, 2025Updated 5 months ago
- CoNLL 2018 Shared Task Team UDPipe-Future☆39Sep 29, 2020Updated 5 years ago
- An invisible-XML processor for XQuery and XSLT☆14Jun 11, 2024Updated last year
- A step-by-step tutorial for publishing data and an ontology as Linked Data on your machine.☆14May 9, 2023Updated 2 years ago
- AMDGPU bindings for Flux☆10Apr 6, 2021Updated 5 years ago
- Stand-off Text Annotation Model (STAM) is a data model for stand-off-text annotation where any information on a text is represented as an…☆19Mar 23, 2026Updated 3 weeks ago
- Software for multi-level annotation of linguistic corpora☆17Jan 15, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆10Jul 21, 2017Updated 8 years ago
- A library for language transfer methods and algorithms.☆16Feb 6, 2026Updated 2 months ago
- suffix array construction and searching algorithms for in-memory binary data.☆12Sep 10, 2022Updated 3 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Aug 15, 2023Updated 2 years ago
- A visualization tool to support reviewing the scientific literature☆14Jun 2, 2018Updated 7 years ago
- Python module to remove wiki markup text.☆10Jan 15, 2016Updated 10 years ago
- D3-based interactive bubble chart for topic model visualization☆13May 10, 2022Updated 3 years ago
- Web Service wrapper for accessing the AmbiverseNLU KG stored in Neo4j☆12Nov 16, 2022Updated 3 years ago
- HTML Abstract Markup Language for Julia. Inspired by Ruby's HAML.☆17Aug 17, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- This repository contains code and data download instructions for the workshop paper "Improving Hierarchical Product Classification using …☆17Apr 30, 2021Updated 4 years ago
- Wed is a web-based editor that assists users in editing XML documents according to a schema.☆24Dec 12, 2018Updated 7 years ago
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.☆10Jan 4, 2021Updated 5 years ago
- A multi-label classification plugin for AllenNLP.☆11Jan 13, 2023Updated 3 years ago
- Morphological analysis for Udmurt.☆12Feb 17, 2026Updated last month
- ☆14May 20, 2019Updated 6 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆15Apr 6, 2017Updated 9 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 6 years ago
- Part-of-speech tagging using BERT☆10Nov 14, 2019Updated 6 years ago
- ☆12Jan 27, 2026Updated 2 months ago
- Just another Julia Debugger☆14May 29, 2019Updated 6 years ago
- German Morphological Analyzer☆52Nov 12, 2021Updated 4 years ago
- Ontologies of Linguistic Annotation. Machine-readable tagsets and annotation schemata for more than 100 languages.☆24Feb 4, 2026Updated 2 months ago
- DuckDB wrapper for FAISS - Experimental☆30Mar 9, 2026Updated last month