A part-of-speech tagger with support for domain adaptation and external resources.
☆24Oct 26, 2022Updated 3 years ago
Alternatives and similar repositories for SoMeWeTa
Users that are interested in SoMeWeTa are comparing it to the libraries listed below.
- A tokenizer and sentence splitter for German and English web and social media texts.☆153Dec 9, 2024Updated last year
- ☆11Nov 14, 2021Updated 4 years ago
- German-language introduction to automated content analysis with R.☆17Sep 11, 2020Updated 5 years ago
- Python wrapper for the CWB to extract concordances and score frequency lists☆22Jan 12, 2026Updated 2 months ago
- Compound splitter for German☆113Apr 5, 2020Updated 5 years ago
- Create and analyze argument graphs and serialize them via Protobuf☆10Updated this week
- GermaParl: Corpus of Plenary Protocols of the German Bundestag (TEI Format)☆38Jun 1, 2023Updated 2 years ago
- A lemmatizer for German language text☆94Feb 7, 2023Updated 3 years ago
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- ☆11Feb 13, 2026Updated last month
- Polish data.☆13Nov 12, 2025Updated 4 months ago
- German lemmatization with IWNLP as extension for spaCy☆27Jul 28, 2023Updated 2 years ago
- An invisible-XML processor for XQuery and XSLT☆14Jun 11, 2024Updated last year
- A step-by-step tutorial for publishing data and an ontology as Linked Data on your machine.☆14May 9, 2023Updated 2 years ago
- Parallel Universal Dependencies.☆13Nov 19, 2025Updated 4 months ago
- APIs for accessing digital objects in the collections of the Royal Danish Library☆11Mar 14, 2023Updated 3 years ago
- Stand-off Text Annotation Model (STAM) is a data model for stand-off-text annotation where any information on a text is represented as an…☆19Feb 25, 2026Updated 3 weeks ago
- Code for the paper on t-SNE with variable degree of freedom☆12Jun 27, 2019Updated 6 years ago
- Library for scraping, parsing, and analyzing privacy policies.☆18Feb 8, 2023Updated 3 years ago
- A library for language transfer methods and algorithms.☆16Feb 6, 2026Updated last month
- ☆10Jul 21, 2017Updated 8 years ago
- Web application to build XML stand-off markup☆15Mar 18, 2021Updated 5 years ago
- From Hero to Zéroe: A Benchmark of Low-Level Adversarial Attacks☆14Feb 23, 2023Updated 3 years ago
- A visualization tool to support reviewing the scientific literature☆14Jun 2, 2018Updated 7 years ago
- Python module to remove wiki markup text.☆10Jan 15, 2016Updated 10 years ago
- Python client to the INCEpTION annotation tool☆17Jun 10, 2025Updated 9 months ago
- D3-based interactive bubble chart for topic model visualization☆13May 10, 2022Updated 3 years ago
- Convert DataFrame to libffm data format in parallel☆30Apr 12, 2018Updated 7 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow☆14Apr 30, 2023Updated 2 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- Wed is a web-based editor that assists users in editing XML documents according to a schema.☆24Dec 12, 2018Updated 7 years ago
- A containerized all-in-one solution for CQPWeb☆18Jan 22, 2023Updated 3 years ago
- A multi-label classification plugin for AllenNLP.☆11Jan 13, 2023Updated 3 years ago
- Morphological analysis for Udmurt.☆12Feb 17, 2026Updated last month
- ☆14May 20, 2019Updated 6 years ago
- ☆12Jan 8, 2023Updated 3 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 5 years ago
- Data visualization of thousands of dots in different colors and arrangements☆39Mar 8, 2023Updated 3 years ago
- Python course materials for students of the continuing professional education program "Computational Linguistics" at HSE University (2020-2021)☆11Feb 21, 2022Updated 4 years ago