A part-of-speech tagger with support for domain adaptation and external resources.
☆24 · Oct 26, 2022 · Updated 3 years ago
Alternatives and similar repositories for SoMeWeTa
Users that are interested in SoMeWeTa are comparing it to the libraries listed below.
- A tokenizer and sentence splitter for German and English web and social media texts. ☆153 · Dec 9, 2024 · Updated last year
- ☆11 · Nov 14, 2021 · Updated 4 years ago
- German-language introduction to automated content analysis with R. ☆18 · Sep 11, 2020 · Updated 5 years ago
- Compound splitter for German. ☆113 · Apr 5, 2020 · Updated 6 years ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format. ☆59 · Apr 25, 2026 · Updated last week
- GermaParl: Corpus of Plenary Protocols of the German Bundestag (TEI Format). ☆38 · Jun 1, 2023 · Updated 2 years ago
- Ukrainian ELECTRA model. ☆12 · Mar 11, 2023 · Updated 3 years ago
- ☆11 · Feb 13, 2026 · Updated 2 months ago
- CoNLL 2018 Shared Task Team UDPipe-Future. ☆39 · Sep 29, 2020 · Updated 5 years ago
- Specification of a stand-off element for the TEI guidelines. ☆12 · Apr 29, 2021 · Updated 5 years ago
- German lemmatization with IWNLP as extension for spaCy. ☆27 · Apr 13, 2026 · Updated 3 weeks ago
- Parallel Universal Dependencies. ☆13 · Apr 27, 2026 · Updated last week
- A step-by-step tutorial for publishing data and an ontology as Linked Data on your machine. ☆14 · May 9, 2023 · Updated 2 years ago
- AMDGPU bindings for Flux. ☆10 · Apr 6, 2021 · Updated 5 years ago
- APIs for accessing digital objects in the collections of the Royal Danish Library. ☆11 · Mar 14, 2023 · Updated 3 years ago
- Code for the paper on t-SNE with variable degree of freedom. ☆11 · Jun 27, 2019 · Updated 6 years ago
- Software for multi-level annotation of linguistic corpora. ☆17 · Jan 15, 2020 · Updated 6 years ago
- Web application to build XML stand-off markup. ☆15 · Mar 18, 2021 · Updated 5 years ago
- A library for language transfer methods and algorithms. ☆16 · Feb 6, 2026 · Updated 2 months ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P… ☆34 · Aug 15, 2023 · Updated 2 years ago
- A visualization tool to support reviewing the scientific literature. ☆14 · Jun 2, 2018 · Updated 7 years ago
- D3-based interactive bubble chart for topic model visualization. ☆13 · May 10, 2022 · Updated 3 years ago
- Web Service wrapper for accessing the AmbiverseNLU KG stored in Neo4j. ☆12 · Nov 16, 2022 · Updated 3 years ago
- Convert DataFrame to libffm data format in parallel. ☆30 · Apr 12, 2018 · Updated 8 years ago
- HTML Abstract Markup Language for Julia. Inspired by Ruby's HAML. ☆17 · Aug 17, 2023 · Updated 2 years ago
- Minimal code to train ELMo models in recent versions of TensorFlow. ☆14 · Apr 30, 2023 · Updated 3 years ago
- ParCourE - Parallel Corpus Explorer. ☆12 · Dec 27, 2021 · Updated 4 years ago
- This repository contains code and data download instructions for the workshop paper "Improving Hierarchical Product Classification using … ☆17 · Apr 30, 2021 · Updated 5 years ago
- Wed is a web-based editor that assists users in editing XML documents according to a schema. ☆24 · Dec 12, 2018 · Updated 7 years ago
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish. ☆10 · Jan 4, 2021 · Updated 5 years ago
- A containerized all-in-one solution for CQPWeb. ☆18 · Jan 22, 2023 · Updated 3 years ago
- A multi-label classification plugin for AllenNLP. ☆11 · Jan 13, 2023 · Updated 3 years ago
- Morphological analysis for Udmurt. ☆12 · Apr 9, 2026 · Updated 3 weeks ago
- ☆12 · Jan 8, 2023 · Updated 3 years ago
- ☆14 · May 20, 2019 · Updated 6 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017). ☆15 · Apr 6, 2017 · Updated 9 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages. ☆15 · Apr 11, 2020 · Updated 6 years ago
- Python course materials for students of the "Computational Linguistics" continuing professional education program at HSE University (2020-2021). ☆11 · Feb 21, 2022 · Updated 4 years ago
- Part-of-speech tagging using BERT. ☆10 · Nov 14, 2019 · Updated 6 years ago