A part-of-speech tagger with support for domain adaptation and external resources.
☆24Oct 26, 2022Updated 3 years ago
Alternatives and similar repositories for SoMeWeTa
Users that are interested in SoMeWeTa are comparing it to the libraries listed below.
- A tokenizer and sentence splitter for German and English web and social media texts.☆153Dec 9, 2024Updated last year
- German-language introduction to automated content analysis with R.☆18Sep 11, 2020Updated 5 years ago
- Python wrapper for the CWB to extract concordances and score frequency lists☆22May 11, 2026Updated 2 weeks ago
- DER SPIEGEL templates for Svelte components☆12Oct 15, 2024Updated last year
- Compound splitter for German☆113Apr 5, 2020Updated 6 years ago
- Create and analyze argument graphs and serialize them via Protobuf☆10Updated this week
- Ukrainian ELECTRA model☆12Mar 11, 2023Updated 3 years ago
- A lemmatizer for German language text☆95Feb 7, 2023Updated 3 years ago
- language resources for Luxembourgish☆14Jul 20, 2023Updated 2 years ago
- ☆11Feb 13, 2026Updated 3 months ago
- German lemmatization with IWNLP as extension for spaCy☆27Apr 13, 2026Updated last month
- A step-by-step tutorial for publishing data and an ontology as Linked Data on your machine.☆14May 9, 2023Updated 3 years ago
- Parallel Universal Dependencies.☆13Updated this week
- APIs for accessing digital objects in the collections of the Royal Danish Library☆11Mar 14, 2023Updated 3 years ago
- Stand-off Text Annotation Model (STAM) is a data model for stand-off-text annotation where any information on a text is represented as an…☆19Mar 23, 2026Updated 2 months ago
- Code for the paper on t-SNE with variable degree of freedom☆11Jun 27, 2019Updated 6 years ago
- Software for multi-level annotation of linguistic corpora☆17Jan 15, 2020Updated 6 years ago
- A library for language transfer methods and algorithms.☆16Feb 6, 2026Updated 3 months ago
- suffix array construction and searching algorithms for in-memory binary data.☆12Sep 10, 2022Updated 3 years ago
- QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…☆34Aug 15, 2023Updated 2 years ago
- Python module to remove wiki markup text.☆10Jan 15, 2016Updated 10 years ago
- convert DataFrame to libffm data format in parallel☆30Apr 12, 2018Updated 8 years ago
- HTML Abstract Markup Language for Julia. Inspired by Ruby's HAML.☆17Aug 17, 2023Updated 2 years ago
- ParCourE - Parallel Corpus Explorer☆12Dec 27, 2021Updated 4 years ago
- Distribution of word meanings in Wikipedia for English, Italian, French, German and Spanish.☆10Jan 4, 2021Updated 5 years ago
- For all, who want less 📈 and more 🐱👤 to describe the current COVID-19 situation. Stay safe. 💌☆12Dec 11, 2020Updated 5 years ago
- Morphological analysis for Udmurt.☆12May 12, 2026Updated last week
- ☆12Jan 8, 2023Updated 3 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆15Apr 6, 2017Updated 9 years ago
- Deep memory and sequence models in JAX☆28Apr 23, 2026Updated last month
- Rapidly scaffold out visual-vocabulary projects☆11Jan 10, 2019Updated 7 years ago
- Data and scripts for the proper evaluation of cross-lingual embeddings in multiple languages☆15Apr 11, 2020Updated 6 years ago
- Data visualization of thousands of dots in different colors and arrangements☆39Mar 8, 2023Updated 3 years ago
- Python course materials for students of the "Computational Linguistics" continuing professional education program at HSE University (2020-2021)☆12Feb 21, 2022Updated 4 years ago
- Part-of-speech tagging using BERT☆10Nov 14, 2019Updated 6 years ago
- Webpack loader to extract frontmatter using jxson/front-matter☆12May 6, 2025Updated last year
- ☆12Jan 27, 2026Updated 3 months ago
- Just another Julia Debugger☆14May 29, 2019Updated 6 years ago
- Small DynDNS Script (which works with Plesk Onyx)☆10Jun 19, 2020Updated 5 years ago