hdaSprachtechnologie / odenetLinks
Open German WordNet
☆96Updated last year
Alternatives and similar repositories for odenet
Users that are interested in odenet are comparing it to the libraries listed below
Sorting:
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆488Updated 8 months ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆146Updated 7 months ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆29Updated 3 years ago
- Python version for Doug Biber's Multidimensional Analysis (MDA)☆35Updated 4 months ago
- A part-of-speech tagger with support for domain adaptation and external resources.☆23Updated 2 years ago
- UIMA CAS processing library written in Python☆90Updated 3 weeks ago
- Compound splitter for German☆107Updated 5 years ago
- Toolkit to obtain and preprocess German text corpora, train models and evaluate them with generated testsets. Built with Gensim and Tenso…☆239Updated 10 months ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆77Updated 3 years ago
- ConllEditor is a tool to edit dependency syntax trees in CoNLL-U format.☆56Updated last week
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German☆26Updated 4 years ago
- A lemmatizer for German language text☆91Updated 2 years ago
- ☆64Updated last year
- Universal Dependencies online documentation☆288Updated this week
- Natural language processing resources for multiple languages, with an eye towards use for digital humanities.☆127Updated 4 years ago
- LingPy: Python library for quantitative tasks in historical linguistics☆136Updated 4 months ago
- Linguistic and stylistic complexity measures for (literary) texts☆82Updated last year
- Various utilities for processing the data.☆210Updated this week
- German Morphological Analyzer☆47Updated 3 years ago
- Python wrapper for the CWB to extract concordances and score frequency lists☆22Updated 3 weeks ago
- An initiative to collect and distribute resources for co-reference resolution in a unified standard.☆25Updated last year
- A fully-fledge PyTorch package for Morphological Analysis, tailored to morphologically rich and historical languages.☆23Updated last year
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆316Updated 3 weeks ago
- This is a german text corpus from Wikipedia. It is cleaned, preprocessed and sentence splitted. It's purpose is to train NLP embeddings l…☆24Updated 3 years ago
- Detect and align similar passages☆104Updated 2 months ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆151Updated 2 years ago
- A multilingual parallel corpus created from translations of the Bible.☆182Updated last month
- Compiled tools, datasets, and other resources for historical text normalization.☆18Updated 6 years ago
- 🆕 Work continues on INCEpTION 👉 https://github.com/inception-project/inception 👈 -- ⚠️ The official WebAnno repository has reached the…☆247Updated 2 years ago
- This packages up data for the Open Multilingual Wordnet☆50Updated last month