essential-data / nlp-sk-interesting-links
Interesting links to Slovak NLP tools, utilities, corpora, and resources.
☆17 Updated 3 years ago
Alternatives and similar repositories for nlp-sk-interesting-links
Users who are interested in nlp-sk-interesting-links are comparing it to the repositories listed below.
- ☆20 Updated 2 years ago
- MorphoDiTa: Morphologic Dictionary and Tagger ☆73 Updated last year
- A curated list of resources such as tools and datasets useful for the processing of Slovak language ☆20 Updated 2 months ago
- ☆41 Updated 9 years ago
- French stopwords collection ☆96 Updated 5 years ago
- German Morphological Analyzer ☆47 Updated 3 years ago
- Unsupervised Language Model Pre-training for French ☆248 Updated 2 years ago
- German stopwords collection ☆85 Updated 2 years ago
- NameTag: Named Entity Tagger ☆38 Updated 9 months ago
- SymSpellCompound: compound aware automatic spelling correction ☆66 Updated 7 years ago
- A tokenizer and sentence splitter for German and English web and social media texts. ☆144 Updated 5 months ago
- The Open Multilingual Wordnet ☆61 Updated last year
- Named Entity Recognition data for Europeana Newspapers ☆171 Updated 2 years ago
- Language independent truecaser in Python. ☆160 Updated 3 years ago
- UDPipe: Trainable pipeline for tokenizing, tagging, lemmatizing and parsing Universal Treebanks and other CoNLL-U files ☆380 Updated 6 months ago
- Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary. ☆192 Updated last year
- Slovak dictionary for hunspell ☆21 Updated 3 weeks ago
- Custom French POS and lemmatizer based on Lefff for spacy ☆66 Updated 2 years ago
- UIMA CAS processing library written in Python ☆89 Updated 2 months ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni… ☆12 Updated last year
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model. ☆76 Updated 3 years ago
- Named Entity Recognition (LSTM + CRF + FastText) with models for [historic] German ☆26 Updated 4 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆80 Updated 11 months ago
- Investigating multilingual language models (BERT) by using them for NER in German and English ☆14 Updated 6 years ago
- Repository for "Towards Robust Named Entity Recognition for Historic German" ☆18 Updated 4 years ago
- Automatic extraction of edited sentences from text edition histories. ☆83 Updated 3 years ago
- SIGTYP 2024 Shared Task on Word Embedding Evaluation for Ancient and Historical Languages ☆9 Updated last year
- T-scan: an analysis tool for Dutch texts to assess the complexity of the text, based on original work by Rogier Kraf ☆18 Updated last week
- Anonymization of legal cases (Fr) based on Flair embeddings ☆88 Updated 4 years ago
- A collection of task-specific NLU datasets ☆149 Updated 3 years ago