slovak-nlp / resources
A curated list of resources such as tools and datasets useful for the processing of Slovak language
☆20Updated last month
Alternatives and similar repositories for resources:
Users that are interested in resources are comparing it to the libraries listed below
- Interesting links to Slovak NLP tools, utils corpuses and resources.☆17Updated 3 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆244Updated 2 years ago
- ☆20Updated 2 years ago
- Clustering sentence embeddings to extract message intent☆173Updated 3 years ago
- How good is BERT ? Comparing BERT to other state-of-the-art approaches on a French sentiment analysis dataset☆157Updated 2 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆361Updated last year
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆34Updated 3 years ago
- ☆158Updated 10 months ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆262Updated 5 months ago
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.☆163Updated last year
- Live survey of off-the-shelf language identification tools for python☆26Updated 3 years ago
- Multilingual sentence alignment using sentence embeddings☆116Updated 5 months ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆200Updated last year
- A french sequence to sequence pretrained model☆59Updated 2 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- coFR: COreference resolution tool for FRench (and singletons).☆24Updated 4 years ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆387Updated last year
- Transformer based translation quality estimation☆110Updated last year
- A collection of text simplification datasets and other resources☆42Updated 7 months ago
- A Scandinavian Benchmark for sentence embeddings☆36Updated 2 months ago
- Annotation Tool for Text Simplification Corpora☆17Updated last year
- RoBERTa models for Polish☆87Updated 3 years ago
- ☆50Updated 2 years ago
- A neural word aligner based on multilingual BERT☆346Updated 3 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Primary repository for the NLP course as part of the CogSci masters program at Aarhus University.☆22Updated 4 months ago
- A Python library for calculating a large variety of metrics from text☆337Updated 4 months ago
- Natural Language Processing Research in North American Linguistics Departments☆20Updated last month
- A repo to explore different NLP tasks which can be solved using T5☆172Updated 4 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆161Updated 7 months ago