slovak-nlp / resourcesLinks
A curated list of resources such as tools and datasets useful for the processing of Slovak language
☆21Updated 3 weeks ago
Alternatives and similar repositories for resources
Users that are interested in resources are comparing it to the libraries listed below
Sorting:
- ☆20Updated 2 years ago
- Interesting links to Slovak NLP tools, utils corpuses and resources.☆17Updated 3 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆248Updated 2 years ago
- An educational tool to train, inspect, evaluate and translate using neural engines☆18Updated 3 months ago
- ☆8Updated 2 years ago
- A Python library for calculating a large variety of metrics from text☆340Updated 6 months ago
- Annotation Tool for Text Simplification Corpora☆17Updated last year
- NeuSpell: A Neural Spelling Correction Toolkit☆695Updated last year
- A neural word aligner based on multilingual BERT☆351Updated 3 years ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆389Updated 2 years ago
- Data and code for "Nibbling at the Hard Core of Word Sense Disambiguation" (ACL 2022).☆15Updated 3 years ago
- A Scandinavian Benchmark for sentence embeddings☆39Updated last month
- SpanMarker for Named Entity Recognition☆434Updated 5 months ago
- E3C is a freely available multilingual corpus (Italian, English, French, Spanish, and Basque) of semantically annotated clinical narrativ…☆25Updated last year
- ☆48Updated 11 months ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆262Updated 7 months ago
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- ☆164Updated last year
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆216Updated 5 months ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆182Updated 2 weeks ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆157Updated last year
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation☆80Updated last month
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.☆167Updated last year
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 2 years ago
- OpusFilter - Parallel corpus processing toolkit☆104Updated this week
- Transformer based translation quality estimation☆111Updated last year
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆164Updated 3 weeks ago
- The central repo for Creole based NLU and NLG work☆18Updated last month
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Updated 2 months ago