slovak-nlp / resourcesLinks
A curated list of resources such as tools and datasets useful for the processing of Slovak language
☆22Updated last month
Alternatives and similar repositories for resources
Users that are interested in resources are comparing it to the libraries listed below
Sorting:
- A Neural Framework for MT Evaluation☆649Updated 3 weeks ago
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.☆167Updated last year
- Interesting links to Slovak NLP tools, utils corpuses and resources.☆17Updated 3 years ago
- A neural word aligner based on multilingual BERT☆355Updated 3 years ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 2 years ago
- simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.☆397Updated 2 years ago
- This repo is the home of Romanian Transformers.☆105Updated 2 years ago
- Easy to use, state-of-the-art Neural Machine Translation for 100+ languages☆1,238Updated last year
- Named Entity Recognition in PyTorch on CoNLL2003 dataset☆16Updated 3 years ago
- NeuSpell: A Neural Spelling Correction Toolkit☆696Updated 2 years ago
- Improved Sentence Alignment in Linear Time and Space☆180Updated 2 years ago
- Pre-trained models and language resources for Natural Language Processing in Polish☆351Updated last year
- Reference BLEU implementation that auto-downloads test sets and reports a version string to facilitate cross-lab comparisons☆1,168Updated this week
- Annotation Tool for Text Simplification Corpora☆17Updated last year
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆586Updated 2 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆252Updated 2 years ago
- Named Entity Recognition (NER) Annotation tool for SpaCy. Generates Traning Data as a JSON which can be readily used.☆585Updated 6 months ago
- A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)☆1,149Updated last year
- BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them t…☆144Updated last year
- Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.☆539Updated 4 months ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆375Updated last year
- ✔️Contextual word checker for better suggestions (not actively maintained)☆417Updated 7 months ago
- The robust European language model benchmark.☆120Updated this week
- 🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.☆871Updated last year
- A Scandinavian Benchmark for sentence embeddings☆40Updated 3 months ago
- A Python library for calculating a large variety of metrics from text☆346Updated 8 months ago
- Trankit is a Light-Weight Transformer-based Python Toolkit for Multilingual Natural Language Processing☆763Updated last month
- ☆37Updated 2 years ago
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation.☆11Updated 2 years ago
- SpanMarker for Named Entity Recognition☆451Updated 7 months ago