slovak-nlp / resourcesLinks
A curated list of resources such as tools and datasets useful for the processing of Slovak language
☆22Updated 3 months ago
Alternatives and similar repositories for resources
Users that are interested in resources are comparing it to the libraries listed below
Sorting:
- ☆21Updated 2 years ago
- Interesting links to Slovak NLP tools, utils corpuses and resources.☆17Updated 3 years ago
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.☆170Updated last year
- A Neural Framework for MT Evaluation☆674Updated last month
- ⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x.☆587Updated 2 years ago
- RoBERTa models for Polish☆88Updated 3 years ago
- simpleT5 is built on top of PyTorch-lightning⚡️ and Transformers🤗 that lets you quickly train your T5 models.☆399Updated 2 years ago
- A neural word aligner based on multilingual BERT☆357Updated 3 years ago
- Pre-trained models and language resources for Natural Language Processing in Polish☆357Updated last year
- Machine Translation (MT) Preparation Scripts☆33Updated 4 months ago
- NeuSpell: A Neural Spelling Correction Toolkit☆694Updated 2 years ago
- A practical and feature-rich paraphrasing framework to augment human intents in text form to build robust NLU models for conversational e…☆911Updated last year
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆377Updated last year
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 2 years ago
- Improved Sentence Alignment in Linear Time and Space☆184Updated 2 years ago
- BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them t…☆144Updated last year
- Easy to use, state-of-the-art Neural Machine Translation for 100+ languages☆1,241Updated last year
- Natural language understanding benchmarks for Norwegian☆14Updated last month
- ☆37Updated 2 years ago
- Facebook Low Resource (FLoRes) MT Benchmark☆752Updated last year
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆504Updated 11 months ago
- The Arabic Error Type Annotation tool aims to annotate Arabic error types following the ALC tagset annotation.☆11Updated 2 years ago
- This is a german text corpus from Wikipedia. It is cleaned, preprocessed and sentence splitted. It's purpose is to train NLP embeddings l…☆24Updated 3 years ago
- Yet Another Neural Machine Translation Toolkit☆179Updated 7 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆337Updated 2 years ago
- A french sequence to sequence pretrained model☆62Updated 3 years ago
- This repo is the home of Romanian Transformers.☆106Updated 3 years ago
- Efficient Attention for Long Sequence Processing☆97Updated last year
- Multi-task modelling extensions for huggingface transformers☆13Updated 3 months ago
- Romanian Word Embeddings. Here you can find pre-trained corpora of word embeddings. Current methods: CBOW, Skip-Gram, Fast-Text (from Gen…☆12Updated last week