slovak-nlp / resources
A curated list of resources, such as tools and datasets, useful for processing the Slovak language
☆19Updated 2 weeks ago
Alternatives and similar repositories for resources:
Users who are interested in resources compare it to the libraries listed below
- Interesting links to Slovak NLP tools, utilities, corpora, and resources.☆16Updated 3 years ago
- A French sequence-to-sequence pretrained model☆57Updated 2 years ago
- ☆239Updated 8 months ago
- Named Entity Recognition in PyTorch on the CoNLL2003 dataset☆16Updated 3 years ago
- ☆50Updated 2 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆267Updated last month
- The central repo for Creole-based NLU and NLG work☆17Updated 8 months ago
- Some notebooks for NLP☆194Updated last year
- Multilingual sentence alignment using sentence embeddings☆108Updated 3 months ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆257Updated 3 months ago
- Polish RoBERTa model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆34Updated 3 years ago
- cLang-8 is a dataset for grammatical error correction.☆103Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆212Updated last month
- ☆35Updated 2 years ago
- Easier Automatic Sentence Simplification Evaluation☆160Updated last year
- Deep Learning for Natural Language Processing - Lectures 2023☆164Updated 5 months ago
- Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.☆529Updated 6 months ago
- This is a neural spell checker☆63Updated 2 years ago
- ☆108Updated last year
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- RoBERTa models for Polish☆86Updated 2 years ago
- A neural word aligner based on multilingual BERT☆338Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 8 months ago
- Datasets for Hate Speech Detection☆124Updated last year
- NeuSpell: A Neural Spelling Correction Toolkit☆687Updated last year
- Multi-task modelling extensions for huggingface transformers☆13Updated last year
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆75Updated 3 weeks ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 2 years ago
- Text-to-sentence splitter using a heuristic algorithm by Philipp Koehn and Josh Schroeder.☆238Updated 2 years ago