slovak-nlp / resources
A curated list of resources such as tools and datasets useful for the processing of Slovak language
☆18Updated this week
Related projects ⓘ
Alternatives and complementary repositories for resources
- ☆19Updated last year
- Interesting links to Slovak NLP tools, utils corpuses and resources.☆16Updated 2 years ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 2 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆230Updated 2 years ago
- Athens NLP Summer School 2024 - Lab material☆19Updated last month
- Named Entity Recognition in PyTorch on CoNLL2003 dataset☆16Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated last year
- Some notebooks for NLP☆188Updated last year
- German Alpaca Dataset (Cleaned + Translated)☆23Updated last year
- Live survey of off-the-shelf language identification tools for python☆26Updated 2 years ago
- Efficient Attention for Long Sequence Processing☆89Updated 11 months ago
- A neural word aligner based on multilingual BERT☆328Updated 2 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆451Updated 3 weeks ago
- A repo to explore different NLP tasks which can be solved using T5☆169Updated 3 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆351Updated last year
- Improved Sentence Alignment in Linear Time and Space☆163Updated last year
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆161Updated 2 weeks ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆187Updated 3 years ago
- A Neural Framework for MT Evaluation☆508Updated 3 months ago
- Clustering sentence embeddings to extract message intent☆167Updated 3 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆112Updated 6 months ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆155Updated last year
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆312Updated last month
- Applying BERT to named entity recognition in English and Russian.☆160Updated last year
- Interpretability for sequence generation models 🐛 🔍☆377Updated last week
- This repository contains the two datasets introduced in the paper "Making Science Simple: Corpora for the Lay Summarisation of Scientific…☆21Updated 6 months ago
- An NLP system for generating reading comprehension questions☆281Updated 9 months ago
- A Scandinavian Benchmark for sentence embeddings☆28Updated last week
- Yet Another Neural Machine Translation Toolkit☆174Updated 4 months ago
- Transformer-based Long Document Classification☆15Updated 2 years ago