gerulata / slovakbert
☆19Updated last year
Alternatives and similar repositories for slovakbert:
Users that are interested in slovakbert are comparing it to the libraries listed below
- A curated list of resources such as tools and datasets useful for the processing of Slovak language☆19Updated 2 months ago
- Interesting links to Slovak NLP tools, utils corpuses and resources.☆16Updated 3 years ago
- French word embeddings from series sub-titles☆22Updated 6 years ago
- Implementation of the GBST block from the Charformer paper, in Pytorch☆117Updated 3 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- A french sequence to sequence pretrained model☆57Updated 2 years ago
- LM Pretraining with PyTorch/TPU☆134Updated 5 years ago
- German small and large versions of GPT2.☆20Updated 2 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆133Updated last year
- This repo is the home of Romanian Transformers.☆98Updated 2 years ago
- I.PHI dataset generation☆25Updated last year
- xfspell — the Transformer Spell Checker☆188Updated 4 years ago
- ☆46Updated 4 years ago
- Minimal implementation of Multi-layer Recurrent Neural Networks (LSTM) for character-level language modelling in PyTorch☆46Updated 5 years ago
- French Machine Reading for Question Answering☆18Updated 2 years ago
- Robust Cross-lingual Embeddings from Parallel Sentences☆21Updated 4 years ago
- Character-based word embeddings model based on RNN for handling real world texts☆173Updated last year
- Polish BERT☆70Updated 4 years ago
- This is a neural spell checker☆62Updated 2 years ago
- Some notebooks for NLP☆189Updated last year
- ☆50Updated 2 years ago
- A tool for extracting plain text from Wikipedia dumps☆17Updated 5 years ago
- Agile reading group that works☆13Updated 2 years ago
- OpenNeuroSpell contains parts of NeuroSpell (http://neurospell.com/en.php) released as open-source. More code will be published as soon a…☆20Updated 2 months ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆126Updated 4 years ago
- Language Modeling Example with Transformers and PyTorch Lighting☆65Updated 4 years ago
- Romanian Semantic Textual Similarity Dataset☆15Updated 2 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆77Updated 4 months ago
- This is a german text corpus from Wikipedia. It is cleaned, preprocessed and sentence splitted. It's purpose is to train NLP embeddings l…☆22Updated 2 years ago