gerulata / slovakbert
☆20Updated 2 years ago
Alternatives and similar repositories for slovakbert:
Users that are interested in slovakbert are comparing it to the libraries listed below
- A curated list of resources such as tools and datasets useful for the processing of Slovak language☆19Updated 2 weeks ago
- Interesting links to Slovak NLP tools, utils corpuses and resources.☆16Updated 3 years ago
- Polish BERT☆70Updated 4 years ago
- RoBERTa models for Polish☆87Updated 3 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆69Updated 4 years ago
- ☆50Updated 2 years ago
- HuSpaCy: industrial-strength Hungarian natural language processing☆164Updated 5 months ago
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆73Updated 3 years ago
- Polish morphological tagger.☆43Updated last year
- Implementation of the GBST block from the Charformer paper, in Pytorch☆116Updated 3 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆126Updated 4 years ago
- This is a neural spell checker☆65Updated 2 years ago
- Reading comprehension with ALBERT transformer model☆15Updated 3 years ago
- ☆20Updated 6 years ago
- Create interactive textual heat maps for Jupiter notebooks☆196Updated 10 months ago
- fastai ulmfit - Pretraining the Language Model, Fine-Tuning and training a Classifier☆33Updated 3 years ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated 9 months ago
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- Code for the paper "Language Models are Unsupervised Multitask Learners"☆109Updated 3 years ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆34Updated 3 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Browse Covid-19 & SARS-CoV-2 Scientific Papers with Transformers 🦠 📖☆182Updated 2 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆135Updated last year
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- 👩🏫 Pre-trained German Language Model with sub-word tokenization for ULMFIT☆17Updated 4 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆42Updated 2 years ago
- Visualising the Transformer encoder☆111Updated 4 years ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆200Updated last year
- Compass-aligned Distributional Embeddings. Align embeddings from different corpora☆39Updated 2 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆31Updated 3 years ago