gerulata / slovakbertLinks
☆20Updated 2 years ago
Alternatives and similar repositories for slovakbert
Users that are interested in slovakbert are comparing it to the libraries listed below
Sorting:
- A curated list of resources such as tools and datasets useful for the processing of Slovak language☆21Updated 3 weeks ago
- Interesting links to Slovak NLP tools, utils corpuses and resources.☆17Updated 3 years ago
- Basic implementation of BERT and Transformer in Pytorch in one short python file (also includes "predict next word" GPT task)☆42Updated last year
- German GPT-2 model☆32Updated 3 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- Some notebooks for NLP☆204Updated last year
- AlBERTo the first italian BERT model for Twitter languange understanding☆72Updated 4 years ago
- Polish BERT☆70Updated 4 years ago
- A 🤗-style implementation of BERT using lambda layers instead of self-attention☆69Updated 4 years ago
- Main repository for "CharacterBERT: Reconciling ELMo and BERT for Word-Level Open-Vocabulary Representations From Characters"☆201Updated last year
- MorphoDiTa: Morphologic Dictionary and Tagger☆73Updated last year
- ☆50Updated 2 years ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆157Updated 2 years ago
- CNNs for Classification in PyTorch☆20Updated 10 months ago
- Interactive Neural Machine Translation tool☆53Updated last year
- An educational tool to train, inspect, evaluate and translate using neural engines☆18Updated 3 months ago
- Implementation of the GBST block from the Charformer paper, in Pytorch☆117Updated 3 years ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆27Updated 9 months ago
- SentAugment is a data augmentation technique for NLP that retrieves similar sentences from a large bank of sentences. It can be used in c…☆362Updated 3 years ago
- French word embeddings from series sub-titles☆22Updated 6 years ago
- DeEpLearning models for MultIlingual haTespeech (DELIMIT): Benchmarking multilingual models across 9 languages and 16 datasets.☆109Updated 2 years ago
- ☆42Updated 3 years ago
- ☆31Updated 6 years ago
- Romanian Semantic Textual Similarity Dataset☆16Updated 2 years ago
- ☆39Updated 3 years ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated last year
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 4 years ago
- fastai ulmfit - Pretraining the Language Model, Fine-Tuning and training a Classifier☆33Updated 3 years ago
- A program to choose transfer languages for cross-lingual learning☆72Updated 2 years ago
- Machine Translation (MT) Preparation Scripts☆32Updated last month