gerulata / slovakbert
☆19Updated last year
Related projects
Alternatives and complementary repositories for slovakbert
- Interesting links to Slovak NLP tools, utilities, corpora, and resources.☆16Updated 2 years ago
- A curated list of resources, such as tools and datasets, useful for processing the Slovak language☆18Updated this week
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆155Updated last year
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated last year
- BERT model trained from scratch on Finnish☆96Updated 3 years ago
- AlBERTo, the first Italian BERT model for Twitter language understanding☆71Updated 4 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆76Updated 4 months ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆132Updated last year
- Tools for assessing Finnish poetry: rhymes, meter, hyphenation of Finnish and so on.☆11Updated 11 months ago
- Compound splitter for German☆103Updated 4 years ago
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s …☆135Updated last year
- Implementation of the GBST block from the Charformer paper, in PyTorch☆117Updated 3 years ago
- Text to sentence splitter using a heuristic algorithm by Philipp Koehn and Josh Schroeder.☆230Updated 2 years ago
- Code for "Mind Your Inflections! Improving NLP for Non-Standard Englishes with Base-Inflection Encoding" (EMNLP 2020).☆11Updated last year
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆112Updated 6 months ago
- The MWE identification system, MTLB-STRUCT, participated in the PARSEME 1.2 Shared Task on semi-supervised identification of verbal multi…☆14Updated 8 months ago
- A collection of Italian benchmarks for LLM evaluation☆22Updated 3 weeks ago
- German GPT-2 model☆32Updated 3 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆135Updated 3 months ago
- ☆12Updated 3 years ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆77Updated 3 years ago
- Polish BERT☆70Updated 4 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆36Updated last year
- 🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings☆58Updated last year
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆126Updated 3 years ago
- Easier Automatic Sentence Simplification Evaluation☆159Updated last year
- PyTorch implementation of Recurrence over BERT (RoBERT) based on this paper https://arxiv.org/abs/1910.10781 and comparison with PyTorch …☆79Updated 2 years ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆67Updated 3 years ago
- Multi-Annotator Competence Estimation tool☆63Updated 5 years ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆32Updated last month