kldarek / polbert
Polish BERT
☆70Updated 4 years ago
Alternatives and similar repositories for polbert:
Users that are interested in polbert are comparing it to the libraries listed below
- Create interactive textual heat maps for Jupiter notebooks☆196Updated 8 months ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated 2 years ago
- COMBO is jointly trained tagger, lemmatizer and dependency parser.☆35Updated last year
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s …☆136Updated 2 years ago
- The code to reproduce results from paper "MultiFiT: Efficient Multi-lingual Language Model Fine-tuning" https://arxiv.org/abs/1909.04761☆283Updated 4 years ago
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆73Updated 3 years ago
- The weights for the embedding layer of Scandinavian UMLFiT language models☆32Updated 5 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆34Updated 3 years ago
- xfspell — the Transformer Spell Checker☆188Updated 4 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆126Updated 4 years ago
- spaCy + UDPipe☆160Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 11 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 8 months ago
- A french sequence to sequence pretrained model☆57Updated 2 years ago
- Sentence transformers models for SpaCy☆107Updated last year
- http://nlp.seas.harvard.edu/2018/04/03/attention.html☆63Updated 3 years ago
- RoBERTa models for Polish☆86Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆145Updated 3 years ago
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- ☆50Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆79Updated 7 months ago
- Explainable Zero-Shot Topic Extraction☆62Updated 6 months ago
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- PyTorch source code of NAACL 2019 paper "An Embarrassingly Simple Approach for Transfer Learning from Pretrained Language Models"☆96Updated last year
- MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert…☆48Updated 3 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)☆48Updated 3 years ago
- NLP French language model implementing ULMFiT☆87Updated 5 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 2 months ago