musixmatchresearch / umberto
UmBERTo: an Italian Language Model trained with Whole Word Masking.
☆104Updated 2 years ago
Alternatives and similar repositories for umberto:
Users that are interested in umberto are comparing it to the libraries listed below
- AlBERTo the first italian BERT model for Twitter languange understanding☆72Updated 4 years ago
- GilBERTo: A pretrained language model based on RoBERTa for Italian☆73Updated 5 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 8 months ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆155Updated 2 years ago
- A Dutch RoBERTa-based language model☆198Updated 9 months ago
- A large scale dataset for Question Answering in Italian☆26Updated 6 years ago
- A spaCy wrapper for DBpedia Spotlight☆108Updated last year
- Sentiment analysis and emotion classification for Italian using BERT (fine-tuning). Published at the WASSA workshop (EACL2021).☆26Updated 6 months ago
- A collection of Italian benchmarks for LLM evaluation☆26Updated last month
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆79Updated 6 months ago
- 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)☆18Updated 2 years ago
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹☆30Updated 7 months ago
- Explainable Zero-Shot Topic Extraction☆62Updated 5 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆194Updated 2 years ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks☆157Updated last year
- 💫 SpaCy wrapper for ConceptNet 💫☆89Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆104Updated 9 months ago
- 🧪 Cutting-edge experimental spaCy components and features☆96Updated 9 months ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge.☆68Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆52Updated last year
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated last year
- Official repository of the Hate Speech Detection Tasks at Evalita☆12Updated 4 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 11 months ago
- ☆15Updated 3 years ago
- Camoscio: An Italian instruction-tuned language model based on LLaMA☆127Updated last year
- Implementation of the ClausIE information extraction system for python+spacy☆220Updated 2 years ago
- Annotated corpus + evaluation metrics for text anonymisation☆53Updated 11 months ago
- PYthon Automated Term Extraction☆310Updated last year
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s …☆135Updated last year