stefan-it / italian-bertelectra
๐ฎ๐น Italian BERT and ELECTRA models (incl. evaluation)
โ18Updated 2 years ago
Alternatives and similar repositories for italian-bertelectra:
Users that are interested in italian-bertelectra are comparing it to the libraries listed below
- Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" ๐ฎ๐นโ30Updated 8 months ago
- Sentiment analysis and emotion classification for Italian using BERT (fine-tuning). Published at the WASSA workshop (EACL2021).โ26Updated 7 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.โ151Updated 8 months ago
- A spaCy custom component that extracts and normalizes temporal expressionsโ54Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.โ58Updated last year
- A large scale dataset for Question Answering in Italianโ26Updated 6 years ago
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyiโฆโ14Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.โ86Updated last month
- A survey of corpora for Germanic low-resource languages and dialectsโ24Updated 2 months ago
- โ35Updated 2 years ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.โ26Updated 5 months ago
- โ15Updated 3 years ago
- UmBERTo: an Italian Language Model trained with Whole Word Masking.โ104Updated 2 years ago
- A High-level Library for Named Entity Recognition in Python.โ23Updated last year
- BERT and ELECTRA models trained on Europeana Newspapersโ37Updated 3 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2โฆโ67Updated 2 years ago
- โ37Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation frameworkโ80Updated 2 years ago
- A library to synthesize text datasets using Large Language Models (LLM)โ151Updated 2 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocesssing, and sampling.โ55Updated 6 months ago
- AlBERTo the first italian BERT model for Twitter languange understandingโ72Updated 4 years ago
- ๐ซ SpaCy wrapper for ConceptNet ๐ซโ89Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doโฆโ79Updated 7 months ago
- Camoscio: An Italian instruction-tuned language model based on LLaMAโ127Updated last year
- โ16Updated 2 years ago
- Efficient Language Model Training through Cross-Lingual and Progressive Transfer Learningโ29Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer modelsโ65Updated 2 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.โ76Updated last year
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.โ102Updated 2 years ago
- โ22Updated 2 years ago