gsarti / it5
Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹
☆30 Updated last year
Alternatives and similar repositories for it5
Users who are interested in it5 are comparing it to the libraries listed below:
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆79 Updated 3 years ago
- A library to synthesize text datasets using Large Language Models (LLM) ☆152 Updated 3 years ago
- Explainable Zero-Shot Topic Extraction ☆65 Updated last year
- A python package for benchmarking interpretability techniques on Transformers. ☆215 Updated last year
- Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document … ☆187 Updated 2 years ago
- Code and data from the paper BERT Got a Date: Introducing Transformers to Temporal Tagging ☆69 Updated 3 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0. ☆105 Updated 3 years ago
- TimeLMs: Diachronic Language Models from Twitter ☆112 Updated last year
- Bi-encoder entity linking architecture ☆52 Updated last year
- A Python library aimed at dissecting and augmenting NER training data. ☆60 Updated 2 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆37 Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models ☆30 Updated 4 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆96 Updated 3 years ago
- ☆117 Updated 3 months ago
- German small and large versions of GPT2. ☆20 Updated 3 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ ☆85 Updated last week
- Few-shot Named Entity Recognition ☆121 Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆56 Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫 ☆95 Updated last month
- Comprehensive NLP Evaluation System ☆188 Updated last year
- ☆22 Updated 3 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2… ☆70 Updated 3 years ago
- 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation) ☆18 Updated 3 years ago
- Using short models to classify long texts ☆21 Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale ☆157 Updated 2 years ago
- A large scale dataset for Question Answering in Italian ☆27 Updated 7 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime. ☆127 Updated 5 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆156 Updated last year
- Some notebooks for NLP ☆207 Updated 2 years ago