gsarti / it5
Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹
☆30 Updated last year
Alternatives and similar repositories for it5
Users that are interested in it5 are comparing it to the libraries listed below
- A library to synthesize text datasets using Large Language Models (LLM) ☆151 Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆79 Updated 3 years ago
- Explainable Zero-Shot Topic Extraction ☆63 Updated last year
- A python package for benchmarking interpretability techniques on Transformers. ☆212 Updated last year
- ☆22 Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆59 Updated 2 years ago
- Bi-encoder entity linking architecture ☆50 Updated last year
- Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document … ☆186 Updated last year
- Comprehensive NLP Evaluation System ☆187 Updated last year
- TimeLMs: Diachronic Language Models from Twitter ☆111 Updated last year
- 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation) ☆18 Updated 3 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆96 Updated 2 years ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP ☆58 Updated 3 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP models… ☆37 Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressions ☆55 Updated 2 years ago
- ☆13 Updated 2 years ago
- Code and data from the paper BERT Got a Date: Introducing Transformers to Temporal Tagging ☆68 Updated 3 years ago
- German small and large versions of GPT2. ☆20 Updated 3 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. ☆156 Updated last year
- ☆35 Updated 3 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs). ☆76 Updated this week
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models" ☆34 Updated 4 months ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0. ☆104 Updated 3 years ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188) ☆61 Updated 2 years ago
- Efficiently find the best-suited language model (LM) for your NLP task ☆127 Updated 3 months ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP. ☆87 Updated 3 weeks ago
- 💫 SpaCy wrapper for ConceptNet 💫 ☆95 Updated 2 years ago
- ☆43 Updated 2 years ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime. ☆127 Updated 4 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models ☆31 Updated 4 years ago