gsarti / it5
Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" ๐ฎ๐น
โ30Updated 7 months ago
Alternatives and similar repositories for it5:
Users that are interested in it5 are comparing it to the libraries listed below
- A french sequence to sequence pretrained modelโ57Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressionsโ52Updated last year
- A Python library aimed at dissecting and augmenting NER training data.โ57Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation frameworkโ80Updated 2 years ago
- โ22Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.โ93Updated last year
- Tutorial to pretrain & fine-tune a ๐ค Flax T5 model on a TPUv3-8 with GCPโ58Updated 2 years ago
- โ42Updated last year
- Explainable Zero-Shot Topic Extractionโ62Updated 5 months ago
- A large scale dataset for Question Answering in Italianโ26Updated 6 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.โ86Updated last week
- A library to synthesize text datasets using Large Language Models (LLM)โ151Updated 2 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.โ102Updated 2 years ago
- โ35Updated 2 years ago
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Taggingโ65Updated 2 years ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.โ25Updated 4 months ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)โ60Updated last year
- โ16Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer modelsโ65Updated 2 years ago
- ๐ ๏ธ Tools for Transformers compression using PyTorch Lightning โกโ81Updated 2 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.โ151Updated 7 months ago
- โ22Updated last year
- โ37Updated last year
- ๐ฎ๐น Italian BERT and ELECTRA models (incl. evaluation)โ17Updated 2 years ago
- Bi-encoder entity linking architectureโ44Updated 4 months ago
- German small and large versions of GPT2.โ20Updated 2 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021)โ48Updated 3 years ago
- ๐ค Disaggregators: Curated data labelers for in-depth analysis.โ65Updated last year
- This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyiโฆโ14Updated 2 years ago
- negate_sentence(A Python module that doesn't negate sentences.)โ27Updated 3 months ago