gsarti / it5
Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹
☆30Updated 9 months ago
Alternatives and similar repositories for it5:
Users who are interested in it5 are comparing it to the libraries listed below
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated last year
- 🇮🇹 Italian BERT and ELECTRA models (incl. evaluation)☆18Updated 2 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.☆86Updated 2 months ago
- Explainable Zero-Shot Topic Extraction☆62Updated 7 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data; it should also work with any Hugging Face text dataset.☆93Updated 2 years ago
- ☆15Updated 3 years ago
- German small and large versions of GPT2.☆20Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆54Updated 2 years ago
- A French sequence-to-sequence pretrained model☆59Updated 2 years ago
- ☆22Updated 3 years ago
- ☆35Updated 3 years ago
- ☆43Updated last year
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆82Updated 4 months ago
- Source code and data for Like a Good Nearest Neighbor☆28Updated 2 months ago
- A Python package for benchmarking interpretability techniques on Transformers.☆213Updated 6 months ago
- Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP☆58Updated 2 years ago
- Recon NER: debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- Code and data from the paper BERT Got a Date: Introducing Transformers to Temporal Tagging☆66Updated 2 years ago
- A Python package to run inference with HuggingFace language and vision-language checkpoints, wrapping many convenient features.☆27Updated 6 months ago
- MAFAND-MT☆55Updated 8 months ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)☆61Updated last year
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2…☆68Updated 2 years ago
- A collection of datasets for language model pretraining, including scripts for downloading, preprocessing, and sampling.☆56Updated 8 months ago
- ☆11Updated 4 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆153Updated 10 months ago
- negate_sentence(A Python module that doesn't negate sentences.)☆30Updated 5 months ago
- Ranking of fine-tuned HF models as base models.☆35Updated last year