gsarti / it5Links
Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" ๐ฎ๐น
โ30Updated last year
Alternatives and similar repositories for it5
Users that are interested in it5 are comparing it to the libraries listed below
Sorting:
- A library to synthesize text datasets using Large Language Models (LLM)โ152Updated 3 years ago
- Explainable Zero-Shot Topic Extractionโ65Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation frameworkโ79Updated 3 years ago
- A python package for benchmarking interpretability techniques on Transformers.โ215Updated last year
- ๐ซ SpaCy wrapper for ConceptNet ๐ซโ95Updated 3 weeks ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP modelsโฆโ37Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.โ60Updated 2 years ago
- Comprehensive NLP Evaluation Systemโ188Updated last year
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Taggingโ69Updated 3 years ago
- ๐ ๏ธ Tools for Transformers compression using PyTorch Lightning โกโ85Updated last week
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.โ127Updated 5 years ago
- โ22Updated 3 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.โ156Updated last year
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.โ59Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.โ105Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressionsโ56Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitterโ112Updated last year
- Bi-encoder entity linking architectureโ51Updated last year
- German small and large versions of GPT2.โ20Updated 3 years ago
- Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters โฆโ76Updated 2 weeks ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scaleโ157Updated 2 years ago
- Creating class-based TF-IDF matricesโ91Updated 3 years ago
- Tutorial to pretrain & fine-tune a ๐ค Flax T5 model on a TPUv3-8 with GCPโ58Updated 3 years ago
- โ35Updated 3 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality โฆโ105Updated last year
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR modelsโ30Updated 4 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.โ96Updated 2 years ago
- RaKUn 2.0 - A fast keyword detection algorithmโ70Updated 5 months ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2โฆโ70Updated 3 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.โ110Updated last year