gsarti / it5Links
Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" ๐ฎ๐น
โ30Updated last year
Alternatives and similar repositories for it5
Users that are interested in it5 are comparing it to the libraries listed below
Sorting:
- A library to synthesize text datasets using Large Language Models (LLM)โ152Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation frameworkโ79Updated 3 years ago
- Explainable Zero-Shot Topic Extractionโ65Updated last year
- A python package for benchmarking interpretability techniques on Transformers.โ214Updated last year
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Taggingโ69Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.โ59Updated 2 years ago
- Bi-encoder entity linking architectureโ51Updated last year
- โ22Updated 3 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.โ156Updated last year
- ๐ ๏ธ Tools for Transformers compression using PyTorch Lightning โกโ85Updated this week
- Comprehensive NLP Evaluation Systemโ188Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP modelsโฆโ37Updated 3 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.โ36Updated 2 years ago
- โ35Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.โ105Updated 3 years ago
- Neural information retrieval / Semantic search / Bi-encodersโ174Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.โ96Updated 2 years ago
- ๐ซ SpaCy wrapper for ConceptNet ๐ซโ95Updated last week
- TimeLMs: Diachronic Language Models from Twitterโ111Updated last year
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.โ127Updated 5 years ago
- Few-shot Named Entity Recognitionโ122Updated 3 years ago
- Tutorial to pretrain & fine-tune a ๐ค Flax T5 model on a TPUv3-8 with GCPโ58Updated 3 years ago
- German small and large versions of GPT2.โ20Updated 3 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2โฆโ70Updated 2 years ago
- Our open source implementation of MiniLMv2 (https://aclanthology.org/2021.findings-acl.188)โ61Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressionsโ56Updated 2 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).โ76Updated 2 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality โฆโ105Updated last year
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entiโฆโ244Updated 2 years ago
- [EMNLP-Findings 2020] Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentencesโ63Updated last year