gsarti / it5Links
Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" ๐ฎ๐น
โ30Updated last year
Alternatives and similar repositories for it5
Users that are interested in it5 are comparing it to the libraries listed below
Sorting:
- A library to synthesize text datasets using Large Language Models (LLM)โ152Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation frameworkโ80Updated 3 years ago
- A python package for benchmarking interpretability techniques on Transformers.โ214Updated 11 months ago
- Explainable Zero-Shot Topic Extractionโ63Updated last year
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Taggingโ68Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.โ58Updated 2 years ago
- ๐ซ SpaCy wrapper for ConceptNet ๐ซโ95Updated 2 years ago
- โ22Updated 3 years ago
- ๐ ๏ธ Tools for Transformers compression using PyTorch Lightning โกโ85Updated 10 months ago
- Comprehensive NLP Evaluation Systemโ188Updated last year
- TimeLMs: Diachronic Language Models from Twitterโ111Updated last year
- โ24Updated 2 years ago
- Some notebooks for NLPโ207Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.โ156Updated last year
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.โ95Updated 2 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.โ105Updated 3 years ago
- Neural information retrieval / Semantic search / Bi-encodersโ174Updated 2 years ago
- Data and evaluation code for the paper WikiNEuRal: Combined Neural and Knowledge-based Silver Data Creation for Multilingual NER (EMNLP 2โฆโ68Updated 2 years ago
- Few-shot Named Entity Recognitionโ123Updated 3 years ago
- Tutorial to pretrain & fine-tune a ๐ค Flax T5 model on a TPUv3-8 with GCPโ58Updated 3 years ago
- German small and large versions of GPT2.โ20Updated 3 years ago
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP modelsโฆโ37Updated 3 years ago
- โ43Updated 2 years ago
- Creating class-based TF-IDF matricesโ89Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressionsโ55Updated 2 years ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.โ36Updated 2 years ago
- Camoscio: An Italian instruction-tuned language model based on LLaMAโ127Updated last year
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.โ127Updated 4 years ago
- Hashformers is a framework for hashtag segmentation with Transformers and Large Language Models (LLMs).โ73Updated last year
- Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics from an unlabeled document โฆโ187Updated last year