infinitylogesh / mutate
A library to synthesize text datasets using Large Language Models (LLM)
β151Updated 2 years ago
Alternatives and similar repositories for mutate:
Users that are interested in mutate are comparing it to the libraries listed below
- Augmenty is an augmentation library based on spaCy for augmenting texts.β151Updated 8 months ago
- A Python library aimed at dissecting and augmenting NER training data.β57Updated last year
- π« SpaCy wrapper for ConceptNet π«β89Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation frameworkβ80Updated 2 years ago
- Few-shot Named Entity Recognitionβ122Updated 2 years ago
- Explainable Zero-Shot Topic Extractionβ62Updated 5 months ago
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.β126Updated 4 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with entiβ¦β244Updated last year
- A spaCy custom component that extracts and normalizes temporal expressionsβ52Updated last year
- SummVis is an interactive visualization tool for text summarization.β251Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scaleβ154Updated last year
- Alternate Implementation for Zero Shot Text Classification: Instead of reframing NLI/XNLI, this reframes the text backbone of CLIP modelsβ¦β37Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality β¦β106Updated 11 months ago
- Question-answers, collected from Googleβ125Updated 3 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.β104Updated 9 months ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".β187Updated 3 years ago
- Collection of NLP model explanations and accompanying analysis toolsβ145Updated last year
- All the goto functions you need to handle NLP use-cases, integrated in NLPretextβ139Updated 10 months ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.β86Updated 3 weeks ago
- Open source library for few shot NLPβ77Updated last year
- Information extraction from English and German texts based on predicate logicβ135Updated last year
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Selfβ¦β201Updated 2 years ago
- A PyTorch-based open-source framework that provides methods for improving the weakly annotated data and allows researchers to efficientlyβ¦β108Updated 4 months ago
- An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.β36Updated last year
- β74Updated 3 years ago
- πΈ fastText + Bloom embeddings for compact, full-coverage vectors with spaCyβ301Updated last year
- π€ Push your spaCy pipelines to the Hugging Face Hubβ43Updated 7 months ago
- SpikeX - SpaCy Pipes for Knowledge Extractionβ397Updated 3 years ago
- Sentence transformers models for SpaCyβ107Updated last year
- Code and data form the paper BERT Got a Date: Introducing Transformers to Temporal Taggingβ65Updated 2 years ago