A library to synthesize text datasets using Large Language Models (LLM)
☆152Jan 17, 2023Updated 3 years ago
Alternatives and similar repositories for mutate
Users that are interested in mutate are comparing it to the libraries listed below
Sorting:
- A Python library aimed at dissecting and augmenting NER training data.☆61May 11, 2023Updated 2 years ago
- ☆13Feb 26, 2023Updated 3 years ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆335Apr 25, 2025Updated 10 months ago
- Few-shot Named Entity Recognition☆121Mar 30, 2022Updated 3 years ago
- STriP Net: Semantic Similarity of Scientific Papers (S3P) Network☆86Jun 13, 2022Updated 3 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆215Sep 29, 2024Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆157May 24, 2024Updated last year
- Active Learning for Text Classification in Python☆639Feb 1, 2026Updated 3 weeks ago
- Contains Colab Notebooks show cool use-cases of different GCP ML APIs.☆10Nov 5, 2020Updated 5 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Aug 15, 2021Updated 4 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Jun 19, 2023Updated 2 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡☆85Feb 1, 2026Updated 3 weeks ago
- [NAACL 2021] This is the code for our paper `Fine-Tuning Pre-trained Language Model with Weak Supervision: A Contrastive-Regularized Self…☆206Aug 17, 2022Updated 3 years ago
- Self-training with Weak Supervision (NAACL 2021)☆163Jul 24, 2023Updated 2 years ago
- Efficient few-shot learning with Sentence Transformers☆2,683Dec 11, 2025Updated 2 months ago
- Implements RNNPool and SoftPool for CNNs.☆14Jan 29, 2021Updated 5 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆926Sep 2, 2024Updated last year
- A repository for our AAAI-2020 Cross-lingual-NER paper. Code will be updated shortly.☆47Dec 2, 2022Updated 3 years ago
- spaCy match and replace, maintaining conjugation☆36Dec 9, 2022Updated 3 years ago
- ☆14May 15, 2020Updated 5 years ago
- Use spaCy for NLP and output to the FoLiA XML format.☆12Feb 27, 2024Updated 2 years ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets☆4,875Updated this week
- This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…☆41Jan 5, 2022Updated 4 years ago
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…☆18Jun 24, 2022Updated 3 years ago
- Text-to-SQL in the Wild: A Naturally-Occurring Dataset Based on Stack Exchange Data☆104Jun 25, 2021Updated 4 years ago
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆396May 11, 2023Updated 2 years ago
- ☆44Mar 3, 2023Updated 2 years ago
- Model explainability that works seamlessly with 🤗 transformers. Explain your transformers model in just 2 lines of code.☆1,410Aug 30, 2023Updated 2 years ago
- Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.☆1,752Dec 20, 2023Updated 2 years ago
- NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for …☆108May 4, 2022Updated 3 years ago
- SpanMarker for Named Entity Recognition☆465Jan 8, 2025Updated last year
- Python library for automatic training, optimization and comparison of Transformer models on most NLP tasks.☆20May 6, 2023Updated 2 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆15Jun 6, 2023Updated 2 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆267Nov 8, 2024Updated last year
- BOND: BERT-Assisted Open-Domain Name Entity Recognition with Distant Supervision☆291Jun 2, 2021Updated 4 years ago
- Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge☆17Nov 16, 2021Updated 4 years ago
- ☆69May 1, 2025Updated 9 months ago
- FastFormers - highly efficient transformer models for NLU☆709Mar 21, 2025Updated 11 months ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆403Jul 30, 2021Updated 4 years ago