flairNLP / fabricator
[EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.
☆111Updated last year
Alternatives and similar repositories for fabricator
Users who are interested in fabricator are comparing it to the libraries listed below.
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆213Updated 3 months ago
- Efficient few-shot learning with cross-encoders.☆60Updated last year
- Simply, faster, sentence-transformers☆143Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasks☆137Updated 11 months ago
- Generalist and Lightweight Model for Text Classification☆167Updated 2 weeks ago
- A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
- Versatile framework designed to streamline the integration of your models, as well as those sourced from Hugging Face, into complex progr…☆34Updated 4 months ago
- Efficiently find the best-suited language model (LM) for your NLP task☆132Updated 4 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- Pre-train Static Word Embeddings☆94Updated 3 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆82Updated last year
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆67Updated 2 months ago
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆33Updated last year
- Dataset collection and preprocessing framework for NLP extreme multitask learning☆189Updated 5 months ago
- ☆53Updated 5 months ago
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated 2 years ago
- multimodal document analysis☆166Updated last month
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated last month
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆74Updated 2 months ago
- ☆87Updated 8 months ago
- Powerful unsupervised domain adaptation method for dense retrieval. Requires only unlabeled corpus and yields massive improvement: "GPL: …☆338Updated 2 years ago
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆103Updated last year
- Few-shot Named Entity Recognition☆122Updated 3 years ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling.☆64Updated last year
- Robust and fast topic models with sentence-transformers.☆84Updated last week
- Using open source LLMs to build synthetic datasets for direct preference optimization☆71Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Updated 2 years ago
- Retrieval-Augmented Generation battle!☆61Updated 4 months ago
- ☆43Updated 2 years ago
- RaKUn 2.0 - A fast keyword detection algorithm☆69Updated 4 months ago