MoritzLaurer / zeroshot-classifier
Notebooks for training universal 0-shot classifiers on many different tasks
☆125Updated 4 months ago
Alternatives and similar repositories for zeroshot-classifier:
Users that are interested in zeroshot-classifier are comparing it to the libraries listed below
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes☆202Updated this week
- Generalist and Lightweight Model for Text Classification☆124Updated last week
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection☆100Updated last year
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆108Updated 11 months ago
- Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training☆66Updated 3 months ago
- Late Interaction Models Training & Retrieval☆306Updated this week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆131Updated 4 months ago
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆68Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆76Updated 6 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆61Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)☆101Updated last year
- ☆62Updated 9 months ago
- ☆117Updated 8 months ago
- Pre-train Static Word Embeddings☆59Updated 3 weeks ago
- Codebase accompanying the Summary of a Haystack paper.☆77Updated 7 months ago
- experiments with inference on llama☆104Updated 11 months ago
- Robust and fast topic models with sentence-transformers.☆48Updated this week
- Efficient few-shot learning with cross-encoders.☆51Updated last year
- ☆151Updated 5 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp…☆175Updated 8 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆80Updated last year
- code for training & evaluating Contextual Document Embedding models☆184Updated this week
- Let's build better datasets, together!☆259Updated 4 months ago
- awesome synthetic (text) datasets☆280Updated 6 months ago
- ☆77Updated 11 months ago
- The official code repo for "Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations".☆81Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆180Updated 4 months ago
- Finetune mistral-7b-instruct for sentence embeddings☆81Updated last year
- My personal site☆73Updated 9 months ago