MinishLab / model2vec
Distill a Small Static Model from any Sentence Transformer
☆389 Updated this week
Related projects
Alternatives and complementary repositories for model2vec
- Toolkit for attaching, training, saving and loading of new heads for transformer models ☆242 Updated last week
- FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes ☆182 Updated last month
- awesome synthetic (text) datasets ☆239 Updated 2 weeks ago
- Late Interaction Models Training & Retrieval ☆161 Updated 2 weeks ago
- ☆106 Updated 3 weeks ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… ☆158 Updated 2 months ago
- Let's build better datasets, together! ☆202 Updated 3 months ago
- Easily embed, cluster and semantically label text datasets ☆460 Updated 7 months ago
- ☆204 Updated 4 months ago
- Neural Search ☆344 Updated 5 months ago
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy ☆875 Updated last week
- Manage scalable open LLM inference endpoints in Slurm clusters ☆237 Updated 4 months ago
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models. ☆1,049 Updated last week
- Generalist and Lightweight Model for Text Classification ☆48 Updated 2 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆131 Updated last month
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆383 Updated 8 months ago
- Retrieve, Read and LinK: Fast and Accurate Entity Linking and Relation Extraction on an Academic Budget (ACL 2024) ☆326 Updated last month
- ☆64 Updated this week
- ☆130 Updated 2 weeks ago
- An Open Source Toolkit For LLM Distillation ☆352 Updated last month
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ☆576 Updated last week
- Notebooks for training universal 0-shot classifiers on many different tasks ☆104 Updated 7 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free ☆219 Updated last week
- ☆92 Updated last month
- Efficient vector database for hundreds of millions of embeddings. ☆200 Updated 5 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models. ☆133 Updated 3 months ago
- data cleaning and curation for unstructured text ☆327 Updated 3 months ago
- Generalist and Lightweight Model for Relation Extraction (Extract any relationship types from text) ☆93 Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆788 Updated this week
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆126 Updated this week