embeddings-benchmark / arenaLinks
Code for the MTEB Arena
☆24Updated 4 months ago
Alternatives and similar repositories for arena
Users that are interested in arena are comparing it to the libraries listed below
Sorting:
- ☆86Updated 4 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆69Updated last year
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆54Updated 4 months ago
- ☆51Updated 9 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆67Updated 2 months ago
- code for training & evaluating Contextual Document Embedding models☆200Updated 6 months ago
- Code for Zero-Shot Tokenizer Transfer☆142Updated 10 months ago
- ☆138Updated 3 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated last month
- ☆58Updated last year
- ☆55Updated last year
- An attribution library for LLMs☆46Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters☆277Updated last year
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆189Updated 4 months ago
- An introduction to LLM Sampling☆79Updated 11 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆67Updated 4 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆79Updated last year
- ☆83Updated 3 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated last year
- ☆87Updated this week
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆146Updated 9 months ago
- Pre-train Static Word Embeddings☆91Updated 2 months ago
- Python library to use Pleias-RAG models☆67Updated 6 months ago
- ☆92Updated 5 months ago
- ☆120Updated last year
- The first dense retrieval model that can be prompted like an LM☆89Updated 6 months ago
- Supercharge huggingface transformers with model parallelism.☆77Updated 4 months ago
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data☆68Updated last year
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆25Updated last year
- Let's build better datasets, together!☆265Updated 11 months ago