embeddings-benchmark / arenaLinks
Code for the MTEB Arena
☆24Updated 5 months ago
Alternatives and similar repositories for arena
Users that are interested in arena are comparing it to the libraries listed below
Sorting:
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆59Updated 5 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆67Updated 3 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Updated 2 months ago
- Datasets collection and preprocessings framework for NLP extreme multitask learning☆189Updated 5 months ago
- ☆59Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆71Updated last year
- ☆138Updated 4 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆80Updated last year
- ☆46Updated 5 months ago
- Python library to use Pleias-RAG models☆67Updated 7 months ago
- ☆90Updated 5 months ago
- Scrape and export data from the Open LLM Leaderboard.☆48Updated last year
- code for training & evaluating Contextual Document Embedding models☆201Updated 7 months ago
- ☆53Updated 10 months ago
- Let's build better datasets, together!☆267Updated last year
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆25Updated last year
- ☆90Updated last week
- Code for Zero-Shot Tokenizer Transfer☆142Updated 11 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆67Updated 3 weeks ago
- ☆120Updated last year
- Pre-train Static Word Embeddings☆94Updated 3 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated last year
- Crispy reranking models by Mixedbread☆42Updated 3 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆154Updated 5 months ago
- Synthetic Data Generation for Evaluation☆13Updated 10 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆278Updated last year
- An attribution library for LLMs☆46Updated last year
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆222Updated last week
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆46Updated last year
- ☆56Updated 11 months ago