Lyon-NLP / mteb-french

MTEB: Massive Text Embedding Benchmark French extended

☆19

Alternatives and similar repositories for mteb-french

Users that are interested in mteb-french are comparing it to the libraries listed below

Sorting:

IBM / fastfit
FastFit ⚡ When LLMs are Unfit Use FastFit ⚡ Fast and Effective Text Classification with Many Classes
☆203Updated last week
worldbank / GISTEmbed
GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings
☆43Updated last year
MoritzLaurer / zeroshot-classifier
Notebooks for training universal 0-shot classifiers on many different tasks
☆126Updated 4 months ago
LAGoM-NLP / transtokenizer
☆45Updated 3 months ago
SalesforceAIResearch / SFR-RAG
☆74Updated 4 months ago
hltcoe / ColBERT-X
CLIR version of ColBERT
☆67Updated 2 weeks ago
google-research-datasets / swim-ir
SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…
☆48Updated last year
MinishLab / tokenlearn
Pre-train Static Word Embeddings
☆60Updated last month
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆80Updated last year
naver / bergen
Benchmarking library for RAG
☆196Updated this week
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆76Updated 6 months ago
hamelsmu / llama-inference
experiments with inference on llama
☆104Updated 11 months ago
rahmanidashti / SyntheticTestCollections
[Official Codes] Synthetic Test Collections for Retrieval Evaluation (SIGIR 2024)
☆10Updated 10 months ago
Knowledgator / GLiClass
Generalist and Lightweight Model for Text Classification
☆128Updated 2 weeks ago
urchade / GraphER
GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction
☆72Updated 9 months ago
vespa-engine / pyvespa
Python API for https://vespa.ai, the open big data serving engine
☆124Updated this week
kamalkraj / e5-mistral-7b-instruct
Finetune mistral-7b-instruct for sentence embeddings
☆80Updated last year
zetaalphavector / InPars
Inquisitive Parrots for Search
☆191Updated last year
IBM / InspectorRAGet
The repository contains generative AI analytics platform application code.
☆25Updated last week
DaoD / INTERS
This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"
☆203Updated 5 months ago
qdrant / bm42_eval
Evaluation of bm42 sparse indexing algorithm
☆66Updated 10 months ago
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆76Updated 7 months ago
hyintell / RetrievalQA
Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…
☆62Updated 11 months ago
DunZhang / Stella
☆62Updated 9 months ago
javyduck / KnowHalu
☆47Updated 11 months ago
yhoshi3 / RaLLe
RaLLe: A Framework for Developing and Evaluating Retrieval-Augmented Large Language Models
☆55Updated last year
neuml / txtinstruct
📚 Datasets and models for instruction-tuning
☆237Updated last year
terrierteam / ir_measures
provides a common interface to many IR measure tools
☆84Updated this week
davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆61Updated last year
x-tabdeveloping / turftopic
Robust and fast topic models with sentence-transformers.
☆51Updated this week