kamalkraj/e5-mistral-7b-instruct

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kamalkraj/e5-mistral-7b-instruct)

kamalkraj / e5-mistral-7b-instruct

Finetune mistral-7b-instruct for sentence embeddings

☆89

Alternatives and similar repositories for e5-mistral-7b-instruct

Users that are interested in e5-mistral-7b-instruct are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

iliaschalkidis / flash-roberta
View on GitHub
Hugging Face RoBERTa with Flash Attention 2
☆24Sep 14, 2025Updated 10 months ago
gabrielchua / embedding-adapter
View on GitHub
A lightweight open-source package to fine-tune embedding models.
☆22Feb 4, 2024Updated 2 years ago
EliasMei / IPM
View on GitHub
Repo - Paper "Capturing Semantics for Imputation with Pre-trained Language Models." [ICDE 2021]
☆10Mar 13, 2022Updated 4 years ago
amazon-science / wikiwiki-dataset
View on GitHub
☆11May 11, 2022Updated 4 years ago
hieudx149 / X-RetroMAE
View on GitHub
Code Roberta version of RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder
☆10Mar 16, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ysunbp / RECA-paper
View on GitHub
Code and data for the VLDB 2023 paper: RECA: Related Tables Enhanced Column Semantic Type Annotation Framework
☆12May 7, 2025Updated last year
UKPLab / AdaSent
View on GitHub
This repository contains the code for the EMNLP'23 paper "AdaSent: Efficient Domain-Adapted Sentence Embeddings for Few-Shot Classificati…
☆16Jun 3, 2024Updated 2 years ago
megagonlabs / sudowoodo
View on GitHub
The source code of the Sudowoodo paper in ICDE 2023
☆19May 24, 2023Updated 3 years ago
microsoft / multifield-adaptive-retrieval
View on GitHub
Code for the paper "Multi-Field Adaptive Retrieval," a research project on a semi-structured document retrieval
☆18Feb 13, 2026Updated 5 months ago
pacman100 / accelerate-deepspeed-test
View on GitHub
Testing DeepSpeed integration in 🤗 Accelerate
☆11Jun 28, 2022Updated 4 years ago
ielab / llm-rankers
View on GitHub
Document Ranking with Large Language Models.
☆210Feb 14, 2026Updated 5 months ago
LeeSureman / E5-Retrieval-Reproduction
View on GitHub
Use contrastive learning to train a large language model (LLM) as a retriever
☆12Jul 19, 2024Updated 2 years ago
LoveCatc / supervised-llm-uncertainty-estimation
View on GitHub
This repo contains code for paper: "Uncertainty Estimation and Quantification for LLMs: A Simple Supervised Approach".
☆26Oct 21, 2024Updated last year
latynt / ans
View on GitHub
Arabic News Stance Corpus
☆11Feb 5, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
frinkleko / LIMIT-Sparse-Embedding
View on GitHub
Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…
☆16Sep 4, 2025Updated 10 months ago
kaistAI / InstructIR
View on GitHub
IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…
☆32Jun 13, 2024Updated 2 years ago
tlkh / t2t-tuner
View on GitHub
Convenient Text-to-Text Training for Transformers
☆18Dec 10, 2021Updated 4 years ago
metterian / peep-talk
View on GitHub
A Situational Conversation-Based English Education Platform
☆22Jan 16, 2026Updated 6 months ago
worldbank / GISTEmbed
View on GitHub
GISTEmbed: Guided In-sample Selection of Training Negatives for Text Embeddings
☆45Mar 6, 2024Updated 2 years ago
bcaitech1 / p3-mrc-team-ikyo
View on GitHub
Naver Boostcamp AI Tech Stage 3 : MRC (Machine Reading Comprehension)
☆10Jun 10, 2021Updated 5 years ago
megagonlabs / rotom
View on GitHub
Code for the paper "Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond…
☆24May 31, 2022Updated 4 years ago
afnf33 / emoTale
View on GitHub
☆10Dec 3, 2020Updated 5 years ago
facebookresearch / tart
View on GitHub
Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.
☆168Oct 4, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
J-Seo / KoCommonGEN-V2
View on GitHub
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
☆25Aug 24, 2024Updated last year
sirluk / llm_finetuning
View on GitHub
☆15Nov 4, 2024Updated last year
megagonlabs / starmie
View on GitHub
Resources for PVLDB 2023 submission
☆29Aug 28, 2024Updated last year
texttron / tevatron
View on GitHub
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆743Jul 18, 2026Updated last week
UKPLab / EACL21-personalized-conversational-system
View on GitHub
☆12Nov 19, 2022Updated 3 years ago
yeyupiaoling / Chinese-LLM-Chat
View on GitHub
大语言模型微调的项目，包含了使用QLora微调ChatGLM和LLama
☆29Jun 26, 2023Updated 3 years ago
fabrahman / char-centric-story
View on GitHub
Codebase for character-centric story understanding
☆14Jan 20, 2022Updated 4 years ago
chentong0 / factoid-wiki
View on GitHub
Dense X Retrieval: What Retrieval Granularity Should We Use?
☆171Jan 8, 2024Updated 2 years ago
yuri-bizzoni / EmoArc
View on GitHub
☆11Apr 9, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
dbamman / lrec2020-coref
View on GitHub
Code and data to support Bamman et al. (2020), "A Dataset of Literary Coreference" (LREC)
☆11Dec 8, 2022Updated 3 years ago
OpenBMB / DEBATER
View on GitHub
This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searc…
☆26Mar 2, 2025Updated last year
ssbuild / llm_rlhf
View on GitHub
realize the reinforcement learning training for gpt2 llama bloom and so on llm model
☆27Sep 19, 2023Updated 2 years ago
s-smits / modernbert-finetune
View on GitHub
Fine-tune ModernBERT with custom tokenizers, curriculum learning, and next-gen optimizers.
☆74Jan 16, 2026Updated 6 months ago
ContextualAI / gritlm
View on GitHub
Generative Representational Instruction Tuning
☆697Jun 25, 2025Updated last year
yao8839836 / kg-llm
View on GitHub
Exploring large language models for knowledge graph completion. ICASSP 2025
☆162Aug 23, 2025Updated 11 months ago
trapoom555 / Language-Model-STS-CFT
View on GitHub
Improving Text Embedding of Language Models Using Contrastive Fine-tuning
☆64Aug 2, 2024Updated last year