Code for KaLM-Embedding models
☆117Jun 30, 2025Updated 9 months ago
Alternatives and similar repositories for KaLM-Embedding
Users that are interested in KaLM-Embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- code for piccolo embedding model from SenseTime☆145May 21, 2024Updated last year
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]☆16Jul 15, 2025Updated 9 months ago
- ☆60Jan 26, 2025Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆225Apr 8, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆63Aug 2, 2024Updated last year
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆12Mar 19, 2024Updated 2 years ago
- ☆13Jan 22, 2025Updated last year
- Official implementation of our paper "Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Opera…☆11Sep 20, 2024Updated last year
- ☆24Oct 16, 2025Updated 6 months ago
- ☆59Feb 27, 2025Updated last year
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 10 months ago
- ☆29Nov 9, 2025Updated 5 months ago
- This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]☆624Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆24Jul 1, 2025Updated 9 months ago
- PreRanker: reranking tools before tool-use☆21Apr 9, 2025Updated last year
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆50Oct 18, 2024Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆78Dec 8, 2025Updated 4 months ago
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation☆15Apr 23, 2025Updated 11 months ago
- TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode…☆28Sep 5, 2024Updated last year
- SSRL: Self-Search Reinforcement Learning☆207Aug 20, 2025Updated 7 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆46Jun 11, 2025Updated 10 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- ☆55Jan 15, 2026Updated 3 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆23Jun 30, 2025Updated 9 months ago
- ☆35May 16, 2025Updated 11 months ago
- The code of our paper "RaSeRec: Retrieval-Augmented Sequential Recommendation"☆28Jan 7, 2025Updated last year
- Generative Representational Instruction Tuning☆690Jun 25, 2025Updated 9 months ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆271Sep 25, 2025Updated 6 months ago
- 中文预训练ModernBert☆99Apr 11, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆1,901Sep 30, 2025Updated 6 months ago
- Code for explaining and evaluating late chunking (chunked pooling)☆499Dec 23, 2024Updated last year
- code for training & evaluating Contextual Document Embedding models☆203May 14, 2025Updated 11 months ago
- ☆20Apr 8, 2025Updated last year
- [EMNLP 2024] This is the code for our paper "BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers".☆24Sep 19, 2024Updated last year
- Code for Robust Fine-tuning (RbFT)☆17Jan 31, 2025Updated last year
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆51Dec 7, 2024Updated last year