Code for KaLM-Embedding models
☆118Jun 30, 2025Updated 10 months ago
Alternatives and similar repositories for KaLM-Embedding
Users that are interested in KaLM-Embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- code for piccolo embedding model from SenseTime☆145May 21, 2024Updated 2 years ago
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]☆16Jul 15, 2025Updated 10 months ago
- ☆63Jan 26, 2025Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆228May 6, 2026Updated 3 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Aug 2, 2024Updated last year
- Implementation for EACL 2024 paper "Corpus-Steered Query Expansion with Large Language Models"☆13Mar 19, 2024Updated 2 years ago
- Official implementation of our paper "Separate the Wheat from the Chaff: Model Deficiency Unlearning via Parameter-Efficient Module Opera…☆11Sep 20, 2024Updated last year
- ☆13Jan 22, 2025Updated last year
- ☆59Feb 27, 2025Updated last year
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 11 months ago
- ☆29Nov 9, 2025Updated 6 months ago
- This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]☆647Updated this week
- 🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹☆23Aug 2, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆26Jul 1, 2025Updated 10 months ago
- PreRanker: reranking tools before tool-use☆21Apr 9, 2025Updated last year
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆51Oct 18, 2024Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 7 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- ☆12Jun 13, 2025Updated 11 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆80Dec 8, 2025Updated 5 months ago
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation☆16Apr 23, 2025Updated last year
- TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode…☆28Sep 5, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- SSRL: Self-Search Reinforcement Learning☆208Aug 20, 2025Updated 9 months ago
- ☆47Jun 11, 2025Updated 11 months ago
- Control LLM☆23Apr 6, 2025Updated last year
- ☆55Apr 18, 2026Updated last month
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆23Jun 30, 2025Updated 10 months ago
- ☆35May 16, 2025Updated last year
- The code of our paper "RaSeRec: Retrieval-Augmented Sequential Recommendation"☆27Jan 7, 2025Updated last year
- Generative Representational Instruction Tuning☆692Jun 25, 2025Updated 11 months ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆274Sep 25, 2025Updated 8 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 中文预训练ModernBert☆100Apr 11, 2025Updated last year
- code for training & evaluating Contextual Document Embedding models☆205May 14, 2025Updated last year
- ☆1,931Sep 30, 2025Updated 7 months ago
- ☆20Apr 8, 2025Updated last year
- Submodular optimization for context engineering: query fan-out, text selection, passage reranking☆79Jul 14, 2025Updated 10 months ago
- FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens☆17Sep 8, 2025Updated 8 months ago
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes☆50Aug 7, 2025Updated 9 months ago