Code for KaLM-Embedding models
☆117Jun 30, 2025Updated 10 months ago
Alternatives and similar repositories for KaLM-Embedding
Users that are interested in KaLM-Embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Use contrastive learning to train a large language model (LLM) as a retriever☆12Jul 19, 2024Updated last year
- code for piccolo embedding model from SenseTime☆144May 21, 2024Updated last year
- [ACM MM25] Official Pytorch implementation of [Decoupled Global-Local Alignment for Improving Compositional Understanding]☆16Jul 15, 2025Updated 9 months ago
- ☆63Jan 26, 2025Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆227Apr 8, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆63Aug 2, 2024Updated last year
- ☆13Jan 22, 2025Updated last year
- ☆59Feb 27, 2025Updated last year
- ☆24Oct 16, 2025Updated 6 months ago
- Model implementation for the contextual embeddings project☆47Jun 2, 2025Updated 11 months ago
- This repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]☆639Apr 28, 2026Updated last week
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updates☆24Jul 1, 2025Updated 10 months ago
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆50Oct 18, 2024Updated last year
- Fast search index for SPLADE sparse retrieval models implemented in Python using Numpy and Numba☆38Oct 16, 2025Updated 6 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆49Nov 13, 2023Updated 2 years ago
- ☆12Jun 13, 2025Updated 10 months ago
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆79Dec 8, 2025Updated 5 months ago
- Effective Unsupervised Domain Adaptation of Neural Rankers by Diversifying Synthetic Query Generation☆15Apr 23, 2025Updated last year
- TextEmbed is a REST API crafted for high-throughput and low-latency embedding inference. It accommodates a wide variety of embedding mode…☆28Sep 5, 2024Updated last year
- SSRL: Self-Search Reinforcement Learning☆207Aug 20, 2025Updated 8 months ago
- ☆46Jun 11, 2025Updated 10 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- Control LLM☆23Apr 6, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆55Apr 18, 2026Updated 3 weeks ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆23Jun 30, 2025Updated 10 months ago
- ☆35May 16, 2025Updated 11 months ago
- Generative Representational Instruction Tuning☆691Jun 25, 2025Updated 10 months ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆272Sep 25, 2025Updated 7 months ago
- 中文预训练ModernBert☆100Apr 11, 2025Updated last year
- Code for explaining and evaluating late chunking (chunked pooling)☆510Dec 23, 2024Updated last year
- code for training & evaluating Contextual Document Embedding models☆204May 14, 2025Updated 11 months ago
- ☆1,920Sep 30, 2025Updated 7 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆20Apr 8, 2025Updated last year
- Submodular optimization for context engineering: query fan-out, text selection, passage reranking☆80Jul 14, 2025Updated 9 months ago
- FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens☆17Sep 8, 2025Updated 8 months ago
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes☆51Aug 7, 2025Updated 9 months ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆735May 2, 2026Updated last week
- Code for Robust Fine-tuning (RbFT)☆17Jan 31, 2025Updated last year
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆51Dec 7, 2024Updated last year