HITsz-TMG / KaLM-EmbeddingLinks
Code for KaLM-Embedding models
☆96Updated 4 months ago
Alternatives and similar repositories for KaLM-Embedding
Users that are interested in KaLM-Embedding are comparing it to the libraries listed below
Sorting:
- ☆54Updated 9 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆48Updated 11 months ago
- 🚢 Data Toolkit for Sailor Language Models☆94Updated 8 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆87Updated 9 months ago
- ☆90Updated 5 months ago
- ☆62Updated last year
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆160Updated last month
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆152Updated last year
- Complex Function Calling Benchmark.☆147Updated 9 months ago
- Automatic prompt optimization framework for multi-step agent tasks.☆35Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆80Updated last year
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆68Updated last year
- Model implementation for the contextual embeddings project☆36Updated 5 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆190Updated last year
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆60Updated last year
- ☆155Updated last year
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆65Updated last year
- Verifiers for LLM Reinforcement Learning☆79Updated 6 months ago
- Reformatted Alignment☆112Updated last year
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆144Updated last year
- This is the official repository for Inheritune.☆115Updated 9 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆206Updated 4 months ago
- FuseAI Project☆87Updated 9 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated last year
- Code for paper titled "Towards the Law of Capacity Gap in Distilling Language Models"☆102Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆49Updated last year
- ☆71Updated 11 months ago
- ☆60Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Updated last year