HITsz-TMG / KaLM-Embedding
Code for KaLM-Embedding models
☆76 Updated last month
Alternatives and similar repositories for KaLM-Embedding
Users who are interested in KaLM-Embedding compare it to the libraries listed below.
- ☆34 Updated last year
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ☆82 Updated 3 months ago
- ☆43 Updated 3 months ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning ☆89 Updated last year
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples' ☆77 Updated last year
- 🚢 Data Toolkit for Sailor Language Models ☆90 Updated 2 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models. ☆40 Updated 5 months ago
- ☆62 Updated 9 months ago
- ☆67 Updated last year
- Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks". ☆121 Updated last week
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark ☆140 Updated 4 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F… ☆62 Updated 11 months ago
- Data preparation code for CrystalCoder 7B LLM ☆44 Updated last year
- An Experiment on Dynamic NTK Scaling RoPE ☆64 Updated last year
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper ☆136 Updated 9 months ago
- Implementations of the online merging optimizers proposed in "Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment" ☆75 Updated 10 months ago
- Joint use of the CPO and SimPO methods for better reference-free preference learning. ☆53 Updated 9 months ago
- Unofficial implementation of AlpaGasus ☆91 Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆78 Updated 7 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆135 Updated 6 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers. ☆34 Updated this week
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages ☆45 Updated 5 months ago
- ☆17 Updated last year
- Complex Function Calling Benchmark. ☆99 Updated 3 months ago
- ☆69 Updated last year
- FuseAI Project ☆86 Updated 3 months ago
- ☆43 Updated 9 months ago
- Sakura-SOLAR-DPO: Merge, SFT, and DPO ☆117 Updated last year
- A paper list of multilingual pre-trained models (continually updated). ☆21 Updated 10 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆55 Updated 7 months ago