HITsz-TMG / KaLM-Embedding
Code for KaLM-Embedding models
☆75Updated last month
Alternatives and similar repositories for KaLM-Embedding:
Users that are interested in KaLM-Embedding are comparing it to the libraries listed below
- ☆42Updated 2 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆40Updated 4 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆82Updated 3 months ago
- "Syntriever: How to Train Your Retriever with Synthetic Data from LLMs" the Nations of the Americas Chapter of the Association for Comput…☆24Updated last month
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- Implementation of the paper: "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" from Google in pyTO…☆55Updated this week
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆137Updated 4 months ago
- MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning☆90Updated last year
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆62Updated 10 months ago
- ☆17Updated 11 months ago
- This repository contains the joint use of CPO and SimPO method for better reference-free preference learning methods.☆53Updated 8 months ago
- ☆62Updated 9 months ago
- ☆33Updated last year
- ☆67Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆34Updated 4 months ago
- ☆45Updated 3 weeks ago
- ☆47Updated 7 months ago
- This is the official repository for Inheritune.☆111Updated 2 months ago
- Complex Function Calling Benchmark.☆96Updated 3 months ago
- 🚢 Data Toolkit for Sailor Language Models☆88Updated last month
- Automatic prompt optimization framework for multi-step agent tasks.☆29Updated 5 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 11 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆63Updated 8 months ago
- Test-time compute in information retrieval☆22Updated 2 weeks ago
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆36Updated last year
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆134Updated 9 months ago
- Tool for converting LLMs from uni-directional to bi-directional by removing causal mask for tasks like classification and sentence embedd…☆57Updated 4 months ago
- ☆55Updated 5 months ago
- The paper list of multilingual pre-trained models (Continual Updated).☆20Updated 10 months ago
- Astraios: Parameter-Efficient Instruction Tuning Code Language Models☆57Updated last year