HITsz-TMG / KaLM-Embedding
Code for KaLM-Embedding models
☆71Updated last month
Alternatives and similar repositories for KaLM-Embedding:
Users that are interested in KaLM-Embedding are comparing it to the libraries listed below
- ☆62Updated 6 months ago
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆36Updated 2 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆62Updated 8 months ago
- ☆33Updated 3 weeks ago
- 🚢 Data Toolkit for Sailor Language Models☆85Updated last month
- Implementation of the LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens Paper☆125Updated 6 months ago
- AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆125Updated 2 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆77Updated last month
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆128Updated 3 months ago
- ☆31Updated 11 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 9 months ago
- Automatic prompt optimization framework for multi-step agent tasks.☆27Updated 3 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆63Updated last month
- Official implementation for 'Extending LLMs’ Context Window with 100 Samples'☆76Updated last year
- Code and data for CoachLM, an automatic instruction revision approach LLM instruction tuning.☆60Updated 10 months ago
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆213Updated this week
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆67Updated 3 months ago
- ☆69Updated last year
- FuseAI Project☆83Updated 3 weeks ago
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆48Updated last month
- Evaluation of bm42 sparse indexing algorithm☆64Updated 7 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆135Updated 3 months ago
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆62Updated 2 weeks ago
- Unofficial implementation of AlpaGasus☆90Updated last year
- a curated list of the role of small models in the LLM era☆90Updated 4 months ago
- Comprehensive benchmark for RAG☆112Updated 3 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 3 months ago