microsoft / MoPQLinks
☆12Updated 3 years ago
Alternatives and similar repositories for MoPQ
Users that are interested in MoPQ are comparing it to the libraries listed below
Sorting:
- ☆74Updated 2 years ago
- Repo for WWW 2022 paper: Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval☆15Updated 3 years ago
- Official code for "Binary embedding based retrieval at Tencent"☆43Updated last year
- ☆24Updated last year
- Retrieval with Learned Similarities (http://arxiv.org/abs/2407.15462, WWW'25 Oral)☆43Updated 2 months ago
- [NeurIPS 2023] Model-enhanced Vector Index☆26Updated last year
- The official repository for "Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation", Shen…☆121Updated last year
- WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval☆120Updated 10 months ago
- Inference framework for MoE layers based on TensorRT with Python binding☆41Updated 4 years ago
- CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.☆52Updated 3 years ago
- Repository of LV-Eval Benchmark☆67Updated 9 months ago
- Retrieval as Attention☆82Updated 2 years ago
- Source code for COLING 2022 paper "Automatic Label Sequence Generation for Prompting Sequence-to-sequence Models"☆24Updated 2 years ago
- 🌱 梦想家(DreamerGPT):中文大语言模型指令精调☆50Updated 2 years ago
- ☆67Updated 3 years ago
- Vocabulary Parallelism☆19Updated 3 months ago
- Dynamic Context Selection for Efficient Long-Context LLMs☆33Updated last month
- TSDG: An efficient index graph for graph-based nearest neighbor search☆9Updated 2 years ago
- Odysseus: Playground of LLM Sequence Parallelism☆70Updated last year
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆33Updated 8 months ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆154Updated last year
- This package implements THOR: Transformer with Stochastic Experts.☆65Updated 3 years ago
- Implementation of "RankCSE: Unsupervised Sentence Representation Learning via Learning to Rank" (ACL 2023)☆47Updated last year
- [WWW 2024] The official repo for paper "Scalable and Effective Generative Information Retrieval".☆57Updated last year
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆69Updated last year
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆40Updated last year
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆64Updated last year
- ☆35Updated last year
- [ICLR 2025] MiniPLM: Knowledge Distillation for Pre-Training Language Models☆47Updated 7 months ago
- ☆20Updated 2 months ago