microsoft / MoPQ
☆12 Updated 3 years ago
Alternatives and similar repositories for MoPQ:
Users who are interested in MoPQ are comparing it to the libraries listed below.
- ☆74 Updated 2 years ago
- ☆24 Updated last year
- Repo for WWW 2022 paper: Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval ☆15 Updated 3 years ago
- Official code for "Binary embedding based retrieval at Tencent" ☆43 Updated last year
- Retrieval with Learned Similarities (http://arxiv.org/abs/2407.15462, WWW'25 Oral) ☆43 Updated 2 weeks ago
- CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup. ☆52 Updated 3 years ago
- [NeurIPS 2023] Model-enhanced Vector Index ☆25 Updated last year
- This package implements THOR: Transformer with Stochastic Experts. ☆61 Updated 3 years ago
- WSDM'22 Best Paper: Learning Discrete Representations via Constrained Clustering for Effective and Efficient Dense Retrieval ☆120 Updated 9 months ago
- 🌱 DreamerGPT (梦想家): instruction fine-tuning for Chinese large language models ☆50 Updated last year
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings ☆154 Updated 10 months ago
- ☆66 Updated 2 years ago
- [WWW 2024] The official repo for paper "Scalable and Effective Generative Information Retrieval". ☆55 Updated last year
- This is the official PyTorch implementation for the paper: "Directed Acyclic Graph Factorization Machines for CTR Prediction via Knowledg… ☆13 Updated 2 years ago
- Manages vllm-nccl dependency ☆17 Updated 11 months ago
- Inference framework for MoE layers based on TensorRT with Python binding ☆41 Updated 3 years ago
- ☆14 Updated last year
- Differentiable Product Quantization for End-to-End Embedding Compression. ☆62 Updated 2 years ago
- Code for EACL 2024 (main): Generative Dense Retrieval: Memory Can Be a Burden ☆25 Updated last year
- Odysseus: Playground of LLM Sequence Parallelism ☆69 Updated 10 months ago
- The official repository for "Bridging the Gap Between Indexing and Retrieval for Differentiable Search Index with Query Generation", Shen… ☆119 Updated last year
- ☆130 Updated 9 months ago
- The official implementation of "Helen: Optimizing CTR Prediction Models with Frequency-wise Hessian Eigenvalue Regularization" ☆16 Updated last year
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs" ☆12 Updated last month
- PyTorch implementation of the paper "Efficient Nearest Neighbor Language Models" (EMNLP 2021) ☆72 Updated 3 years ago
- ☆74 Updated 3 weeks ago
- HNSW implemented in Python ☆66 Updated 5 years ago
- https://acl2023-retrieval-lm.github.io/ ☆153 Updated last year
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts ☆39 Updated last year
- ☆22 Updated last year