TencentARC / BEBRLinks
Official code for "Binary embedding based retrieval at Tencent"
☆43Updated last year
Alternatives and similar repositories for BEBR
Users that are interested in BEBR are comparing it to the libraries listed below
Sorting:
- Retrieval with Learned Similarities (http://arxiv.org/abs/2407.15462, WWW'25 Oral)☆43Updated last month
- ☆12Updated 3 years ago
- Odysseus: Playground of LLM Sequence Parallelism☆69Updated 11 months ago
- Source code for SIGMOD 2020 paper "Improving Approximate Nearest Neighbor Search through Learned Adaptive Early Termination"☆54Updated 4 years ago
- ☆74Updated 2 years ago
- hnsw implemented by python☆66Updated 6 years ago
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024☆61Updated 8 months ago
- 为HSNW源码加上了详细的注释☆19Updated 2 years ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"☆64Updated last year
- Distributed IO-aware Attention algorithm☆20Updated 9 months ago
- Fast C++ implementation of https://github.com/yahoo/lopq: Locally Optimized Product Quantization (LOPQ) model and searcher for approximat…☆35Updated 5 years ago
- PQ Fast Scan☆62Updated 6 years ago
- [NeurIPS 2023] Model-enhanced Vector Index☆26Updated last year
- A library of algorithms for approximate nearest neighbor search in high dimensions, along with a set of useful tools for designing such a…☆145Updated this week
- QuickerADC is an implementation of highly-efficient product quantizers leveraging SIMD shuffle instructions integrated into FAISS☆60Updated 6 years ago
- Implementation of IceFormer: Accelerated Inference with Long-Sequence Transformers on CPUs (ICLR 2024).☆25Updated 11 months ago
- Code for paper: Towards Similarity Graphs Constructed by Deep Reinforcement Learning☆21Updated 5 years ago
- Vocabulary Parallelism☆19Updated 2 months ago
- NASRec Weight Sharing Neural Architecture Search for Recommender Systems☆30Updated last year
- A fast header-only graph-based index for approximate nearest neighbor search (ANNS). https://flatnav.net☆27Updated 2 weeks ago
- [ACL 2024] RelayAttention for Efficient Large Language Model Serving with Long System Prompts☆39Updated last year
- CUDA implementation of Hierarchical Navigable Small World Graph algorithm☆158Updated 4 years ago
- Official software repository of S. Bruch, F. M. Nardini, C. Rulli, and R. Venturini. "Efficient Inverted Indexes for Approximate Retrieva…☆66Updated 2 weeks ago
- Best practices for testing advanced Mixtral, DeepSeek, and Qwen series MoE models using Megatron Core MoE.☆17Updated this week
- Linear Attention Sequence Parallelism (LASP)☆83Updated last year
- Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.☆20Updated 2 weeks ago
- ☆73Updated 4 months ago
- GGNN: State of the Art Graph-based GPU Nearest Neighbor Search☆157Updated 3 months ago
- ☆163Updated last week
- Reducing Dimensionality method for Nearest Neighbor Search☆15Updated 4 years ago