raphaelsty / neural-treeLinks
Tree-based indexes for neural-search
β31Updated last year
Alternatives and similar repositories for neural-tree
Users that are interested in neural-tree are comparing it to the libraries listed below
Sorting:
- NLP with Rust for Python π¦πβ70Updated 7 months ago
- Pre-train Static Word Embeddingsβ92Updated 3 months ago
- β89Updated 5 months ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tablesβ21Updated 6 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"β65Updated 2 years ago
- Efficient BM25 with DuckDB π¦β59Updated 11 months ago
- lossily compress representation vectors using product quantizationβ59Updated last month
- minimal pytorch implementation of bm25 (with sparse tensors)β104Updated last month
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.β173Updated 7 months ago
- β53Updated 10 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.β67Updated 2 months ago
- PyTorch implementation for MRLβ20Updated last year
- Vector Database with support for late interaction and token level embeddings.β54Updated 5 months ago
- utilities for loading and running text embeddings with onnxβ44Updated 3 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).β79Updated last year
- Datamodels for hugging face tokenizersβ86Updated last week
- Python library to use Pleias-RAG modelsβ67Updated 7 months ago
- Efficiently computing & storing token n-grams from large corporaβ26Updated last year
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrievalβ59Updated last year
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained contβ¦β68Updated 2 months ago
- PyLate efficient inference engineβ68Updated 3 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, impβ¦β199Updated last year
- CLIR version of ColBERTβ74Updated 5 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β153Updated 4 months ago
- Training code for Sparse Autoencoders on Embedding modelsβ39Updated 9 months ago
- Efficient vector database for hundred millions of embeddings.β211Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Faceβ32Updated 2 years ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and teβ¦β44Updated last year
- β21Updated last year
- Python API for https://vespa.ai, the open big data serving engineβ151Updated last week