raphaelsty / neural-treeLinks
Tree-based indexes for neural-search
β31Updated last year
Alternatives and similar repositories for neural-tree
Users that are interested in neural-tree are comparing it to the libraries listed below
Sorting:
- NLP with Rust for Python π¦πβ71Updated 8 months ago
- Efficient BM25 with DuckDB π¦β61Updated last year
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"β66Updated 2 years ago
- β90Updated 7 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)β104Updated 3 months ago
- utilities for loading and running text embeddings with onnxβ45Updated 5 months ago
- Pre-train Static Word Embeddingsβ94Updated 4 months ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tablesβ21Updated 8 months ago
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, includingβ¦β68Updated 2 months ago
- CLIR version of ColBERTβ73Updated 7 months ago
- Python library to use Pleias-RAG modelsβ68Updated 9 months ago
- PyLate efficient inference engineβ71Updated 3 weeks ago
- lossily compress representation vectors using product quantizationβ59Updated 3 months ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrievalβ61Updated last year
- Trully flash implementation of DeBERTa disentangled attention mechanism.β74Updated last week
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, impβ¦β202Updated last year
- Official Repository for "Hypencoder: Hypernetworks for Information Retrieval"β33Updated 4 months ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained contβ¦β72Updated last month
- β53Updated 11 months ago
- PyTorch implementation for MRLβ21Updated last year
- Efficiently computing & storing token n-grams from large corporaβ26Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.β181Updated 9 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β155Updated 6 months ago
- Efficient vector database for hundred millions of embeddings.β211Updated last year
- Datamodels for hugging face tokenizersβ87Updated this week
- Using modal.com to process FineWeb-edu dataβ20Updated 10 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).β82Updated last year
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ103Updated 2 years ago
- Training code for Sparse Autoencoders on Embedding modelsβ39Updated 11 months ago
- Vector Database with support for late interaction and token level embeddings.β54Updated 7 months ago