raphaelsty / neural-tree
Tree-based indexes for neural-search
β29Updated 11 months ago
Alternatives and similar repositories for neural-tree:
Users that are interested in neural-tree are comparing it to the libraries listed below
- NLP with Rust for Python π¦πβ61Updated 8 months ago
- Efficient BM25 with DuckDB π¦β39Updated 2 months ago
- Pre-train Static Word Embeddingsβ47Updated 3 weeks ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrievalβ44Updated 7 months ago
- Library for fast text representation and classification.β28Updated last year
- Training code for Sparse Autoencoders on Embedding modelsβ35Updated 2 months ago
- Inference engine for GLiNER models, in Rustβ40Updated this week
- utilities for loading and running text embeddings with onnxβ44Updated 6 months ago
- Code repository for the paper - "AdANNS: A Framework for Adaptive Semantic Search"β62Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β121Updated 2 months ago
- Using modal.com to process FineWeb-edu dataβ20Updated 2 months ago
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024β59Updated 4 months ago
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained contβ¦β56Updated 3 weeks ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).β79Updated 11 months ago
- A clone of OpenAI's Tokenizer page for HuggingFace Modelsβ44Updated last year
- Experiments for efforts to train a new and improved t5β77Updated 10 months ago
- Latent Large Language Modelsβ17Updated 5 months ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBreadβ18Updated 10 months ago
- Vector Database with support for late interaction and token level embeddings.β52Updated 4 months ago
- Generalist and Lightweight Model for Text Classificationβ65Updated this week
- BPE modification that implements removing of the intermediate tokens during tokenizer training.β25Updated 2 months ago
- π€ HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial)β17Updated 11 months ago
- β37Updated 6 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)β97Updated 11 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Faceβ32Updated last year
- π€ Trade any tensors over the networkβ30Updated last year
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, impβ¦β170Updated 5 months ago