raphaelsty / neural-tree
Tree-based indexes for neural-search
☆28 · Updated 6 months ago
Related projects:
- NLP with Rust for Python 🦀🐍 · ☆57 · Updated 3 months ago
- Utilities for loading and running text embeddings with ONNX · ☆39 · Updated last month
- Late Interaction Models Training & Retrieval · ☆130 · Updated last week
- Code repository for the paper "AdANNS: A Framework for Adaptive Semantic Search" · ☆57 · Updated 11 months ago
- Minimal PyTorch implementation of BM25 (with sparse tensors) · ☆82 · Updated 6 months ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBread · ☆18 · Updated 5 months ago
- Generalist and Lightweight Model for Text Classification · ☆29 · Updated 2 weeks ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… · ☆136 · Updated 3 weeks ago
- ☆58 · Updated 3 weeks ago
- Latent Large Language Models · ☆16 · Updated 3 weeks ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). · ☆73 · Updated 6 months ago
- ☆56 · Updated this week
- Library for fast text representation and classification. · ☆28 · Updated 8 months ago
- Vector Database with support for late interaction and token level embeddings. · ☆51 · Updated last week
- Python API for https://vespa.ai, the open big data serving engine · ☆89 · Updated this week
- 🤗 HuggingFace Inference Toolkit for Google Cloud Vertex AI (similar to SageMaker's Inference Toolkit, but for Vertex AI and unofficial) · ☆17 · Updated 6 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face · ☆32 · Updated last year
- Efficient BM25 with DuckDB 🦆 · ☆12 · Updated last week
- BPE modification that removes intermediate tokens during tokenizer training. · ☆13 · Updated last week
- Generalised Contrastive Learning. This is a Repository for Google Shopping Dataset and Benchmarks followed by our novel fine-grained cont… · ☆29 · Updated 2 weeks ago
- CLIR version of ColBERT · ☆62 · Updated 3 months ago
- Source code and data for Like a Good Nearest Neighbor · ☆28 · Updated 7 months ago
- ☆38 · Updated this week
- ☆34 · Updated last year
- Trade any tensors over the network · ☆30 · Updated 11 months ago
- [WIP] Transformer to embed Danbooru labelsets · ☆13 · Updated 5 months ago
- Tools to make language models a bit easier to use · ☆22 · Updated last week
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te… · ☆42 · Updated 8 months ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval · ☆33 · Updated 3 months ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts · ☆22 · Updated 6 months ago