qdrant / bfbLinks
*high-load* benchmarking tool
☆15Updated 2 weeks ago
Alternatives and similar repositories for bfb
Users that are interested in bfb are comparing it to the libraries listed below
Sorting:
- A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR…☆39Updated last year
- Proxy server for triton gRPC server that inferences embedding model in Rust☆21Updated last year
- This is the repo for the container that holds the models for the text2vec-transformers module☆60Updated 3 months ago
- Sentence Embedding as a Service☆15Updated 7 months ago
- Using modal.com to process FineWeb-edu data☆20Updated 10 months ago
- A framework for benchmarking embedding models in hybrid search scenarios (BM25 + vector search) using Weaviate.☆38Updated 3 weeks ago
- ☆83Updated 3 months ago
- GPU prices aggregator for cloud providers☆45Updated last month
- Vector Database with support for late interaction and token level embeddings.☆54Updated 7 months ago
- Optimizing bit-level Jaccard Index and Population Counts for large-scale quantized Vector Search via Harley-Seal CSA and Lookup Tables☆21Updated 8 months ago
- Contextualized per-token embeddings☆34Updated 8 months ago
- utilities for loading and running text embeddings with onnx☆45Updated 5 months ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆61Updated last year
- Helm charts to deploy Weaviate to k8s☆65Updated 3 months ago
- ☆21Updated last year
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse.☆26Updated 7 months ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and…☆41Updated 3 months ago
- Explore vector similarity in Redis☆115Updated 2 years ago
- Self-host LLMs with vLLM and BentoML☆168Updated 2 weeks ago
- Efficient BM25 with DuckDB 🦆☆61Updated last year
- Proxy server for quota, usage monitoring and tracking of OpenAI requests☆16Updated 2 years ago
- Parallel wasm Barnes-Hut t-SNE implementation written in Rust.☆21Updated last month
- Like picoGPT but for BERT.☆51Updated 2 years ago
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg…☆26Updated 11 months ago
- Generate fixed dimensional embeddings for multi-dimensional vectors in python based on Muvera from Google.☆19Updated 7 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆138Updated last year
- First token cutoff sampling inference example☆30Updated 2 years ago
- build your own vector database -- the littlest hnsw☆67Updated last year
- ☆18Updated last year
- Model implementation for the contextual embeddings project☆40Updated 8 months ago