qdrant / bfb
*high-load* benchmarking tool
☆15 Updated 2 weeks ago
Alternatives and similar repositories for bfb
Users who are interested in bfb are comparing it to the repositories listed below.
- Sentence Embedding as a Service ☆15 Updated 4 months ago
- A complete (gRPC service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GR… ☆39 Updated last year
- ☆18 Updated last year
- Using modal.com to process FineWeb-edu data ☆20 Updated 7 months ago
- Vector Database with support for late interaction and token level embeddings. ☆55 Updated 4 months ago
- ☆21 Updated last year
- utilities for loading and running text embeddings with onnx ☆44 Updated 2 months ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and… ☆37 Updated last month
- Model implementation for the contextual embeddings project ☆36 Updated 5 months ago
- This is the repo for the container that holds the models for the text2vec-transformers module ☆57 Updated last week
- Helm charts to deploy Weaviate to k8s ☆64 Updated last week
- ☆79 Updated last week
- Tools for formatting large language model prompts. ☆13 Updated last year
- Cortex-compatible model server for Python and TensorFlow ☆17 Updated 2 years ago
- ☆64 Updated 7 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models. ☆139 Updated last year
- GPU accelerated client-side embeddings for vector search, RAG etc. ☆65 Updated last year
- Contextualized per-token embeddings ☆30 Updated 6 months ago
- Constrain LLM output ☆113 Updated last year
- Demo example of consumer goods categorization ☆30 Updated last year
- Trainable embedding transformation for confidence estimation, feature extraction, explainability and conversion from dense to sparse. ☆26 Updated 5 months ago
- Routing on Random Forest (RoRF) ☆220 Updated last year
- Qdrant operator creates and manages Qdrant clusters running in Kubernetes ☆24 Updated last year
- GPU prices aggregator for cloud providers ☆43 Updated last week
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access… ☆114 Updated last year
- First token cutoff sampling inference example ☆31 Updated last year
- This repository is designed for deploying and managing server processes that handle embeddings using the Infinity Embedding model or Larg… ☆24 Updated 8 months ago
- ☆12 Updated last year
- Proxy server for triton gRPC server that inferences embedding model in Rust ☆21 Updated last year
- Open Weight, tool-calling LLMs ☆155 Updated last year