kozistr / triton-grpc-proxy-rs
Proxy server for triton gRPC server that inferences embedding model in Rust
β21Updated 8 months ago
Alternatives and similar repositories for triton-grpc-proxy-rs:
Users that are interested in triton-grpc-proxy-rs are comparing it to the libraries listed below
- GPU accelerated client-side embeddings for vector search, RAG etc.β66Updated last year
- Chat Markup Language conversation libraryβ55Updated last year
- πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.β136Updated 9 months ago
- NLP with Rust for Python π¦πβ62Updated 10 months ago
- A high performance batching router optimises max throughput for text inference workloadβ16Updated last year
- utilities for loading and running text embeddings with onnxβ44Updated 8 months ago
- Using modal.com to process FineWeb-edu dataβ20Updated 3 weeks ago
- β39Updated 2 years ago
- Vector Database with support for late interaction and token level embeddings.β54Updated 6 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching oβ¦β131Updated 4 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async APIβ45Updated 7 months ago
- Pre-train Static Word Embeddingsβ56Updated 2 weeks ago
- β66Updated 11 months ago
- Binary vector search example using Unum's USearch engine and pre-computed Wikipedia embeddings from Co:here and MixedBreadβ18Updated last year
- β24Updated 2 months ago
- TOPLOC: is a novel method for verifiable inference that enables users to verify that LLM providers are using the correct model configuratβ¦β21Updated last week
- π End-to-end examples and analysis of deploying LLMs serverless using Modal, Runpod, and Beamβ27Updated last year
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β30Updated 8 months ago
- Tree-based indexes for neural-searchβ31Updated last year
- A miniature version of Modalβ20Updated 10 months ago
- Unofficial python bindings for the rust llm library. πβ€οΈπ¦