g4lb / proxy-serviceLinks
A Proxy service using FastAPI and Protocol Buffers (Proto3)
☆13Updated 2 years ago
Alternatives and similar repositories for proxy-service
Users that are interested in proxy-service are comparing it to the libraries listed below
Sorting:
- Lightweight Nearest Neighbors with Flexible Backends☆333Updated last month
- Tree-based indexes for neural-search☆31Updated last year
- High-performance MinHash implementation in Rust with Python bindings for efficient similarity estimation and deduplication of large datas…☆230Updated last month
- 🤝 Trade any tensors over the network☆31Updated 2 years ago
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆86Updated last year
- High-Performance Engine for Multi-Vector Search☆207Updated 3 weeks ago
- NLP with Rust for Python 🦀🐍☆71Updated 8 months ago
- ☆90Updated 7 months ago
- Datamodels for hugging face tokenizers☆87Updated this week
- Minimal library for distributed python work. Can efficiently run CPU and GPU tasks across 100s of machines.☆88Updated this week
- Tools to make language models a bit easier to use☆64Updated this week
- Pivotal Token Search☆144Updated last month
- An introduction to DSPy☆33Updated 5 months ago
- utilities for loading and running text embeddings with onnx☆45Updated 5 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆181Updated 9 months ago
- Genalog is an open source, cross-platform python package allowing generation of synthetic document images with custom degradations and te…☆44Updated 2 years ago
- Pytorch script hot swap: Change code without unloading your LLM from VRAM☆125Updated 9 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆34Updated last year
- Efficient BM25 with DuckDB 🦆☆61Updated last year
- A library to use `modal` as a backend for `joblib`.☆32Updated last year
- lossily compress representation vectors using product quantization☆59Updated 3 months ago
- ☆73Updated last month
- Optimus is a flexible and scalable framework built to train language models efficiently across diverse hardware configurations, including…☆68Updated 2 months ago
- Vector Database with support for late interaction and token level embeddings.☆54Updated 7 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆155Updated 6 months ago
- Real-time data processing/feature engineering in Rust, Python and SQL. Tailored for modern AI/ML systems.☆75Updated last week
- Build data processing and data analysis pipelines that leverage the power of LLMs 🧠☆247Updated 3 weeks ago
- Pre-train Static Word Embeddings☆94Updated 5 months ago
- Some experiments on transformer models☆11Updated 2 years ago
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated 2 years ago