louiezzang / faiss-server
Faiss server for efficient similarity search and clustering of dense vectors
☆24Updated 3 years ago
Alternatives and similar repositories for faiss-server
Users that are interested in faiss-server are comparing it to the libraries listed below
- Plugin to integrate approximate nearest neighbor (ANN) search with Elasticsearch☆66Updated 6 years ago
- gRPC server over a FAISS index☆18Updated 3 years ago
- faiss serving :)☆135Updated last year
- Milvus management GUI☆95Updated 3 years ago
- milvus tutorials☆20Updated 3 years ago
- The Triton backend for TensorRT.☆77Updated this week
- The Triton backend for TensorFlow.☆52Updated last month
- Large-scale exact string matching tool☆17Updated 4 months ago
- Sample implementation of natural language image search with OpenAI's CLIP and Elasticsearch or Opensearch.☆71Updated 2 years ago
- Serving AI/ML models in the open standard formats PMML and ONNX with both HTTP (REST API) and gRPC endpoints☆158Updated 9 months ago
- A data migration tool for Milvus.☆70Updated 2 years ago
- Deploy stable diffusion model with onnx/tensorrt + tritonserver☆124Updated last year
- ☆23Updated last year
- Common source, scripts and utilities shared across all Triton repositories.☆74Updated this week
- This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/…☆95Updated last year
- Vector Search Engine based on BRPC + FAISS☆148Updated 5 years ago
- A retrieval service based on faiss.☆49Updated 7 years ago
- Open Source, Cloud Native, RESTful Search Engine Powered by Neural Networks☆142Updated 3 years ago
- ☆68Updated 2 years ago
- ☆31Updated 3 years ago
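- ElasticCTR, the PaddlePaddle elastic computing recommendation system, is an open-source, enterprise-grade recommendation system solution based on Kubernetes. It combines high-accuracy CTR models continuously refined in Baidu's business scenarios, the large-scale distributed training capability of the open-source PaddlePaddle framework, and an industrial-grade elastic scheduling service for sparse parameters, helping users set up a recommendation system in a Kubernetes environment with one click…☆185Updated 5 years ago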
- Whisper in TensorRT-LLM☆16Updated last year
- A library for building and serving multi-node distributed faiss indices.☆268Updated last year
- ☆55Updated last year
- Real time vector search engine☆138Updated 2 years ago
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.☆53Updated last year
- The Triton backend for the ONNX Runtime.☆156Updated this week
- Official code for "Binary embedding based retrieval at Tencent"☆43Updated last year
- ☆110Updated last year
- Inference speed-up for stable-diffusion (ldm) with TensorRT.☆35Updated 2 years ago