louiezzang / faiss-serverLinks
Faiss server for efficient similarity search and clustering of dense vectors
☆25Updated 3 years ago
Alternatives and similar repositories for faiss-server
Users that are interested in faiss-server are comparing it to the libraries listed below
Sorting:
- faiss serving :)☆137Updated 2 years ago
- gRPC server over a FAISS index☆19Updated 4 years ago
- The Triton backend for TensorFlow.☆55Updated last month
- milvus tutorials☆20Updated 3 years ago
- java wrapper for facebook faiss☆43Updated 6 years ago
- Open Source, Cloud Native, RESTful Search Engine Powered by Neural Networks☆144Updated 4 years ago
- ☆113Updated last year
- Official code for "Binary embedding based retrieval at Tencent"☆44Updated last year
- Common source, scripts and utilities shared across all Triton repositories.☆79Updated 2 weeks ago
- The Triton backend for TensorRT.☆82Updated 2 weeks ago
- A web service build on top of Facebook's Faiss☆93Updated 4 years ago
- Serving AI/ML models in the open standard formats PMML and ONNX with both HTTP (REST API) and gRPC endpoints☆164Updated last week
- ☆70Updated 2 years ago
- Sample implementation of natural language image search with OpenAI's CLIP and Elasticsearch or Opensearch.☆73Updated 3 years ago
- 参考faiss4j,已经废弃,采用c版本rpc通信的形式☆12Updated 3 years ago
- ☆21Updated 3 weeks ago
- This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/…☆97Updated last year
- XVERSE-7B: A multilingual large language model developed by XVERSE Technology Inc.☆53Updated last year
- ☆25Updated 2 years ago
- Byzer-retrieval is a distributed retrieval system which designed as a backend for LLM RAG (Retrieval Augmented Generation). The system su…☆49Updated 9 months ago
- ☆33Updated 3 years ago
- implement bert in pure c++☆36Updated 5 years ago
- 一个基于 faiss 的检索服务.☆49Updated 7 years ago
- Minimal example of using a traced huggingface transformers model with libtorch☆35Updated 5 years ago
- A library integrating embedding and reranker models from OpenAI, SentenceTransformers etc for semantic search in vector database.☆58Updated 8 months ago
- The Triton backend for the ONNX Runtime.☆170Updated 2 weeks ago
- ☆321Updated last week
- Large-scale exact string matching tool☆17Updated 9 months ago
- A memory efficient DLRM training solution using ColossalAI☆106Updated 3 years ago
- This project combines the model provided by Bert and Milvus to realize a question and answer (QA) system.☆25Updated 4 years ago