bentoml / CLIP-API-serviceLinks
CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search
β66Updated 4 months ago
Alternatives and similar repositories for CLIP-API-service
Users that are interested in CLIP-API-service are comparing it to the libraries listed below
Sorting:
- Turn any OCR models into online inference API endpoint π πβ57Updated last month
- An JS web client for connecting to Pipecat bots with voice and visionβ44Updated 11 months ago
- β20Updated last year
- A library to extract the main content from html. Developed for information on LLM and for feeding data into LangChain and LlamaIndex.β51Updated last year
- β74Updated last year
- A complete(grpc service and lib) Rust inference with multilingual embedding support. This version leverages the power of Rust for both GRβ¦β39Updated last year
- An open-source cloud-native of large multi-modal models (LMMs) serving framework.β164Updated 2 years ago
- 𧬠[WIP] Lobe Flow - an open-source ai powered node flow editorβ22Updated last year
- Demo example of consumer goods categorizationβ30Updated 2 years ago
- Easy to deploy.A cloud service for python code interpreter sandbox for Code-Interpreter.β57Updated last year
- VideoDB Python SDKβ84Updated this week
- Source code of the food discovery demo built on top of Qdrantβ47Updated 2 years ago
- Poor man's phind.com/perplexity.aiβ52Updated last year
- A PoC to run Segment Anything Model (SAM) entirely in the browser without any backendβ77Updated 2 years ago
- A mono-repo to house the various supported Transport options to be used with Pipecat's client-js packageβ29Updated last week
- β29Updated 2 years ago
- faster-whisper as serverless endpointβ125Updated 3 weeks ago
- Browser-based Voice Assistantβ44Updated 2 years ago
- β ChatGPT Plugin for performing basic arithmetic operationsβ18Updated 2 years ago
- Semantic Search demo featuring UForm, USearch, UCall, and StreamLit, to visual and retrieve from image datasets, similar to "CLIP Retrievβ¦β53Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translationβ121Updated last year
- LLaVA server (llama.cpp).β183Updated 2 years ago
- β33Updated 2 years ago
- Langchain Agent utilizing OpenAI Function Calls to execute Git commands using Natural Languageβ44Updated 2 years ago
- A repository for creating, and sample code for consuming an ONNX embedding modelβ34Updated 2 years ago
- Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, proviβ¦β41Updated 8 months ago
- A multimodal RAG application that enables semantic search on multimedia sources like audio, video and imagesβ41Updated 2 years ago
- HuggingChat like UI in Gradioβ70Updated 2 years ago
- A function to do allβ35Updated last year
- Cog wrapper for Coqui / xtts-v2β79Updated last year