bentoml / sentence-embedding-bento
Sentence Embedding as a Service
☆15Updated 2 weeks ago
Alternatives and similar repositories for sentence-embedding-bento
Users who are interested in sentence-embedding-bento are comparing it to the libraries listed below.
- Cortex-compatible model server for Python and TensorFlow☆17Updated 2 years ago
- Benchmark for machine learning model online serving (LLM, embedding, Stable-Diffusion, Whisper)☆28Updated 2 years ago
- setup the env for vllm users☆16Updated last year
- Demo example of consumer goods categorization☆28Updated last year
- 🚂 Fine-tune OpenAI models for text classification, question answering, and more☆16Updated 2 years ago
- MLFlow Deployment Plugin for Ray Serve☆45Updated 3 years ago
- Lightning Fast: Faiss CPU + Onnx Quantized Multilingual Embedding Model☆23Updated 10 months ago
- ☆37Updated this week
- The collection of building blocks for building fine-tunable metric learning models☆32Updated 3 months ago
- A file utility for accessing both local and remote files through a unified interface.☆43Updated 2 months ago
- ☆55Updated 2 weeks ago
- Tutorial to get started with SkyPilot!☆58Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆82Updated this week
- Open sourced backend for Martian's LLM Inference Provider Leaderboard☆18Updated 11 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 2 months ago
- Experiments w/ ChatGPT, LangChain, local LLMs☆24Updated 2 years ago
- A collection of reproducible inference engine benchmarks☆32Updated 2 months ago
- ☆39Updated 2 years ago
- 🤝 Trade any tensors over the network☆30Updated last year
- OpenAI compatible API for open source LLMs☆15Updated last year
- ☆15Updated 3 months ago
- The backend behind the LLM-Perf Leaderboard☆10Updated last year
- Tools for formatting large language model prompts.☆13Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆35Updated 2 years ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated last year
- CLI-based tool to automatically build ML models from training data into a servable Docker container☆58Updated 2 years ago
- Deploy and Scale LLM-based applications☆26Updated 2 years ago
- This is the repo for the container that holds the models for the text2vec-transformers module☆51Updated last week
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆36Updated last year
- A specification for OpenInference, a semantic mapping of ML inferences☆47Updated last year