bentoml / BentoSentenceTransformersLinks
how to build a sentence embedding application using BentoML
☆14Updated 10 months ago
Alternatives and similar repositories for BentoSentenceTransformers
Users that are interested in BentoSentenceTransformers are comparing it to the libraries listed below
Sorting:
- Self-host LLMs with vLLM and BentoML☆167Updated last week
- ☆67Updated 10 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆59Updated 3 weeks ago
- CLIP as a service - Embed image and sentences, object recognition, visual reasoning, image classification and reverse image search☆66Updated 6 months ago
- Dataset Viber is your chill repo for data collection, annotation and vibe checks.☆47Updated last year
- A list of language models with permissive licenses such as MIT or Apache 2.0☆24Updated 11 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆114Updated 9 months ago
- A collection of reproducible inference engine benchmarks☆38Updated 9 months ago
- Evaluation of bm42 sparse indexing algorithm☆72Updated last year
- Evaluation framework for document processing models and services.☆62Updated last week
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆109Updated 8 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆85Updated last year
- The backend behind the LLM-Perf Leaderboard☆11Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Updated 4 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Updated 2 years ago
- Code for NeurIPS LLM Efficiency Challenge☆60Updated last year
- Multi-threaded matrix multiplication and cosine similarity calculations for dense and sparse matrices. Appropriate for calculating the K …☆86Updated last year
- Visualize multi-model embedding spaces. The first goal is to quickly get a lay of the land of any embedding space. Then be able to scroll…☆27Updated last year
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆59Updated 8 months ago
- ☆51Updated last year
- Use Grounding DINO, Segment Anything, and CLIP to label objects in images.☆34Updated 2 years ago
- ☆101Updated last year
- TitanML Takeoff Server is an optimization, compression and deployment platform that makes state of the art machine learning models access…☆114Updated 2 years ago
- Mistral-7B finetuned for function calling☆16Updated 2 years ago
- ☆41Updated 2 years ago
- 🤝 Trade any tensors over the network☆30Updated 2 years ago
- Using multiple LLMs for ensemble Forecasting☆16Updated 2 years ago