bentoml / BentoVLLM
Self-host LLMs with vLLM and BentoML
☆74Updated this week
Related projects ⓘ
Alternatives and complementary repositories for BentoVLLM
- Experimental Code for StructuredRAG: Structured Outputs in Retrieval-Augmented Generation☆94Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆165Updated 2 weeks ago
- Simple examples using Argilla tools to build AI☆40Updated this week
- Dynamic Metadata based RAG Framework☆71Updated 3 months ago
- ☆75Updated 5 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆57Updated 4 months ago
- Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)☆74Updated 2 months ago
- ☆46Updated this week
- ☆105Updated last month
- GPT-4 Level Conversational QA Trained In a Few Hours☆55Updated 3 months ago
- DSPY on action with OpenSource LLMs.☆57Updated 7 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆53Updated 3 weeks ago
- ☆94Updated 2 months ago
- ☆64Updated 5 months ago
- ☆78Updated this week
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆36Updated 9 months ago
- ☆18Updated this week
- Open source and AI-powered web search engine: local, private, dockerized and supported by a fluffy llama🦙☆51Updated 3 months ago
- ☆119Updated this week
- RAG example using DSPy, Gradio, FastAPI☆66Updated 7 months ago
- One click templates for inferencing Language Models☆119Updated this week
- A collection of all available inference solutions for the LLMs