substratusai / vllm-docker
☆45Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for vllm-docker
- Self-host LLMs with vLLM and BentoML☆72Updated this week
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆55Updated 3 months ago
- 🕹️ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.☆132Updated 3 months ago
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated last month
- A guidance compatibility layer for llama-cpp-python☆34Updated last year
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆160Updated this week
- Just a bunch of benchmark logs for different LLMs☆113Updated 3 months ago
- ☆77Updated 6 months ago
- ☆52Updated 5 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆26Updated 8 months ago
- Tutorial for building LLM router☆157Updated 3 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.☆52Updated 7 months ago
- Embed anything.☆29Updated 5 months ago
- Complex RAG backend☆28Updated 7 months ago
- Python client library for improving your LLM app accuracy☆96Updated this week
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆52Updated last week
- ☆64Updated 5 months ago
- ☆37Updated 11 months ago
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platform☆81Updated 2 months ago
- ☆200Updated 9 months ago
- ☆91Updated last month
- ☆16Updated 2 months ago
- Simple examples using Argilla tools to build AI☆38Updated this week
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆36Updated 9 months ago
- ☆148Updated 3 months ago
- Develop, evaluate and monitor LLM applications at scale☆93Updated this week
- IBM development fork of https://github.com/huggingface/text-generation-inference☆57Updated last month
- One click templates for inferencing Language Models☆115Updated 3 weeks ago
- Generate train.jsonl and valid.jsonl files to use for fine-tuning Mistral and other LLMs.☆77Updated 9 months ago
- A toolkit for building multimodal AI agents☆107Updated 2 weeks ago