backprop-ai / vllm-benchmark
Benchmarking the serving capabilities of vLLM
β33Updated 6 months ago
Alternatives and similar repositories for vllm-benchmark:
Users that are interested in vllm-benchmark are comparing it to the libraries listed below
- Data preparation code for Amber 7B LLMβ86Updated 10 months ago
- experiments with inference on llamaβ104Updated 9 months ago
- π A deep-dive into HyDE for Advanced LLM RAG + π‘ Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, coveraβ¦β32Updated 11 months ago
- πΉοΈ Performance Comparison of MLOps Engines, Frameworks, and Languages on Mainstream AI Models.β136Updated 7 months ago
- A toolkit for fine-tuning, inferencing, and evaluating GreenBitAI's LLMs.β80Updated this week
- A collection of all available inference solutions for the LLMsβ80Updated last week
- Client Code Examples, Use Cases and Benchmarks for Enterprise h2oGPTe RAG-Based GenAI Platformβ83Updated last month
- β113Updated 5 months ago
- β48Updated 3 months ago
- A simple service that integrates vLLM with Ray Serve for fast and scalable LLM serving.β63Updated 11 months ago
- Set of scripts to finetune LLMsβ36Updated 11 months ago
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β73Updated 4 months ago
- β53Updated 9 months ago
- β235Updated this week
- GPT-4 Level Conversational QA Trained In a Few Hoursβ58Updated 6 months ago
- Evaluation of bm42 sparse indexing algorithmβ64Updated 8 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.β58Updated 2 months ago
- β74Updated last year
- β77Updated 2 weeks ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needsβ208Updated this week
- Sakura-SOLAR-DPO: Merge, SFT, and DPOβ116Updated last year
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB.β59Updated 2 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Mβ¦β204Updated 4 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)β76Updated 4 months ago
- Benchmark baseline for retrieval qa applicationsβ103Updated 10 months ago
- Mixing Language Models with Self-Verification and Meta-Verificationβ101Updated 2 months ago
- Dynamic batching library for Deep Learning inference. Tutorials for LLM, GPT scenarios.β94Updated 6 months ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.β34Updated 2 months ago
- Repository for organizing datasets and papers used in Open LLM.β93Updated last year
- vLLM performance dashboardβ23Updated 10 months ago