IBM / vllmLinks
vLLM with support for span semantics
☆21Updated 2 weeks ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below
Sorting:
- Benchmarking tool for assessing LLM models' performance across different hardwares☆17Updated 2 years ago
- Transformer GPU VRAM estimator☆68Updated last year
- IBM development fork of https://github.com/huggingface/text-generation-inference☆63Updated 4 months ago
- Aana SDK is a powerful framework for building AI enabled multimodal applications.☆55Updated 5 months ago
- AirLLM 70B inference with single 4GB GPU☆17Updated 7 months ago
- [⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI☆49Updated 7 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- ☆76Updated 7 months ago
- Tutorial to get started with SkyPilot!☆58Updated last year
- Benchmark structured generation libraries☆30Updated last year
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Updated 2 years ago
- Benchmark suite for LLMs from Fireworks.ai☆89Updated last week
- ☆270Updated 7 months ago
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆14Updated 10 months ago
- The Granite Guardian models are designed to detect risks in prompts and responses.☆130Updated 4 months ago
- A collection of all available inference solutions for the LLMs☆94Updated 11 months ago
- ☆68Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆93Updated this week
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data☆49Updated this week
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆90Updated last month
- Harness used to benchmark aider against SWE Bench benchmarks☆79Updated last year
- Small, simple agent task environments for training and evaluation☆19Updated last year
- This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang☆100Updated this week
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆115Updated 6 months ago
- ☆56Updated last year
- 🔧 Compare how Agent systems perform on several benchmarks. 📊🚀☆103Updated 6 months ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆287Updated this week
- ☆59Updated last year
- vLLM adapter for a TGIS-compatible gRPC server.☆50Updated this week
- Google TPU optimizations for transformers models☆134Updated 2 weeks ago