IBM / vllmLinks
vLLM with support for span semantics
☆21Updated last month
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below
Sorting:
- ☆75Updated 7 months ago
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆114Updated 5 months ago
- Transformer GPU VRAM estimator☆67Updated last year
- A repository of Python scripts to scrape code contents of the public repositories of `huggingface`.☆53Updated last year
- The Granite Guardian models are designed to detect risks in prompts and responses.☆128Updated 3 months ago
- Visualize expert firing frequencies across sentences in the Mixtral MoE model☆18Updated 2 years ago
- Benchmark structured generation libraries☆30Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆90Updated last month
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆114Updated 9 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆30Updated last year
- ☆269Updated 7 months ago
- Benchmarking tool for assessing LLM models' performance across different hardwares☆17Updated 2 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆53Updated 2 years ago
- A collection of all available inference solutions for the LLMs☆94Updated 10 months ago
- Harness used to benchmark aider against SWE Bench benchmarks☆78Updated last year
- Public repository containing METR's DVC pipeline for eval data analysis☆186Updated last week
- An HTTP service intended as a backend for an LLM that can run arbitrary pieces of Python code.☆70Updated 4 months ago
- Python console application designed to provide an engaging and visually appealing LLM chat experience on Unix-like consoles or Terminals.☆23Updated 3 weeks ago
- Official code for "SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient"☆149Updated 2 years ago
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆282Updated this week
- ☆38Updated 5 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated last year
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆93Updated last week
- Machine Learning Agility (MLAgility) benchmark and benchmarking tools☆40Updated 5 months ago
- Voyage AI Official Python Library☆91Updated last month
- Granite 3.1 Language Models☆136Updated 7 months ago
- Small, simple agent task environments for training and evaluation☆19Updated last year
- Google TPU optimizations for transformers models☆133Updated last week
- An NVIDIA AI Workbench example project for fine-tuning a Mistral 7B model☆69Updated last year
- Tutorial to get started with SkyPilot!☆58Updated last year