IBM / vllmLinks
vLLM with support for IBM Spyre
☆14Updated this week
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below
Sorting:
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆60Updated last week
- ☆99Updated last week
- ☆39Updated last month
- Small, simple agent task environments for training and evaluation☆18Updated 7 months ago
- An HTTP service intended as a backend for an LLM that can run arbitrary pieces of Python code.☆60Updated last month
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆105Updated this week
- IBM development fork of https://github.com/huggingface/text-generation-inference☆60Updated 3 weeks ago
- 🚀 Collection of tuning recipes with HuggingFace SFTTrainer and PyTorch FSDP.☆44Updated this week
- 🦾💻🌐 distributed training & serverless inference at scale on RunPod☆17Updated last year
- 🦄 Unitxt is a Python library for enterprise-grade evaluation of AI performance, offering the world's largest catalog of tools and data …☆196Updated this week
- Cray-LM unified training and inference stack.☆22Updated 4 months ago
- LM engine is a library for pretraining/finetuning LLMs☆56Updated last week
- Benchmark structured generation libraries☆27Updated 7 months ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆78Updated 2 months ago
- Self-host LLMs with LMDeploy and BentoML☆19Updated 2 months ago
- InstructLab Training Library - Efficient Fine-Tuning with Message-Format Data☆42Updated this week
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆60Updated 2 months ago
- ☆68Updated 3 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- ☆29Updated 6 months ago
- Evals meant to evaluate language models' ability to reason over long contexts.☆9Updated 8 months ago
- ☆12Updated 8 months ago
- ☆59Updated 2 weeks ago
- ☆215Updated this week
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆70Updated 7 months ago
- Observability API server for bee-agent-framework☆13Updated 2 months ago
- Inference server benchmarking tool☆68Updated last month
- ☆14Updated 9 months ago
- Example implementation of Iteration of Tought - Gives a star if you like the project☆41Updated 5 months ago
- Official PyTorch implementation for Hogwild! Inference: Parallel LLM Generation with a Concurrent Attention Cache☆105Updated last month