IBM / vllm
vLLM with support for IBM Spyre
☆13Updated 3 weeks ago
Alternatives and similar repositories for vllm:
Users that are interested in vllm are comparing it to the libraries listed below
- An HTTP service intended as a backend for an LLM that can run arbitrary pieces of Python code.☆56Updated 2 weeks ago
- Data preparation code for Amber 7B LLM☆88Updated 11 months ago
- Observability API server for bee-agent-framework☆13Updated 3 weeks ago
- Small, simple agent task environments for training and evaluation☆18Updated 5 months ago
- ☆66Updated 11 months ago
- ❓Curie: Automated and Rigorous Scientific Experimentation with AI Agents☆77Updated this week
- IBM development fork of https://github.com/huggingface/text-generation-inference☆60Updated 4 months ago
- ArcticTraining is a framework designed to simplify and accelerate the post-training process for large language models (LLMs)☆66Updated this week
- A collection of all available inference solutions for the LLMs☆86Updated last month
- Accelerating your LLM training to full speed! Made with ❤️ by ServiceNow Research☆178Updated this week
- A Python library to orchestrate LLMs in a neural network-inspired structure☆46Updated 6 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆90Updated 3 months ago
- ☆30Updated 9 months ago
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆277Updated this week
- Source code for the collaborative reasoner research project at Meta FAIR.☆33Updated last week
- Train your own SOTA deductive reasoning model☆88Updated last month
- PyTorch/XLA integration with JetStream (https://github.com/google/JetStream) for LLM inference"☆60Updated 3 weeks ago
- Google TPU optimizations for transformers models☆108Updated 3 months ago
- Machine Learning Serving focused on GenAI with simplicity as the top priority.☆58Updated 2 weeks ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆105Updated 2 weeks ago
- Inference server benchmarking tool☆53Updated 3 weeks ago
- Evals meant to evaluate language models' ability to reason over long contexts.☆9Updated 7 months ago
- ☆37Updated 2 months ago
- ☆38Updated 2 weeks ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆22Updated 3 weeks ago
- ☆11Updated 7 months ago
- ☆29Updated last week
- ☆129Updated 8 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆262Updated 6 months ago
- Code-Langchain☆39Updated last year