SakanaAI / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆28Updated last year
Alternatives and similar repositories for vllm:
Users that are interested in vllm are comparing it to the libraries listed below
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆19Updated last week
- LLM reads a paper and produce a working prototype☆51Updated 2 weeks ago
- ☆17Updated last month
- BH hackathon☆14Updated 11 months ago
- Automatic Prompt Optimization☆28Updated 10 months ago
- ☆48Updated 4 months ago
- ☆54Updated last year
- Interactive Textbook Demo☆40Updated last year
- Finetune any model on HF in less than 30 seconds☆58Updated 2 months ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu…☆34Updated last year
- The next evolution of Agents☆48Updated 2 weeks ago
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆74Updated last week
- A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1☆20Updated 2 weeks ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Tutorial for DSPy☆23Updated 10 months ago
- Unofficial implementation and experiments related to Set-of-Mark (SoM) 👁️☆85Updated last year
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…☆21Updated 5 months ago
- tickr-agent is an enterprise-ready, scalable Python library for building swarms of financial agents that conduct comprehensive stock anal…☆43Updated last week
- ☆29Updated last year
- Code for ExploreTom☆79Updated 3 months ago
- AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse A…☆19Updated 2 weeks ago
- A radically simple, reliable, and high performance template to enable you to quickly get set up building multi-agent applications☆32Updated last week
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 2 months ago
- ☆21Updated 4 months ago
- ☆28Updated last year
- Example implementation of Iteration of Tought - Gives a star if you like the project☆39Updated 3 months ago
- A pure MLX-based training pipeline for fine-tuning LLMs using GRPO on Apple Silicon.☆32Updated last month
- A forest of autonomous agents.☆19Updated 2 months ago
- Enhancement in Multimodal Representation Learning.☆40Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with…☆23Updated last year