SakanaAI / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆27Updated 9 months ago
Alternatives and similar repositories for vllm:
Users that are interested in vllm are comparing it to the libraries listed below
- ☆54Updated last year
- Official homepage for "Self-Harmonized Chain of Thought"☆88Updated last month
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- LLM reads a paper and produce a working prototype☆46Updated 2 weeks ago
- A desktop for AI agents☆36Updated this week
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo)☆68Updated 3 weeks ago
- The Next Generation Multi-Modality Superintelligence☆70Updated 4 months ago
- Q-Star Agent Code: A reinforcement learning-based framework for intelligent agents using Microsoft AutoGen. It leverages Q-Star, a Q-lear…☆76Updated 11 months ago
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL…☆20Updated 3 months ago
- ☆49Updated 3 months ago
- Command-line script for inferencing from models such as WizardCoder☆26Updated last year
- The next evolution of Agents☆47Updated last week
- ☆17Updated 3 months ago
- BH hackathon☆14Updated 9 months ago
- entropix style sampling + GUI☆25Updated 2 months ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆19Updated 4 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆39Updated last week
- ☆83Updated 3 months ago
- Code for TrackTheMind☆68Updated last month
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale …☆16Updated this week
- Finetune any model on HF in less than 30 seconds☆57Updated 2 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 6 months ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆37Updated 11 months ago
- ☆29Updated last year
- ☆108Updated last month
- Implementation of AutoRT: "AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents"☆34Updated 2 months ago
- ☆46Updated 2 months ago
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectio…☆79Updated 7 months ago
- ☆29Updated 10 months ago
- Minimalistic repository to reproduce and serve CAMEL models.☆19Updated last year