SakanaAI / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆29 Updated last year
Alternatives and similar repositories for vllm
Users who are interested in vllm compare it to the libraries listed below
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆33 Updated last week
- Using multiple LLMs for ensemble Forecasting ☆16 Updated last year
- ☆17 Updated 3 months ago
- This repository contains a fork from "language-models-trajectory-generators", the goal is to test the same functionality with Mistrals LL… ☆21 Updated 7 months ago
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale … ☆19 Updated 3 weeks ago
- A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) within reach of everyone, particu… ☆34 Updated last year
- BH hackathon ☆14 Updated last year
- ☆29 Updated last year
- 🧠 Societies of Mind & Economy of Minds ☆57 Updated 2 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆90 Updated 3 months ago
- LLM reads a paper and produces a working prototype ☆56 Updated last month
- The next evolution of Agents ☆48 Updated 3 weeks ago
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆23 Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 Updated 11 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours ☆61 Updated 8 months ago
- ☆14 Updated last year
- 🌟EasyAGI : A generalist agent that can go online and accomplish complex tasks. ☆23 Updated last year
- BabyCommandAGI is designed to test what happens when you combine CLI and LLM, which are older computer interfaces than GUI. Based on Baby… ☆46 Updated 2 months ago
- ☆28 Updated last year
- ☆28 Updated last year
- All the world is a play, we are but actors in it. ☆49 Updated this week
- The Next Generation Multi-Modality Superintelligence ☆71 Updated 8 months ago
- Finetune any model on HF in less than 30 seconds ☆58 Updated last month
- A forest of autonomous agents. ☆19 Updated 3 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna ☆53 Updated 3 months ago
- Simple Implementation of a Transformer in the new framework MLX by Apple ☆20 Updated 5 months ago
- Fine tune Gemma 3 on an object detection task ☆20 Updated this week
- ☆54 Updated last year
- Modified Beam Search with periodical restart ☆12 Updated 8 months ago
- A repository of projects and datasets under active development by Alignment Lab AI ☆22 Updated last year