A high-throughput and memory-efficient inference and serving engine for LLMs
☆16Feb 28, 2026Updated this week
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below
Sorting:
- Quick Notebook Tutorials☆36Jul 17, 2025Updated 7 months ago
- Presentation, Code and Notebooks used in the conference☆11Aug 1, 2023Updated 2 years ago
- ☆10Aug 10, 2024Updated last year
- ☆12Dec 15, 2025Updated 2 months ago
- Airline AI Agent with Langflow and DataStax Astra☆12Jan 14, 2025Updated last year
- Official implementation of Rethinking the "Heatmap + Monte Carlo Tree Search" Paradigm for Large Scale TSP.☆11Nov 15, 2024Updated last year
- Satcom radio website☆15Jan 29, 2026Updated last month
- This repository is the project page for "Point Anywhere: Directed Object Estimation from Omnidirectional Images", including source code …☆12Aug 25, 2023Updated 2 years ago
- A simple python package to stretch audio files and change their speed☆12Feb 18, 2026Updated 2 weeks ago
- ☆10Oct 6, 2024Updated last year
- ocr照片识别文字,包括裁剪图片,能识别中文和英文,是现有网上资源中识别率最好的☆13Sep 20, 2016Updated 9 years ago
- This repo contains information regarding cloud offerings of OpenVINO™ and demos to showcase OpenVINO™ via sample Jupyter notebooks.☆12Jul 14, 2025Updated 7 months ago
- Speech Recognition in python☆10Jul 12, 2018Updated 7 years ago
- Automate indexing files and pages from SharePoint into Azure AI Search☆11Jun 6, 2024Updated last year
- ☆18Feb 13, 2026Updated 2 weeks ago
- An LLM inference engine, written in C++☆18Feb 5, 2026Updated 3 weeks ago
- An open source code of the GitHub Copilot Workspace☆12Jun 8, 2024Updated last year
- ☆11Feb 13, 2024Updated 2 years ago
- ☆12Oct 22, 2019Updated 6 years ago
- Intel® AI for Enterprise Inference optimizes AI inference services on Intel hardware using Kubernetes Orchestration. It automates LLM mod…☆36Feb 20, 2026Updated last week
- ☆14Jul 5, 2024Updated last year
- Tools for GPS navigation in ROS☆13Feb 1, 2023Updated 3 years ago
- Git scrapers for scraping the fediverse☆19Updated this week
- OSINT tool for researching targets based on Email Address or Username using SerperDev, Firecrawl, HIBP & OSINT Industries APIs, and OpenA…☆16Sep 17, 2024Updated last year
- An LLM-based Multi-Agent Framework for Financial Crime & Suspicious Matter Reporting☆13Apr 28, 2024Updated last year
- Leveraging revolutionary Agent and Phi-2 technology, Graph Detective uncovers concealed linkages and discerns patterns, enabling pinpoint…☆10Apr 21, 2024Updated last year
- Using this New Customers Can Open New account by Submitting their Documents , signature , fill up form etc. then Customer can make video…☆12Mar 2, 2024Updated 2 years ago
- rUv-Engineer - let's you describe UI using your imagination, then see it rendered live.☆10Sep 28, 2024Updated last year
- a simple casual graph evaluator (for experiments)☆13Jan 3, 2019Updated 7 years ago
- ☆17Nov 13, 2024Updated last year
- ☆13Jun 15, 2023Updated 2 years ago
- Run AuraFlow on Replicate☆14Jul 12, 2024Updated last year
- ☆13Sep 11, 2024Updated last year
- Notas das aulas da Aceleração Dev #4 da DIO sobre Engenharia de Dados, ministrado pela Everis.☆13Feb 6, 2021Updated 5 years ago
- This project showcases a comprehensive authentication system using Supabase as the backend, implemented with both Streamlit and FastAPI f…☆15Aug 30, 2024Updated last year
- An enterprise-grade AI-powered backtesting framework built on the Swarms framework for automated trading strategy validation and optimiza…☆11Oct 13, 2025Updated 4 months ago
- ☆12Nov 13, 2024Updated last year
- Lightweight and friendly .NET library for realizing Semantic Web applications (OWL2, SWRL)☆13Nov 16, 2025Updated 3 months ago
- Solutions to some coding challenges written with Python☆15May 19, 2020Updated 5 years ago