A high-throughput and memory-efficient inference and serving engine for LLMs
☆12 · Nov 14, 2025 · Updated 4 months ago
Alternatives and similar repositories for vllm
Users who are interested in vllm are comparing it to the libraries listed below.
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models ☆23 · Mar 15, 2024 · Updated 2 years ago
- Zen-NAS, a lightning fast, training-free Neural Architecture Searching algorithm ☆11 · Nov 12, 2021 · Updated 4 years ago
- Model optimizer used in Adlik. ☆42 · May 23, 2023 · Updated 2 years ago
- ☆11 · Dec 26, 2025 · Updated 3 months ago
- ☆38 · Jun 14, 2023 · Updated 2 years ago
- This project aims to show how the Whisper model can be fine-tuned on a language it was not trained on but that is similar to one it was trained on. ☆11 · May 10, 2024 · Updated last year
- WeChat auto-chat bot driven by mouse and keyboard actions ☆13 · Nov 26, 2024 · Updated last year
- ☆11 · Dec 8, 2022 · Updated 3 years ago
- ☆27 · Jul 30, 2024 · Updated last year
- Pure C++ cross-platform LLM acceleration library, callable from Python; supports chatglm-6B, llama, baichuan, and moss base models on x86 / ARM ☆12 · Jan 30, 2026 · Updated last month
- Image filter processing component ☆14 · Aug 28, 2015 · Updated 10 years ago
- For Recording Showroom Streaming Video ☆24 · Dec 11, 2025 · Updated 3 months ago
- PureWeber 2015 Summer Web class ☆13 · Sep 27, 2015 · Updated 10 years ago
- ☆16 · Mar 6, 2026 · Updated 2 weeks ago
- Evaluate gpt-4o on CLIcK (Korean NLP Dataset) ☆20 · May 18, 2024 · Updated last year
- ☆15 · May 20, 2023 · Updated 2 years ago
- ☆12 · Jan 30, 2016 · Updated 10 years ago
- Miscellaneous odds and ends: unmaintained, undocumented, not guaranteed to work ☆11 · Aug 1, 2022 · Updated 3 years ago
- A package for filtering sensitive data (parameters, keys) from a variety of JS objects ☆10 · Feb 17, 2026 · Updated last month
- Various items related to running Linux on a Lenovo Yoga c630. ☆11 · Dec 18, 2020 · Updated 5 years ago
- Story generator for The Idolmaster Million Live! ☆14 · Apr 27, 2016 · Updated 9 years ago
- Example code for "Programming Rust: Fast, Safe Systems Development, Revised 2nd Edition" ☆18 · Dec 11, 2023 · Updated 2 years ago
- Random snippets based on TensorFlow ☆10 · Jan 22, 2016 · Updated 10 years ago
- Source code from several books ☆13 · Sep 13, 2017 · Updated 8 years ago
- Research project leveraging Abstract Syntax Trees and Knowledge Graphs with Retrieval-Augmented Generation to develop an advanced, contex… ☆12 · Jul 18, 2024 · Updated last year
- KakaoTalk GPT ☆19 · Apr 9, 2024 · Updated last year
- Rust version of nomadcoin (https://github.com/nomadcoders/nomadcoin) ☆12 · May 15, 2022 · Updated 3 years ago
- Lookup of pinyin abbreviations for female voice actors ☆18 · Mar 2, 2026 · Updated 3 weeks ago
- FOSSLight Scanner ☆18 · Mar 5, 2026 · Updated 3 weeks ago
- Core function for Topology Optimization ☆25 · Jun 21, 2022 · Updated 3 years ago
- Fanbox Batch Downloader as a browser userscript ☆16 · Mar 13, 2022 · Updated 4 years ago
- An LDAP server based on Netty. ☆11 · Mar 29, 2021 · Updated 4 years ago
- SDK for developing motion control software on WeiLan Dev series robots. ☆18 · Nov 25, 2025 · Updated 4 months ago
- Bootstrap 5 tutorial with Gulp, Sass, BrowserSync, and Vanilla JS ☆13 · Jun 1, 2021 · Updated 4 years ago
- A web-based slide presentation with a real-time danmaku (bullet comment) system. ☆19 · Jul 21, 2015 · Updated 10 years ago
- Linux kernel source tree ☆18 · Sep 29, 2023 · Updated 2 years ago
- boltcli is the redis-cli for boltdb with Lua script support ☆28 · Nov 22, 2023 · Updated 2 years ago
- Scripts and instructions to run Fedora 27 on the GPD Pocket ☆16 · Sep 5, 2018 · Updated 7 years ago
- Adlik: Toolkit for Accelerating Deep Learning Inference ☆806 · Dec 27, 2023 · Updated 2 years ago