A high-throughput and memory-efficient inference and serving engine for LLMs
☆12Nov 14, 2025Updated 7 months ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆23Mar 15, 2024Updated 2 years ago
- Zen-NAS, a lightning fast, training-free Neural Architecture Searching algorithm☆11Nov 12, 2021Updated 4 years ago
- Model optimizer used in Adlik.☆41May 23, 2023Updated 3 years ago
- ☆11Dec 26, 2025Updated 6 months ago
- ☆38Jun 14, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- This projects aims to show how whisper model can be fine-tuned on language it was not trained but is trained on similar language to it.☆11May 10, 2024Updated 2 years ago
- 基于鼠标键盘操作的微信自动聊天机器人☆13Nov 26, 2024Updated last year
- ☆27Jul 30, 2024Updated last year
- 纯c++的全平台llm加速库,支持python调用,支持chatglm-6B, llama, baichuan, moss基座,x86 / ARM☆13Jun 10, 2026Updated 3 weeks ago
- 图像滤镜处理组件☆14Aug 28, 2015Updated 10 years ago
- PureWeber 2015 Summer Web class☆13Sep 27, 2015Updated 10 years ago
- Evaluate gpt-4o on CLIcK (Korean NLP Dataset)☆20May 18, 2024Updated 2 years ago
- ☆19May 20, 2026Updated last month
- ☆15May 20, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆12Jan 30, 2016Updated 10 years ago
- 杂物,不维护,没文档,不保证能用☆11Aug 1, 2022Updated 3 years ago
- Various items related to running linux on a Lenovo Yoga c630.☆11Dec 18, 2020Updated 5 years ago
- A package for filtering sensitive data (parameters, keys) from a variety of JS objects☆10Feb 17, 2026Updated 4 months ago
- おはなしジェネレーター for アイドルマスターミリオンライブ!☆14Apr 27, 2016Updated 10 years ago
- "프로그래밍 러스트: 빠르고 안전한 시스템 개 발, 개정2판" 예제 코드☆18Dec 11, 2023Updated 2 years ago
- Random snippets based on tensorflow☆10Jan 22, 2016Updated 10 years ago
- 一些书的源码☆13Sep 13, 2017Updated 8 years ago
- Research project leveraging Abstract Syntax Trees and Knowledge Graphs with Retrieval-Augmented Generation to develop an advanced, contex…☆13Jul 18, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 카카오톡 GPT☆19Apr 9, 2024Updated 2 years ago
- Rust version of nomadcoin (https://github.com/nomadcoders/nomadcoin)☆12May 15, 2022Updated 4 years ago
- FOSSLight Scanner☆18Jun 25, 2026Updated last week
- 女声优拼音缩写查询☆18Jun 12, 2026Updated 3 weeks ago
- Core function for Topology Optimization☆25Jun 21, 2022Updated 4 years ago
- Fanbox Batch Downloader on browser userscript☆16Mar 13, 2022Updated 4 years ago
- The LDAP Server Base on Netty. 基于netty编写的ldap服务器☆11Jun 12, 2026Updated 3 weeks ago
- SDK for developing motion control software on WeiLan Dev series robots.☆21Nov 25, 2025Updated 7 months ago
- Bootstrap 5 tutorial with Gulp, Sass, BrowserSync, and Vanilla JS☆13Jun 1, 2021Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 这是一个网页展示slide + 实时弹幕评论系统。☆18Jul 21, 2015Updated 10 years ago
- Linux kernel source tree☆19Sep 29, 2023Updated 2 years ago
- boltcli is the redis-cli for boltdb with Lua script support☆29Nov 22, 2023Updated 2 years ago
- Scripts and instructions to run Fedora 27 in the GPD Pocket☆16Sep 5, 2018Updated 7 years ago
- Adlik: Toolkit for Accelerating Deep Learning Inference☆806Dec 27, 2023Updated 2 years ago
- T100TA / T100TAF Linux and Android kernel☆16Apr 7, 2017Updated 9 years ago
- Deep Galerkin Method☆17Feb 10, 2019Updated 7 years ago