A high-throughput and memory-efficient inference and serving engine for LLMs
☆13Oct 10, 2025Updated 4 months ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below
Sorting:
- ☆31Apr 19, 2025Updated 10 months ago
- Dockerfile to generate Intellij Idea project shared index☆10Jun 2, 2022Updated 3 years ago
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- 一个移动终端的轻量级前端类库☆17May 24, 2013Updated 12 years ago
- Central Point on how to install macOS on Lenovo Ideapad Flex 5 14IIL05 and maybe similar models.☆10May 18, 2023Updated 2 years ago
- pip install patchelf. patchelf Python wheel for PyPI.☆11Mar 2, 2026Updated last week
- Generate PHPUnit tests from annotations, which you can write in your methods documentation☆11Aug 4, 2021Updated 4 years ago
- Integration and automation of NS-3 network simulator and Linux Containers☆12Nov 12, 2019Updated 6 years ago
- A conda-smithy repository for ctng-compiler-activation.☆14Feb 12, 2026Updated 3 weeks ago
- Hot-plug devices into a Docker container as they are plugged.☆16Nov 18, 2025Updated 3 months ago
- More Quake. Less bullshit.☆12Apr 26, 2015Updated 10 years ago
- This repository contains the results and code for the MLPerf™ Training v3.0 benchmark.☆12Aug 10, 2023Updated 2 years ago
- Provides a lot of basic functionality for our algorithms for the Virtual Network Embedding Problem (VNEP).☆12May 3, 2020Updated 5 years ago
- Pytest plugin: add multihost framework.☆11Nov 27, 2025Updated 3 months ago
- Expert Specialization MoE Solution based on CUTLASS☆27Jan 19, 2026Updated last month
- f8app 集成测试覆盖率收集 demo☆10Jun 5, 2017Updated 8 years ago
- Terribly incorrect and incomplete AOT compiler for mRuby. Source code for the LLVM Social Berlin #20☆10Aug 25, 2022Updated 3 years ago
- Haidar's Web Page☆13Oct 9, 2024Updated last year
- Fork of LLVM Project containing a Colossus IPU backend implementation☆13Feb 2, 2026Updated last month
- A conda-smithy repository for jaxlib.☆17Nov 4, 2025Updated 4 months ago
- ET Accelerator Firmware and Runtime☆35Updated this week
- A conda-smithy repository for ctng-compilers.☆15Feb 18, 2026Updated 2 weeks ago
- SGLang is a fast serving framework for large language models and vision language models.☆19Updated this week
- Main Repo for the OpenHW Group Software Task Group☆17Mar 11, 2025Updated 11 months ago
- Development containers for triton and triton-cpu☆24Mar 2, 2026Updated last week
- ☆18Oct 29, 2025Updated 4 months ago
- Systolic Blood Pressure level based on PPG signal's parameteres☆12Jun 4, 2018Updated 7 years ago
- Visual Studio Code GDB Debug Adapter for C and C++ programs.☆14Aug 4, 2023Updated 2 years ago
- A fuzzer for the CAN bus☆18Mar 1, 2025Updated last year
- A Triton-only attention backend for vLLM☆24Feb 11, 2026Updated 3 weeks ago
- A minimal toolkit for Context Engineering — Select, Compress, and Persist context with pure functions.☆30Jan 20, 2026Updated last month
- Cerebro plugin to record and fetch list of items on clipboard☆13Aug 8, 2017Updated 8 years ago
- 《Startup Playbook》中文版☆18Jun 24, 2021Updated 4 years ago
- LLM training parallelisms (DP, FSDP, TP, PP) in pure C☆26Jan 27, 2026Updated last month
- ☆11Jan 10, 2025Updated last year
- Python GDB pretty printer using Natvis files for formatting☆15Apr 23, 2020Updated 5 years ago
- Java library for parsing information from a structured Javadoc string.☆14Nov 4, 2024Updated last year
- 多集群使用thanos sidecar+MinIO监控告警☆15Feb 20, 2023Updated 3 years ago
- GDB 学习,及根据大牛Liam Huang的文章,模仿的一个GDB调试的demo☆13Apr 2, 2019Updated 6 years ago