QuixiAI/runpod-vllm



☆14

Alternatives and similar repositories for runpod-vllm

Users who are interested in runpod-vllm are comparing it to the repositories listed below.


crazy-max / crazy-max
☆12 · Updated this week
chris-ch / llama2.hs
Inference Llama 2 in one file of pure Haskell (A port of llama2.c from Andrej Karpathy)
☆14 · Oct 17, 2025 · Updated 9 months ago
LiuXiaoxuanPKU / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆11 · Sep 4, 2025 · Updated 10 months ago
leo-du / llama2.rs
Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust
☆40 · Aug 2, 2023 · Updated 2 years ago
vacmar01 / mixtral_data_extraction
☆11 · Dec 23, 2023 · Updated 2 years ago
DougGregor / swift
The Swift Programming Language
☆12 · Updated this week
evisdrenova / gotorch
A barebones go implementation of pytorch
☆27 · Aug 11, 2024 · Updated last year
JohannesGaessler / llama.cpp
Port of Facebook's LLaMA model in C/C++
☆13 · Updated this week
soon / basecharts
Baserow charts plugin
☆16 · Oct 14, 2023 · Updated 2 years ago
celikin / llama2.c-android-wrapper
Android wrapper for Inference Llama 2 in one file of pure C
☆18 · Aug 21, 2023 · Updated 2 years ago
kingrc15 / multimodal-clinical-pretraining
This is the official code for "Multimodal Pretraining of Medical Time Series and Notes" at Machine Learning for Health 2023
☆21 · Jan 6, 2025 · Updated last year
bachittle / open-voice-pilot
Open-source AI for voice control, rivaling Alexa and Siri
☆13 · Mar 9, 2024 · Updated 2 years ago
graydon / swift
The Swift Programming Language
☆19 · Jan 8, 2021 · Updated 5 years ago
ankan-ban / llama2.cu
Inference Llama 2 in one file of pure Cuda
☆17 · Aug 20, 2023 · Updated 2 years ago
inlined / versioningishard
A walkthrough of some forwards-compatibility concerns encountered in the CloudEvents spec.
☆12 · Aug 22, 2018 · Updated 7 years ago
coldlarry / llama2.cpp
Inference Llama 2 in one file of pure C
☆13 · Nov 17, 2023 · Updated 2 years ago
etown / LifeNarration
☆11 · Nov 8, 2023 · Updated 2 years ago
tmc / clocks
clocks is a small utility to render additional time zones in your MacOS status bar.
☆12 · Jul 6, 2023 · Updated 3 years ago
acdha / django-performance-tools
EXPERIMENTAL Django performance monitoring utilities
☆15 · Nov 5, 2013 · Updated 12 years ago
bradfitz / grpc-go16-demo
Demonstrating using Go 1.6's http2 to do grpc
☆14 · Jan 10, 2016 · Updated 10 years ago
SyedMuzamilM / ai-summariser
☆14 · May 8, 2023 · Updated 3 years ago
StrongResearch / isc-demos
Deep learning examples for the Instant Super Computer
☆20 · Jan 28, 2026 · Updated 5 months ago
NeuralSamurAI / Comfyui-TelegramSender
Send images, captions and text to Telegram channels and DM's from comfyui
☆12 · Apr 22, 2024 · Updated 2 years ago
milesmcc / truthsocial
Automatically updated dump of Truth Social's source code (reskinned Mastodon)
☆45 · May 5, 2025 · Updated last year
reproducible-containers / repro-pkg-cache
Dockerfile examples for reproducing package cache (e.g., `/etc/apk/cache`)
☆30 · Sep 16, 2023 · Updated 2 years ago
kiwigrid / gcp-serviceaccount-controller
This is a controller to automatically create gcp service accounts and save them into kubernetes secrets
☆16 · Feb 6, 2023 · Updated 3 years ago
ricklon / USB-Arduino-Developer-Device
Use an Arduino with USB HID support to control a project in Git
☆13 · Jan 3, 2012 · Updated 14 years ago
webrecorder / browsertrix-behaviors
Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.
☆58 · Updated this week
tmc / sc
Statecharts
☆17 · May 28, 2026 · Updated last month
Prismadic / magnet
the small distributed language model toolkit; fine-tune state-of-the-art LLMs anywhere, rapidly
☆33 · Oct 19, 2024 · Updated last year
johndpope / DockerParseyMcParsefaceAPI
DEPRECATED use https://github.com/tensorflow/models/blob/master/syntaxnet/g3doc/CLOUD.md
☆13 · Apr 6, 2017 · Updated 9 years ago
gleicon / habitat
habitat is a coreutils 'env' clone to link 12factor apps and service discovery systems as consul and etcd. It queries the kv db embedded …
☆17 · Sep 1, 2016 · Updated 9 years ago
makenew / serverless-benthos
Bootstrap a new Benthos Serverless project in five minutes or less.
☆16 · Nov 29, 2022 · Updated 3 years ago
juvi21 / llama2.jl
Inference Llama 2 in one file of pure C. Nahh wait, now fresh in Julia!
☆25 · Aug 2, 2023 · Updated 2 years ago
magefile / mage-action
GitHub Action for Mage
☆34 · Updated this week
thomastaylor312 / ollama-provider
A wasmCloud provider for the ollama API
☆12 · Apr 23, 2024 · Updated 2 years ago
0xfourzerofour / chatgpt-mac-shortcut
A simple shortcut to have access to chatgpt anywhere on your computer
☆15 · Mar 26, 2023 · Updated 3 years ago
hvkshetry / office-365-mcp-server
A Model Context Protocol (MCP) server for Microsoft 365 integration. Provides 24 consolidated tools for email, calendar, Teams, planner, …
☆16 · Updated this week
jmanhype / mcp-flux-studio
A Model Context Protocol server for Flux image generation, providing tools for image generation, manipulation, and control
☆25 · Mar 25, 2026 · Updated 4 months ago