yblir/vllm-learn

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yblir/vllm-learn)

yblir / vllm-learn

☆17

Alternatives and similar repositories for vllm-learn

Users that are interested in vllm-learn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

LPD-EPFL / swarm-kv
View on GitHub
A fault-tolerant RDMA-based disaggregated key-value store with 1-RTT UPDATEs and GETs thanks to the SWARM replication protocol
☆14Sep 25, 2024Updated last year
alibaba / hap
View on GitHub
☆16Apr 13, 2024Updated 2 years ago
sysml / xen
View on GitHub
Mirror of the Xen Repository (PRs not accepted see: http://wiki.xenproject.org/wiki/Submitting_Xen_Project_Patches)
☆17Sep 12, 2017Updated 8 years ago
Dominic789654 / LongGenBench
View on GitHub
Source code for the paper "LongGenBench: Long-context Generation Benchmark"
☆24Oct 8, 2024Updated last year
ThisIsHwang / EXIT
View on GitHub
Official code and resources for the paper "EXIT: Context-Aware Extractive Compression for Enhancing Retrieval-Augmented Generation."
☆25Jul 15, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
anoma / alucard
View on GitHub
A common lisp DSL for writing zero knowledge circuits
☆18Oct 19, 2022Updated 3 years ago
Oneflow-Inc / conda-env
View on GitHub
☆12Mar 13, 2023Updated 3 years ago
Geeloon / hexagon_examples
View on GitHub
some hexagon intrinsic examples based on Qualcomm Hexagon
☆17Mar 7, 2025Updated last year
lmmarisej / DDD-smartrm-micro-services-study
View on GitHub
领域驱动设计——实战落地代码，基于单体项目拆分【基于SpringBootCloud的微服务项目】
☆13Sep 21, 2022Updated 3 years ago
max8rr8 / kvm-guest-drivers-windows
View on GitHub
Windows paravirtualized
☆26Sep 5, 2025Updated 10 months ago
TianWeiChang / java_mianshi
View on GitHub
java面试相关内容
☆15Nov 2, 2022Updated 3 years ago
renjithsraj / epicupsdraw
View on GitHub
Drawing Diagram tool. Which is made by angular 6.0.8 and mxGraph
☆14Jul 25, 2018Updated 8 years ago
owenliang / learnpytorch
View on GitHub
☆11Jun 6, 2023Updated 3 years ago
kfish / micrograd-cpp-2023
View on GitHub
A C++ port of karpathy/micrograd, a tiny scalar-valued autograd engine and a neural net library
☆13Nov 24, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ProjectMitosisOS / dmerge-eurosys24-ae
View on GitHub
Artifact evaluation repo for EuroSys'24.
☆29Nov 7, 2023Updated 2 years ago
NathanSWard / simd_hash_map
View on GitHub
A c++ hash map/table which utilizes simd (specifically Intel x86 SSE/AVX)
☆12Apr 30, 2019Updated 7 years ago
WenqiJiang / SC-ANN-FPGA
View on GitHub
☆26May 30, 2025Updated last year
chenzhuoyu / SimpleRPC
View on GitHub
A C++-based RPC framework
☆12Oct 28, 2021Updated 4 years ago
Shybert-AI / claude-code-deepseek
View on GitHub
可运行的Claude代码源码,采用双端模型进行驱动
☆18Apr 5, 2026Updated 3 months ago
IsaacRe / vllm-kvcompress
View on GitHub
KV cache compression for high-throughput LLM inference
☆158Feb 5, 2025Updated last year
Red-EAD / helmsman
View on GitHub
Large-Scale Disk-Based Vector Index
☆39Jun 18, 2026Updated last month
uygarkurt / BERT-PyTorch
View on GitHub
☆17Jan 3, 2025Updated last year
KNGB / wps-master
View on GitHub
☆11Apr 25, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
pkunlp-icler / ChildTuning
View on GitHub
☆33Sep 29, 2021Updated 4 years ago
Bruce-Lee-LY / cutlass_gemm
View on GitHub
Multiple GEMM operators are constructed with cutlass to support LLM inference.
☆20Aug 3, 2025Updated 11 months ago
lmmarisej / DDD-smartrm-monolith-study
View on GitHub
领域驱动设计——实战落地代码【基于SpringBoot的单体项目】
☆19Nov 16, 2022Updated 3 years ago
XPU-Forces / xpu_graph
View on GitHub
A torch compile backend for multi-targets
☆51May 27, 2026Updated last month
microsoft / dist-ir
View on GitHub
An IR for efficiently simulating distributed ML computation.
☆33Jan 13, 2024Updated 2 years ago
Peter-Chou / transformer_cpp_tokenizers
View on GitHub
transformer tokenizers (e.g. BERT tokenizer) in C++ (WIP)
☆18Apr 7, 2022Updated 4 years ago
zyxxmu / cam
View on GitHub
Pytorch implementation of our paper accepted by ICML 2024 -- CaM: Cache Merging for Memory-efficient LLMs Inference
☆50Jun 19, 2024Updated 2 years ago
zhihu / TLLM_QMM
View on GitHub
TLLM_QMM strips the implementation of quantized kernels of Nvidia's TensorRT-LLM, removing NVInfer dependency and exposes ease of use Pyt…
☆16Jul 5, 2024Updated 2 years ago
mutonix / pyramidinfer
View on GitHub
☆47Nov 25, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
KusionStack / konfig
View on GitHub
Shared repository of application models and components, and CI suite for GitOps workflows
☆27Jan 16, 2025Updated last year
Ascend / torchair
View on GitHub
☆26Jun 8, 2026Updated last month
NgCafai / deep-learning-system
View on GitHub
Homework of CMU 10-414/714: Deep Learning Systems (https://dlsyscourse.org/)
☆15Mar 21, 2024Updated 2 years ago
Balding-Lee / Pytorch4NLP
View on GitHub
This repository contains some sentiment analysis models and sequence tagging models, including BiLSTM, TextCNN, BERT for both tasks. All …
☆13Feb 1, 2023Updated 3 years ago
JF-D / Proteus
View on GitHub
☆24Jul 7, 2024Updated 2 years ago
windhan2100 / graphql
View on GitHub
GraphQL的简单学习例子
☆19Mar 8, 2019Updated 7 years ago
upsj / gpu_selection
View on GitHub
Parallel selection on GPUs
☆15Mar 23, 2021Updated 5 years ago