a fun and educational take on vLLM
☆194Jan 25, 2026Updated 3 months ago
Alternatives and similar repositories for nano-vllm
Users that are interested in nano-vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Notes and code for Programming Massively Parallel Processors☆13Mar 29, 2025Updated last year
- JAX bindings for the flash-attention3 kernels☆22Jan 2, 2026Updated 4 months ago
- Hierarchical Navigable Small World Graphs☆20Aug 17, 2024Updated last year
- Using RAG to generate data for model fine-tuning.☆13Apr 16, 2025Updated last year
- How to quickly serve an LLM using Fast API, Celery, and Redis☆17Aug 29, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Distributed SDDMM Kernel☆12Jul 8, 2022Updated 3 years ago
- fmchisel: Efficient Compression and Training Algorithms for Foundation Models☆87Oct 23, 2025Updated 6 months ago
- ☆12Sep 18, 2024Updated last year
- Distributed by design. Data-driven by default.☆62Updated this week
- 3D Telecommunications project utilizing Holoportation technology to provide live volumetric capture. Used in one case to increase the re…☆21Apr 15, 2026Updated 3 weeks ago
- ☆46Mar 31, 2025Updated last year
- native Rust implementation of Kafka protocol and api☆14Jun 13, 2023Updated 2 years ago
- Current Alpha version of the ONTO-TRON-5000☆40Dec 1, 2025Updated 5 months ago
- Advanced futures library☆16Apr 24, 2026Updated last week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- torchcomms: a modern PyTorch communications API☆359Updated this week
- NeuraChip Accelerator Simulator☆16Apr 26, 2024Updated 2 years ago
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11Updated this week
- Example Rust repo emitting Wasm SIMD 128 instructions☆12Jul 26, 2019Updated 6 years ago
- A cycle-accurate RISC-V CPU simulator + RTL modeling library in pure Python.☆18Aug 27, 2025Updated 8 months ago
- Lenient parser for Semantic Version numbers in Rust☆12Feb 13, 2023Updated 3 years ago
- (Verilog) A simple convolution layer implementation with systolic array structure☆13May 9, 2022Updated 3 years ago
- Programming framework for serverless compute☆15Dec 3, 2019Updated 6 years ago
- ☆78Feb 10, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A repo explaining with an example how to extend the kubernetes default scheduler☆17Jul 11, 2019Updated 6 years ago
- To better understand the ggml library☆27Jun 13, 2025Updated 10 months ago
- A stand-alone pure C++ library for linear algebra and machine learning☆10Mar 16, 2016Updated 10 years ago
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…☆13Aug 12, 2022Updated 3 years ago
- 基于FP16的二维脉动阵列电路设计☆13Feb 23, 2023Updated 3 years ago
- ☆10May 9, 2019Updated 6 years ago
- A simulation framework for modeling efficiency of Graph Neural Network Dataflows☆24Feb 14, 2025Updated last year
- An HBM FPGA based SpMV Accelerator☆18Aug 29, 2024Updated last year
- Agentic Virtual Lab☆19Nov 30, 2025Updated 5 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆15Feb 27, 2024Updated 2 years ago
- Code for evaluating WebRTC performance☆14Sep 18, 2017Updated 8 years ago
- GPGPU-SIM 使用篇☆14Nov 12, 2022Updated 3 years ago
- gnosis: signifying knowing through observation or experience☆22Jul 19, 2020Updated 5 years ago
- ☆30Jul 21, 2025Updated 9 months ago
- Source code of WSiP model☆11Aug 14, 2022Updated 3 years ago
- A YAML 1.2 parser using a greedy parsing algorithm with PEG atoms. Support anchors, directives, positions, tags, serde, and no-std.☆20Aug 28, 2025Updated 8 months ago