a fun and educational take on vLLM
☆199Jan 25, 2026Updated 4 months ago
Alternatives and similar repositories for nano-vllm
Users that are interested in nano-vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch Lightning based framework to run experiments for self-supervised learning tasks.☆10Feb 14, 2020Updated 6 years ago
- Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation☆12Feb 23, 2023Updated 3 years ago
- JAX bindings for the flash-attention3 kernels☆22Jan 2, 2026Updated 4 months ago
- ☆17Apr 29, 2025Updated last year
- Structured Generation Evals☆14Sep 25, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- QuickClash Revit Add-in for Clash Detection☆11Jun 17, 2022Updated 3 years ago
- Sardeenz is a proof-of-concept application that allows you to load more than one model on a given GPU. It allows you to add more and more…☆58Updated this week
- Two implementations of ZeRO-1 optimizer sharding in JAX☆14Jun 11, 2023Updated 2 years ago
- A Telegram bot to attach a banner about Yalda on your avatar.☆13Feb 10, 2023Updated 3 years ago
- fmchisel: Efficient Compression and Training Algorithms for Foundation Models☆87May 4, 2026Updated 3 weeks ago
- 该仓库主要记录 NLP 算法工程师相关的 搜索引擎 学习笔记☆14Apr 9, 2022Updated 4 years ago
- 中文文档理解多模态语言模型,支持多模态文档信息抽取,文档embedding☆12Jun 26, 2022Updated 3 years ago
- A machine learning framework with readable source code☆15Apr 30, 2025Updated last year
- Efficient implementation (and explorations) into polar coordinate positional embedding (PoPE) - from Gopalakrishnan et al. under Schmidhu…☆64Mar 25, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 3D Telecommunications project utilizing Holoportation technology to provide live volumetric capture. Used in one case to increase the re…☆23Apr 15, 2026Updated last month
- ☆45Mar 31, 2025Updated last year
- native Rust implementation of Kafka protocol and api☆14Jun 13, 2023Updated 2 years ago
- ☆14Aug 7, 2021Updated 4 years ago
- Current Alpha version of the ONTO-TRON-5000☆41Dec 1, 2025Updated 5 months ago
- AugmentCode 批量注册账号脚本☆24Aug 19, 2025Updated 9 months ago
- ☆22Feb 25, 2019Updated 7 years ago
- E-prescription app developed with Flutter and Firebase🔥.☆10Mar 31, 2022Updated 4 years ago
- ☆33Jan 6, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A collection of experiments related to LLM inference with llama.cpp/mlx☆40May 19, 2026Updated last week
- A curated list of Artificial Intelligence Labs doing cutting edge research. Feel free to raise pull request with any additions.☆18Jul 12, 2020Updated 5 years ago
- torchcomms: a modern PyTorch communications API☆364Updated this week
- NeuraChip Accelerator Simulator☆16Apr 26, 2024Updated 2 years ago
- [ACL 2025] Official implementation of the "CoT-ICL Lab" framework☆11May 1, 2026Updated 3 weeks ago
- Example Rust repo emitting Wasm SIMD 128 instructions☆12Jul 26, 2019Updated 6 years ago
- Mindwrite, is simple flutter project with clean architecture and Bloc☆13Oct 24, 2024Updated last year
- A cycle-accurate RISC-V CPU simulator + RTL modeling library in pure Python.☆18Aug 27, 2025Updated 8 months ago
- Cross-GPU KV Cache Marketplace☆22Nov 12, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- (Verilog) A simple convolution layer implementation with systolic array structure☆13May 9, 2022Updated 4 years ago
- ☆11May 26, 2021Updated 5 years ago
- Train an LLM to generate cracked Manim animations for mathematical concepts.☆23Mar 14, 2025Updated last year
- ☆29Feb 27, 2025Updated last year
- A repo explaining with an example how to extend the kubernetes default scheduler☆17Jul 11, 2019Updated 6 years ago
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…☆13Aug 12, 2022Updated 3 years ago
- pydantic-ai 介紹教學☆16Aug 17, 2025Updated 9 months ago