a fun and educational take on vLLM
☆204Jan 25, 2026Updated 4 months ago
Alternatives and similar repositories for nano-vllm
Users that are interested in nano-vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Educational WIP☆73Feb 16, 2026Updated 3 months ago
- Hierarchical Navigable Small World Graphs☆24Aug 17, 2024Updated last year
- Structured Generation Evals☆14Sep 25, 2024Updated last year
- Using RAG to generate data for model fine-tuning.☆14Apr 16, 2025Updated last year
- A Telegram bot to attach a banner about Yalda on your avatar.☆13Feb 10, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Distributed SDDMM Kernel☆12Jul 8, 2022Updated 3 years ago
- How to quickly serve an LLM using Fast API, Celery, and Redis☆17Aug 29, 2023Updated 2 years ago
- fmchisel: Efficient Compression and Training Algorithms for Foundation Models☆88May 4, 2026Updated last month
- A simple C compiler I wrote to demonstrate how to write simple compilers. Currently targets x86_64☆29Jun 27, 2025Updated 11 months ago
- A machine learning framework with readable source code☆16Apr 30, 2025Updated last year
- Modern, minimal, and modular LaTeX CV template ✨ 📄☆35Dec 4, 2025Updated 6 months ago
- 3D Telecommunications project utilizing Holoportation technology to provide live volumetric capture. Used in one case to increase the re…☆22Apr 15, 2026Updated 2 months ago
- ☆45Mar 31, 2025Updated last year
- native Rust implementation of Kafka protocol and api☆14Jun 13, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- torchcomms: a modern PyTorch communications API☆368Updated this week
- The "CoT-ICL Lab" framework for meta-training transformers☆11Jun 3, 2026Updated last week
- A YAML representation library which explicitly retains provenance data☆19May 26, 2025Updated last year
- Ludic – an LLM-RL library for the era of experience☆63Jan 9, 2026Updated 5 months ago
- Database write ahead log in Rust.☆21Mar 15, 2025Updated last year
- Anti-aliasing for MikuMikuDance☆10Nov 25, 2021Updated 4 years ago
- Lenient parser for Semantic Version numbers in Rust☆13Feb 13, 2023Updated 3 years ago
- Programming framework for serverless compute☆15Dec 3, 2019Updated 6 years ago
- ☆30Feb 27, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A repo explaining with an example how to extend the kubernetes default scheduler☆17Jul 11, 2019Updated 6 years ago
- Node.js client for Fluvio☆15May 4, 2026Updated last month
- A stand-alone pure C++ library for linear algebra and machine learning☆10Mar 16, 2016Updated 10 years ago
- A simple attribution engine.☆34Feb 1, 2023Updated 3 years ago
- ☆10Jul 4, 2022Updated 3 years ago
- Source code of the IPDPS '21 paper: "TileSpMV: A Tiled Algorithm for Sparse Matrix-Vector Multiplication on GPUs" by Yuyao Niu, Zhengyang…☆13Aug 12, 2022Updated 3 years ago
- A python wrap for Baidu Yuyin API☆10Aug 3, 2016Updated 9 years ago
- A Structured Reasoning Framework for AI Agents featuring cascade thinking and structured problem solving☆21Jul 25, 2025Updated 10 months ago
- A simulation framework for modeling efficiency of Graph Neural Network Dataflows☆24Feb 14, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- An HBM FPGA based SpMV Accelerator☆18Aug 29, 2024Updated last year
- ☆16Feb 27, 2024Updated 2 years ago
- GPGPU-SIM 使用篇☆14Nov 12, 2022Updated 3 years ago
- FFmpeg 练习☆10Jul 4, 2020Updated 5 years ago
- A curated list of resources on Reinforcement Learning with Verifiable Rewards (RLVR) and the reasoning capability boundary of Large Langu…☆88Dec 12, 2025Updated 6 months ago
- DuckDB Extension for cryptographic hash functions and HMAC☆28Apr 22, 2026Updated last month
- For CPU experiment☆14Feb 23, 2021Updated 5 years ago