Build your own high performance LLM inference engine in C++ and CUDA - a smaller version of vLLM
☆805Apr 14, 2026Updated 2 months ago
Alternatives and similar repositories for tiny-vllm
Users that are interested in tiny-vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆20May 30, 2026Updated 2 weeks ago
- Stream Claude Code's hidden output (thinking, tool calls, subagents) to a separate terminal in real-time☆148Jun 9, 2026Updated last week
- ☆19Nov 11, 2025Updated 7 months ago
- A playground to make it easy to try crazy things☆33Feb 13, 2026Updated 4 months ago
- Llama2 transformer walkthrough with code examples☆31Nov 9, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Fun with wgpu: Simulating slime mold☆24Aug 22, 2024Updated last year
- A repository of ELL models☆21Jan 16, 2026Updated 5 months ago
- CODA: Rewriting Transformer Blocks as GEMM-Epilogue Programs☆213Updated this week
- 3D geoms for plotnine (grammar of graphics in Python)☆13Aug 5, 2022Updated 3 years ago
- File Shooter (uv_link_t, uv_ssl_t, uv_http_t)☆11Jul 8, 2016Updated 9 years ago
- [ACM MM 2022] This is the official implementation of "Temporal Sentiment Localization: Listen and Look in Untrimmed Videos"☆18Feb 14, 2025Updated last year
- Procgen2: A community maintained fork of procgen☆12Aug 25, 2022Updated 3 years ago
- Unofficial Experiments with AlgebraNets☆17Jun 17, 2020Updated 6 years ago
- Jest Image Snapshot Example with puppeteer and Circle CI☆13Apr 20, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Learning Pytorch☆13Jun 12, 2018Updated 8 years ago
- The inference engine the open-source world built for itself.☆151Jun 12, 2026Updated last week
- Tensor library & inference framework for machine learning☆118Oct 3, 2025Updated 8 months ago
- MobileSAM のエンコーダー/デコーダーをONNXに変換し、推論するサンプル☆12Apr 11, 2024Updated 2 years ago
- ☆10Feb 12, 2021Updated 5 years ago
- VPN over UDP☆114Feb 3, 2026Updated 4 months ago
- Robust manipulation and inspection of JSON data using the already familiar Chromium Devtools☆14Nov 28, 2016Updated 9 years ago
- Image-Processing-Node-Editor で動作するYouTube入力用ノード☆13Jul 12, 2025Updated 11 months ago
- Python3 reimplementation of Wissner-Gross & Freer, 2013☆15Dec 18, 2025Updated 6 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- RLHF (Supervised fine-tuning, reward model, and PPO) step-by-step in 3 Jupyter notebooks☆248Jun 20, 2025Updated 11 months ago
- NanoDetをGoogle Colaboratory上で訓練しONNX形式のファイルをエクスポートするサンプル(This is a sample to training NanoDet on Google Colaboratory and export a file in…☆13Aug 4, 2022Updated 3 years ago
- ☆10Jul 12, 2017Updated 8 years ago
- PyCon mini 東海 2024 のトーク「Google Colaboratoryで 試すVLM」で紹介したサンプル集☆12Nov 15, 2024Updated last year
- A D3 plugin to draw contour plots of 2D functions.☆19Sep 26, 2024Updated last year
- LDC: Lightweight Dense CNN for Edge DetectionのPythonでのONNX推論サンプル☆15May 6, 2023Updated 3 years ago
- Implementation of the Monte-Carlo CTW AIXI approximation as described by Joel Veness et al.☆12Jan 14, 2017Updated 9 years ago
- メイカーの交流を円滑に進めるための<心がまえ>を明文化するプロジェクト☆10May 1, 2020Updated 6 years ago
- MAV/EVS replacement system based on the CasparCG open video server☆19Nov 15, 2016Updated 9 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- M-LSDを用いて四角形を検出し、射影変換を行うサンプルプログラム☆10Jun 2, 2021Updated 5 years ago
- This is the start page i use for raspberry pi zero usb dongle☆12Oct 30, 2016Updated 9 years ago
- A coöperative multitasking framework based on `liburing` and `libucontext`☆17Jan 2, 2026Updated 5 months ago
- VSCode LLVM Compiler Explorer☆234May 30, 2024Updated 2 years ago
- オーディオスペクトラムや波形をOpenCVで描画するサンプル☆14Aug 16, 2025Updated 10 months ago
- Modern PCL (Printer Command Language) Viewer☆11Aug 11, 2016Updated 9 years ago
- Imagen-mini for girl image generation☆12Nov 19, 2022Updated 3 years ago