RBLN-SW / vllm-rblnView external linksLinks
vLLM plugin for RBLN NPU
☆41Updated this week
Alternatives and similar repositories for vllm-rbln
Users that are interested in vllm-rbln are comparing it to the libraries listed below
Sorting:
- ⚡ A seamless integration of HuggingFace Transformers & Diffusers with RBLN SDK for efficient inference on RBLN NPUs.☆15Updated this week
- ☆27Jan 8, 2024Updated 2 years ago
- Example code for RBLN SDK developers building inference applications☆30Updated this week
- Generic library for neural collapse and several derivative works on the phenomenon.☆18Apr 14, 2025Updated 10 months ago
- ☆11Aug 23, 2023Updated 2 years ago
- Ditto is an open-source framework that enables direct conversion of HuggingFace PreTrainedModels into TensorRT-LLM engines.☆55Jul 16, 2025Updated 6 months ago
- API serving for your diffusers models☆11Jan 19, 2024Updated 2 years ago
- Parallel Self-Adjusting Computation☆15Jul 5, 2021Updated 4 years ago
- Korean politics data for research and development.☆12Jun 21, 2016Updated 9 years ago
- [CVPR 2023] "TrojViT: Trojan Insertion in Vision Transformers" by Mengxin Zheng, Qian Lou, Lei Jiang☆14Jan 5, 2024Updated 2 years ago
- ☆18Aug 27, 2025Updated 5 months ago
- ROS TensorRT Inference Nodes for DIGITS on the Jetson☆15Apr 6, 2019Updated 6 years ago
- Versor: Stop Projecting, Start Rotating. GBN (Geometric Blade Network) - A new era of AI beyond Linear Algebra.☆41Updated this week
- Website for CSE 234, Winter 2025☆13Mar 24, 2025Updated 10 months ago
- The first open source triton inference engine for Stable Diffusion, specifically for sdxl☆12Nov 27, 2023Updated 2 years ago
- ☆13May 11, 2023Updated 2 years ago
- QUICK: Quantization-aware Interleaving and Conflict-free Kernel for efficient LLM inference☆120Mar 6, 2024Updated last year
- Bluespec environment for working with the ulx3s board and its lattice ecp5 fpga☆15Mar 9, 2025Updated 11 months ago
- ☆41Aug 26, 2025Updated 5 months ago
- Hal Daume's hbc☆20Jan 23, 2010Updated 16 years ago
- Komoran 3 in Python☆11Dec 10, 2018Updated 7 years ago
- CentOS docker images, build weekly with latest security updates☆11Updated this week
- Check the latest items from gumtree and message it☆12Feb 1, 2016Updated 10 years ago
- Human Oversight for Autonomous AI Agents using Azure Logic Apps + Python☆20Feb 6, 2026Updated last week
- TransPimLib is a library for transcendental (and other hard-to-calculate) functions in general-purpose PIM systems, TransPimLib provides …☆15Apr 21, 2023Updated 2 years ago
- FMO (Friendli Model Optimizer)☆13Jan 8, 2025Updated last year
- SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs☆61Mar 25, 2025Updated 10 months ago
- Python for Informatics: Exploring Information (Korean)☆29Sep 14, 2015Updated 10 years ago
- ☆61Jul 21, 2024Updated last year
- ☆13Jul 5, 2023Updated 2 years ago
- Agent and Subagent example in OpenCode and ClaudeCode☆35Sep 11, 2025Updated 5 months ago
- An Attention Superoptimizer☆22Jan 20, 2025Updated last year
- A Collection of Parallel Algorithms for Computational Geometry☆12Mar 10, 2022Updated 3 years ago
- 삼각형의 실전! Triton☆16Feb 15, 2024Updated last year
- 🚀 Launching Bento in a Kubernetes cluster☆17Mar 16, 2025Updated 10 months ago
- Minimal repository to demonstrate fast LoRA inference with Flux family of models.☆26Jul 23, 2025Updated 6 months ago
- C++ client of a GAN model hosted by TensorFlow Serving☆11Jul 31, 2018Updated 7 years ago
- ArXiv daily dump and viewer using GitHub Actions - luvata.github.io/arxive☆14Updated this week
- ☆17Jun 9, 2024Updated last year