☆69Feb 4, 2026Updated 2 months ago
Alternatives and similar repositories for vllm-mlu
Users that are interested in vllm-mlu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Cloud Native Distributed Nearest Neighbour Search☆15Jun 9, 2020Updated 5 years ago
- Implementation of the ERFNet for Real-Time Semantic Segmentation using caffe☆15Sep 11, 2018Updated 7 years ago
- c++ implementation for ssh detector for object detect. something likes ssd☆14Jan 14, 2019Updated 7 years ago
- A one-page WebUI integrating VITS inference, training, and output in Sherpa-Onnx format.☆12Feb 2, 2025Updated last year
- ☆14Mar 29, 2020Updated 6 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆11Jan 21, 2021Updated 5 years ago
- Is it difficult to develop C++ high-concurrency server applications? Come and use XServer☆10Jun 13, 2024Updated last year
- SDA: Low-Bit Stable Diffusion Acceleration on Edge FPGAs☆18May 23, 2024Updated last year
- Indexing Module for elasticell (https://github.com/deepfabric/elasticell)☆24Feb 13, 2018Updated 8 years ago
- OpenIris-EPSIDF is the firmware part of the EyeTrackVR Project - OpenIris. This time rewritten from scrach in esp-idf☆24Feb 16, 2026Updated last month
- Mathematical expression evaluator with just in time code generation.☆12Apr 7, 2013Updated 13 years ago
- 翻译一些比较好的论文☆16Sep 15, 2016Updated 9 years ago
- MobileSAM のエンコーダー/デコーダーをONNXに変換し、推論するサンプル☆12Apr 11, 2024Updated last year
- Distributed SDDMM Kernel☆12Jul 8, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Flight connections map done with D3.js data visualization library.☆12Dec 5, 2019Updated 6 years ago
- ☆11Aug 4, 2020Updated 5 years ago
- 📥 🎯 (1,4/4) an MLIR-based toolchain with Vitis HLS LLVM input/output targeting FPGAs.☆15Nov 15, 2022Updated 3 years ago
- c++ version of ViT☆12Nov 13, 2022Updated 3 years ago
- Kratos: An FPGA Benchmark for Unrolled Deep Neural Networks with Fine-Grained Sparsity and Mixed Precision☆12Jan 19, 2026Updated 2 months ago
- Example of Matrix Multiplication using Map Reduce paradigm in python☆10Oct 25, 2016Updated 9 years ago
- [DATE'2025, TCAD'2025] Terafly : A Multi-Node FPGA Based Accelerator Design for Efficient Cooperative Inference in LLMs☆33Nov 13, 2025Updated 4 months ago
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices☆12Jul 1, 2021Updated 4 years ago
- ☆19May 30, 2019Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- The code repository of "MBQ: Modality-Balanced Quantization for Large Vision-Language Models"☆84Mar 17, 2025Updated last year
- Code for paper "Spider: Any-to-Many Multimodal LLM"☆15Apr 26, 2025Updated 11 months ago
- Chinese Guide for Alveo Getting Started☆12May 18, 2020Updated 5 years ago
- [ICCV'25] The official code of paper "Combining Similarity and Importance for Video Token Reduction on Large Visual Language Models"☆71Jan 13, 2026Updated 2 months ago
- An optimized Merkle Patricia Trie implementation on GPU, fully compatible with and integrable into Ethereum. The paper is published on VL…☆14Apr 15, 2024Updated last year
- ☆12Jun 3, 2019Updated 6 years ago
- A SIMD-based C++ library providing rank/select queries over mutable bitmaps.☆36Jan 8, 2023Updated 3 years ago
- ☆10Jul 18, 2024Updated last year
- ☆18Sep 17, 2015Updated 10 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Multi Layer Perceptron by Vivado HLS for Xilinx FPGA implementation☆12Dec 26, 2016Updated 9 years ago
- Example of applying CUDA graphs to LLaMA-v2☆11Aug 25, 2023Updated 2 years ago
- 参考sub2api和cc-gateway☆51Updated this week
- A library for constructing allocators and memory pools. It also contains broadly useful abstractions and utilities for memory management.…☆87Apr 3, 2026Updated last week
- In this programming assignment you will implement a streaming video server and client that communicate control commands via the Real-Time…☆11Dec 29, 2012Updated 13 years ago
- C++ Library for Quantum State Preparation (QSP)☆12Jan 5, 2023Updated 3 years ago
- ☆13Jan 7, 2025Updated last year