☆113May 11, 2026Updated last month
Alternatives and similar repositories for vllm-mlu
Users who are interested in vllm-mlu are comparing it to the libraries listed below.
- ☆11Jan 21, 2021Updated 5 years ago
- Is it difficult to develop C++ high-concurrency server applications? Come and use XServer☆10Jun 13, 2024Updated 2 years ago
- Mathematical expression evaluator with just in time code generation.☆12Apr 7, 2013Updated 13 years ago
- Sample that converts the MobileSAM encoder/decoder to ONNX and runs inference☆12Apr 11, 2024Updated 2 years ago
- [ACL2025 Oral🔥]Turning Trash into Treasure: Accelerating Inference of Large Language Models with Token Recycling☆29Nov 11, 2025Updated 7 months ago
- Flight connections map done with D3.js data visualization library.☆12Dec 5, 2019Updated 6 years ago
- ☆11Aug 4, 2020Updated 5 years ago
- 📥 🎯 (1,4/4) an MLIR-based toolchain with Vitis HLS LLVM input/output targeting FPGAs.☆15Nov 15, 2022Updated 3 years ago
- Example of Matrix Multiplication using Map Reduce paradigm in python☆10Oct 25, 2016Updated 9 years ago
- Proxy google DoH by cloudflare workers.☆19Jun 2, 2021Updated 5 years ago
- ☆94May 16, 2025Updated last year
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices☆12Jul 1, 2021Updated 4 years ago
- ☆19May 30, 2019Updated 7 years ago
- Code for paper "Spider: Any-to-Many Multimodal LLM"☆16Apr 26, 2025Updated last year
- An optimized Merkle Patricia Trie implementation on GPU, fully compatible with and integrable into Ethereum. The paper is published on VL…☆14Apr 15, 2024Updated 2 years ago
- [DATE'2025, TCAD'2025] Terafly : A Multi-Node FPGA Based Accelerator Design for Efficient Cooperative Inference in LLMs☆37Nov 13, 2025Updated 7 months ago
- ☆15Oct 11, 2019Updated 6 years ago
- A library for constructing allocators and memory pools. It also contains broadly useful abstractions and utilities for memory management.…☆95Updated this week
- In this programming assignment you will implement a streaming video server and client that communicate control commands via the Real-Time…☆11Dec 29, 2012Updated 13 years ago
- ☆13Jan 7, 2025Updated last year
- Spacemacs configuration layer for elpy☆18Jun 14, 2015Updated 11 years ago
- Using TVM to deploy Transformer on CPU and GPU☆11Aug 25, 2021Updated 4 years ago
- a game framework. warning: wip, dev, unstable, radiation hazard, defcon 3☆24May 10, 2015Updated 11 years ago
- Includes the SVD-based approximation algorithms for compressing deep learning models and the FPGA accelerators exploiting such approximat…☆17Mar 3, 2023Updated 3 years ago
- inference on tvm runtime using c++ with gpu enabled☆10Apr 25, 2018Updated 8 years ago
- ☆12Jan 25, 2023Updated 3 years ago
- xeCJK usage examples, with explanations and analysis☆14Feb 27, 2020Updated 6 years ago
- ☆15Apr 28, 2023Updated 3 years ago
- Real-time panorama and image stitching using c++ and openCV CUDA☆12Sep 8, 2021Updated 4 years ago
- Simple starter CMake project that uses NVBench.☆15May 6, 2025Updated last year
- Tutorial on bypassing the firewall with the "little airplane" (小飞机) client☆24Nov 14, 2019Updated 6 years ago
- Baidu Hook☆13Jan 7, 2016Updated 10 years ago
- TensorRT deployment tutorial☆11Aug 1, 2025Updated 10 months ago
- This is a depth-anything-v2 onnxruntime inference by cpp☆15Sep 2, 2024Updated last year
- Study notes for the FriendlyARM (友善之臂) Tiny6410 development board☆15Jun 5, 2018Updated 8 years ago
- Simple test of ARM NEON code. Performs a blit to the framebuffer.☆15Jul 23, 2013Updated 12 years ago
- ☆44Oct 11, 2025Updated 8 months ago
- Applications for OpenCL testing on Toradex Apalis iMX6Q☆13Dec 2, 2022Updated 3 years ago
- The open-source materials for paper "Sparsing Law: Towards Large Language Models with Greater Activation Sparsity".☆32Nov 12, 2024Updated last year