Cambricon / vllm-mluView external linksLinks
☆60Feb 4, 2026Updated last week
Alternatives and similar repositories for vllm-mlu
Users that are interested in vllm-mlu are comparing it to the libraries listed below
Sorting:
- Pythonic interface to the EMC Unity REST API☆10Mar 9, 2022Updated 3 years ago
- a simple pingpong buffer test☆12Feb 11, 2015Updated 11 years ago
- An SSH plugin for Dify☆12Jan 16, 2026Updated last month
- In this programming assignment you will implement a streaming video server and client that communicate control commands via the Real-Time…☆11Dec 29, 2012Updated 13 years ago
- ☆13Jun 11, 2024Updated last year
- This is a depth-anything-v2 onnxruntime inference by cpp☆15Sep 2, 2024Updated last year
- MobileSAM のエンコーダー/デコーダーをONNXに変換し、推論するサンプル☆12Apr 11, 2024Updated last year
- yolov8在hisi3536a推理☆11Dec 15, 2023Updated 2 years ago
- A simple tool for parsing the profile.json file of mxnet☆14Aug 1, 2018Updated 7 years ago
- Docker&vLLM官方镜像部署DeepSeek模型,在生产环境中提供类OpenAI接口服务。☆15Jul 17, 2025Updated 6 months ago
- TMMA: A Tiled Matrix Multiplication Accelerator for Self-Attention Projections in Transformer Models, optimized for edge deployment on Xi…☆25Mar 24, 2025Updated 10 months ago
- 动手写全文搜索引擎☆10Aug 12, 2020Updated 5 years ago
- ☆11Aug 4, 2020Updated 5 years ago
- tensorrt部署教程☆11Aug 1, 2025Updated 6 months ago
- ☆13Jan 7, 2025Updated last year
- ☆10Jul 18, 2024Updated last year
- Build a simple CMD chat interface with llama.cpp and C++☆14Sep 19, 2025Updated 4 months ago
- Includes the SVD-based approximation algorithms for compressing deep learning models and the FPGA accelerators exploiting such approximat…☆16Mar 3, 2023Updated 2 years ago
- ☆14Nov 28, 2023Updated 2 years ago
- Simple starter CMake project that uses NVBench.☆15May 6, 2025Updated 9 months ago
- ☆19Jan 28, 2025Updated last year
- Chinese Guide for Alveo Getting Started☆12May 18, 2020Updated 5 years ago
- Object browser for manipulating content in ECS.☆11Jan 8, 2019Updated 7 years ago
- Mathematical expression evaluator with just in time code generation.☆12Apr 7, 2013Updated 12 years ago
- CUDA C simple application for Nvidia's GPU☆11Jun 7, 2022Updated 3 years ago
- Dockerfiles for poetry/mlc-llm(rk3588)/...☆10Sep 13, 2023Updated 2 years ago
- inference on tvm runtime using c++ with gpu enabled☆10Apr 25, 2018Updated 7 years ago
- Multi Layer Perceptron by Vivado HLS for Xilinx FPGA implementation☆12Dec 26, 2016Updated 9 years ago
- ☆12Jun 3, 2019Updated 6 years ago
- 简单rtsp服务器,支持264 aac☆12Dec 17, 2021Updated 4 years ago
- Example of Matrix Multiplication using Map Reduce paradigm in python☆10Oct 25, 2016Updated 9 years ago
- 简单快速的部署深度学习模型☆13Sep 3, 2023Updated 2 years ago
- Automate the batch upload and parsing of documents into Dify's knowledge base, reducing manual intervention and wait time.☆15Aug 29, 2024Updated last year
- Connects Campaign Manager to the RTB4FREE bidders☆13Nov 16, 2022Updated 3 years ago
- Libdigest is a small C library for parsing and generating HTTP Digest Access Authentication (rfc2617) header strings☆13Apr 3, 2016Updated 9 years ago
- A unified and extensible pipeline for deep learning model inference with C++. Now support yolov8, yolov9, clip, and nanosam. More models …☆12Aug 3, 2025Updated 6 months ago
- a game framework. warning: wip, dev, unstable, radiation hazard, defcon 3☆24May 10, 2015Updated 10 years ago
- A package for pedestrian detection, tracking, and re-identification.☆13Feb 28, 2021Updated 4 years ago
- ☆58May 4, 2024Updated last year