Cambricon/mlu-ops

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Cambricon/mlu-ops)

Cambricon / mlu-ops

Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU) .

☆176

Alternatives and similar repositories for mlu-ops

Users that are interested in mlu-ops are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Cambricon / catch
View on GitHub
☆33Apr 20, 2023Updated 3 years ago
superlich7 / caffe
View on GitHub
This fork of BVLC/Caffe is dedicated to supporting Cambricon deep learning processor and improving performance of this deep learning fram…
☆40May 15, 2020Updated 6 years ago
Cambricon / triton-linalg
View on GitHub
Development repository for the Triton-Linalg conversion
☆221Feb 7, 2025Updated last year
Cambricon / magicmind_cloud
View on GitHub
☆16Nov 28, 2023Updated 2 years ago
Cambricon / CNStream
View on GitHub
CNStream is a streaming framework for building Cambricon machine learning pipelines http://forum.cambricon.com https://gitee.com/Solu…
☆55Mar 21, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Cambricon / cambricon-k8s-device-plugin
View on GitHub
☆61Jun 26, 2026Updated 3 weeks ago
microsoft / triton-shared
View on GitHub
Shared Middle-Layer for Triton Compilation
☆340Dec 5, 2025Updated 7 months ago
FuyuWang / Soter
View on GitHub
☆13Jan 7, 2025Updated last year
liaohsiaopin / Cambricon_BangC_Practice
View on GitHub
智能计算系统实验在Cambricon编程平台上实现用BangC实现五个算子
☆32Dec 5, 2019Updated 6 years ago
Cambricon / ffmpeg-mlu
View on GitHub
Integrated MLU-accelerated video processing into ffmpeg on Ubuntu/Centos
☆63Dec 15, 2025Updated 7 months ago
microsoft / AttentionEngine
View on GitHub
☆123May 19, 2025Updated last year
leoluopy / autotvm_tutorial
View on GitHub
autoTVM神经网络推理代码优化搜索演示，基于tvm编译开源模型centerface，并使用autoTVM搜索最优推理代码，　最终部署编译为c++代码，演示平台是cuda，可以是其他平台，例如树莓派，安卓手机，苹果手机．Thi is a demonstration of …
☆31May 6, 2021Updated 5 years ago
ZhW-loop / UniCoMo
View on GitHub
☆13Sep 19, 2024Updated last year
summerspringwei / souffle-ae
View on GitHub
☆17Jan 24, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
flagos-ai / libtriton_jit
View on GitHub
A Triton JIT runtime and ffi provider in C++
☆37Updated this week
arnavdantuluri / StableTriton
View on GitHub
The first open source triton inference engine for Stable Diffusion, specifically for sdxl
☆12Nov 27, 2023Updated 2 years ago
pcr-upm / faces_framework
View on GitHub
Framework for face annotations, viewer and submodules
☆21Jan 26, 2021Updated 5 years ago
octoml / synr
View on GitHub
A library for syntactically rewriting Python programs, pronounced (sinner).
☆66Feb 22, 2022Updated 4 years ago
zhaiyi000 / tlp
View on GitHub
☆42Apr 25, 2024Updated 2 years ago
wu-kan / wuk_cupti_wrapper
View on GitHub
a simple API to use CUPTI
☆10Aug 19, 2025Updated 11 months ago
tanzelin430 / libsmctrl
View on GitHub
libsmctrl论文的复现，添加了python端接口，可以在python端灵活调用接口来分配计算资源
☆12May 21, 2024Updated 2 years ago
ROCm / rocMLIR
View on GitHub
☆183Updated this week
bdhirsh / pytorch_open_registration_example
View on GitHub
Example of using pytorch's open device registration API
☆31Oct 14, 2022Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
alibaba / BladeDISC
View on GitHub
BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.
☆932Dec 30, 2024Updated last year
zhisbug / Cavs
View on GitHub
Cavs: An Efficient Runtime System for Dynamic Neural Networks
☆15Sep 18, 2020Updated 5 years ago
triton-lang / triton-ext
View on GitHub
A collection of out-of-tree extensions for the Triton language and compiler
☆30Updated this week
tensorflow / mlir-hlo
View on GitHub
☆421Feb 24, 2026Updated 4 months ago
UKPLab / incorporating-relevance
View on GitHub
Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan…
☆14Mar 30, 2026Updated 3 months ago
XPU-Forces / xpu_graph
View on GitHub
A torch compile backend for multi-targets
☆51May 27, 2026Updated last month
mit-han-lab / ncu-report-skill
View on GitHub
☆156May 24, 2026Updated last month
LeiWang1999 / tvm_gpu_gemm
View on GitHub
play gemm with tvm
☆91Jul 22, 2023Updated 2 years ago
alibaba / heterogeneity-aware-lowering-and-optimization
View on GitHub
heterogeneity-aware-lowering-and-optimization
☆259Jan 20, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
YJMSTR / flash-linear-attention
View on GitHub
FLA but cuTile
☆27Apr 17, 2026Updated 3 months ago
humuyan / Korch
View on GitHub
ASPLOS'24: Optimal Kernel Orchestration for Tensor Programs with Korch
☆41Mar 27, 2025Updated last year
tile-ai / tvm
View on GitHub
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆20Jul 13, 2026Updated last week
serdes21 / flashtile
View on GitHub
FlashTile is a CUDA Tile IR compiler that is compatible with NVIDIA's tileiras, targeting SM70 through SM121 NVIDIA GPUs.
☆61Feb 6, 2026Updated 5 months ago
Ascend / triton-ascend
View on GitHub
Triton adapter for Ascend. Mirror of https://gitcode.com/ascend/triton-ascend
☆127May 18, 2026Updated 2 months ago
PKUZHOU / PetS-ATC-2022
View on GitHub
☆10Sep 14, 2023Updated 2 years ago
xforcevesa / pytorch-riscv64-oe24
View on GitHub
PyTorch for RISC-V Architecture on OpenEuler 24.03
☆13Jun 27, 2024Updated 2 years ago