Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU).
☆166Jun 2, 2026Updated last week
Alternatives and similar repositories for mlu-ops
Users that are interested in mlu-ops are comparing it to the libraries listed below.
- ☆33Apr 20, 2023Updated 3 years ago
- ☆16Nov 28, 2023Updated 2 years ago
- CNStream is a streaming framework for building Cambricon machine learning pipelines http://forum.cambricon.com https://gitee.com/Solu…☆55Mar 21, 2025Updated last year
- This fork of BVLC/Caffe is dedicated to supporting Cambricon deep learning processor and improving performance of this deep learning fram…☆40May 15, 2020Updated 6 years ago
- Development repository for the Triton-Linalg conversion☆219Feb 7, 2025Updated last year
- ☆54Mar 15, 2025Updated last year
- A Triton JIT runtime and ffi provider in C++☆35May 27, 2026Updated last week
- ☆61Apr 27, 2026Updated last month
- Shared Middle-Layer for Triton Compilation☆335Dec 5, 2025Updated 6 months ago
- ☆13Jan 7, 2025Updated last year
- Integrated MLU-accelerated video processing into ffmpeg on Ubuntu/CentOS☆63Dec 15, 2025Updated 5 months ago
- Intelligent Computing Systems lab: implementing five operators in BangC on the Cambricon programming platform☆33Dec 5, 2019Updated 6 years ago
- ☆121May 19, 2025Updated last year
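- A demonstration of autoTVM search for optimized neural network inference code: compiles the open-source model centerface with tvm, uses autoTVM to search for the best inference code, and finally deploys the result compiled as C++ code. The demo platform is CUDA, but other platforms are possible, such as Raspberry Pi, Android phones, and iPhones. This is a demonstration of …☆31May 6, 2021Updated 5 years ago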
- An intelligent matrix format designer for SpMV☆10Oct 10, 2023Updated 2 years ago
- ☆17Jan 24, 2024Updated 2 years ago
- Framework for face annotations, viewer and submodules☆21Jan 26, 2021Updated 5 years ago
- Triton adapter for Ascend. Mirror of https://gitcode.com/ascend/triton-ascend☆125May 18, 2026Updated 3 weeks ago
- A library for syntactically rewriting Python programs, pronounced (sinner).☆66Feb 22, 2022Updated 4 years ago
- The first open source triton inference engine for Stable Diffusion, specifically for sdxl☆12Nov 27, 2023Updated 2 years ago
- ☆182Updated this week
- Example of using pytorch's open device registration API☆31Oct 14, 2022Updated 3 years ago
- a simple API to use CUPTI☆10Aug 19, 2025Updated 9 months ago
- A reproduction of the libsmctrl paper, with an added Python interface that can be called flexibly from Python to allocate compute resources☆12May 21, 2024Updated 2 years ago
- Compact and Agent-Native MoE Training System☆144Updated this week
- The vLLM XPU kernels for Intel GPU☆47Updated this week
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆928Dec 30, 2024Updated last year
- Cavs: An Efficient Runtime System for Dynamic Neural Networks☆15Sep 18, 2020Updated 5 years ago
- Code for Incorporating Relevance Feedback for Information-Seeking Retrieval using Few-Shot Document Re-Ranking, EMNLP 2022, https://aclan…☆14Mar 30, 2026Updated 2 months ago
- ☆423Feb 24, 2026Updated 3 months ago
- play gemm with tvm☆91Jul 22, 2023Updated 2 years ago
- mKernel: fast multi-node, multi-GPU fused kernels☆216Updated this week
- High-Performance KV Cache Storage Engine on CXL Shared Memory for LLM Inference☆52Jun 2, 2026Updated last week
- CrowdOS☆10Jun 22, 2021Updated 4 years ago
- ASPLOS'24: Optimal Kernel Orchestration for Tensor Programs with Korch