Deep-Spark / iluvatar-corex-ixrtLinks
This repository contains the Open Source Software components of the Iluvatar Corex IxRT. It includes the sources for IxRT plugins and deploy tools, as well as sample applications demonstrating the usages and capabilities of the IxRT platform.
☆17Updated 2 weeks ago
Alternatives and similar repositories for iluvatar-corex-ixrt
Users that are interested in iluvatar-corex-ixrt are comparing it to the libraries listed below
Sorting:
- DeepSparkHub selects hundreds of application algorithms and models, covering various fields of AI and general-purpose computing, to suppo…☆69Updated 3 weeks ago
- The DeepSpark open platform selects hundreds of open source application algorithms and models that are deeply coupled with industrial app…☆45Updated last week
- CNStream is a streaming framework for building Cambricon machine learning pipelines http://forum.cambricon.com https://gitee.com/Solu…☆53Updated 8 months ago
- ☆32Updated 2 years ago
- This fork of BVLC/Caffe is dedicated to supporting Cambricon deep learning processor and improving performance of this deep learning fram…☆41Updated 5 years ago
- Yinghan's Code Sample☆358Updated 3 years ago
- Development repository for the Triton-Linalg conversion☆206Updated 10 months ago
- A CPU tool for benchmarking the peak of floating points☆569Updated 4 months ago
- heterogeneity-aware-lowering-and-optimization☆256Updated last year
- Machine learning compiler based on MLIR for Sophgo TPU.☆824Updated last week
- Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruct…☆503Updated last year
- A model compilation solution for various hardware☆457Updated 3 months ago
- Efficient operation implementation based on the Cambricon Machine Learning Unit (MLU) .☆141Updated this week
- FlagGems is an operator library for large language models implemented in the Triton Language.☆783Updated this week
- row-major matmul optimization☆691Updated 3 months ago
- learning how CUDA works☆347Updated 9 months ago
- collection of benchmarks to measure basic GPU capabilities☆461Updated last month
- how to learn PyTorch and OneFlow☆460Updated last year
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆908Updated 11 months ago
- A self-learning tutorail for CUDA High Performance Programing.☆768Updated 5 months ago
- Hands-On Practical MLIR Tutorial☆677Updated 2 years ago
- Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.☆394Updated 11 months ago
- 📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).☆51Updated 7 months ago
- An MLIR-based compiler framework bridges DSLs (domain-specific languages) to DSAs (domain-specific architectures).☆663Updated last week
- 先进编译实验室的个人主页☆174Updated last month
- AI Accelerator Benchmark focuses on evaluating AI Accelerators from a practical production perspective, including the ease of use and ver…☆279Updated 3 months ago
- ☆1,041Updated last year
- code reading for tvm☆76Updated 3 years ago
- ☆156Updated 11 months ago
- compiler learning resources collect.☆2,598Updated 8 months ago