liaohsiaopin / Cambricon_BangC_Practice
Intelligent Computing Systems lab: implementing five operators in BangC on the Cambricon programming platform
☆31Updated 5 years ago
Alternatives and similar repositories for Cambricon_BangC_Practice
Users that are interested in Cambricon_BangC_Practice are comparing it to the libraries listed below
- Chen Yunji's Intelligent Computing Systems course labs, one-click run☆40Updated 4 years ago
- Implement custom operators in PyTorch with cuda/c++☆62Updated 2 years ago
- Homework notes for the Intelligent Computing Systems course (Chen Yunji)☆55Updated 4 years ago
- Cambricon-Test for BANG, homework☆5Updated 5 years ago
- ☆10Updated 4 months ago
- ☆60Updated 11 months ago
- ☆35Updated last year
- ☆36Updated 2 years ago
- ☆44Updated 3 years ago
- ☆45Updated 5 years ago
- hands on model tuning with TVM and profile it on a Mac M1, x86 CPU, and GTX-1080 GPU.☆48Updated last year
- Pytorch implementation of TPAMI 2022 -- 1xN Pattern for Pruning Convolutional Neural Networks☆43Updated 2 years ago
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616☆132Updated last year
- ☆38Updated 2 years ago
- Code and notes for six major CUDA parallel computing patterns☆61Updated 4 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆54Updated 2 years ago
- My paper/code reading notes in Chinese☆46Updated last year
- Code for ICML 2021 submission☆34Updated 4 years ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆46Updated last year
- code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"☆28Updated 4 years ago
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…☆122Updated last year
- A benchmark suited especially for deep learning operators☆42Updated 2 years ago
- SpInfer: Leveraging Low-Level Sparsity for Efficient Large Language Model Inference on GPUs☆47Updated 2 months ago
- PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity☆108Updated last week
- Deep Learning Accelerate Knowledge Review☆35Updated 5 years ago
- My solutions to the assignments of CMU 10-714 Deep Learning Systems 2022☆37Updated last year
- play gemm with tvm☆91Updated last year
- ☆112Updated last year
- SGEMM optimization with cuda step by step☆19Updated last year
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆25Updated 2 years ago