inskil / Cambricon-Test
Cambricon-Test for BANG , homework
☆5Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for Cambricon-Test
- 智能计算系统实验 在Cambricon编程平台上实现用BangC实现五个算子☆30Updated 4 years ago
- Canvas: End-to-End Kernel Architecture Search in Neural Networks☆13Updated last week
- Code for ICML 2021 submission☆35Updated 3 years ago
- 智能计算系统课程(陈云霁)课后作业记录☆55Updated 4 years ago
- ☆14Updated 2 years ago
- ArkVale: Efficient Generative LLM Inference with Recallable Key-Value Eviction (NIPS'24)☆19Updated last week
- ☆41Updated 2 years ago
- ☆9Updated 5 months ago
- ☆18Updated 11 months ago
- Official PyTorch implementation of IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact☆33Updated 6 months ago
- ☆45Updated 4 years ago
- code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"☆26Updated 4 years ago
- 陈云霁 智能计算系统 课后实验 一键运行☆40Updated 3 years ago
- ☆17Updated 3 years ago
- Binary neural networks developed by Huawei Noah's Ark Lab☆29Updated 3 years ago
- ☆13Updated 3 years ago
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆49Updated last year
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆23Updated last year
- SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models☆13Updated last month
- ☆41Updated 7 months ago
- This is a repo which contains some details about how to use OpenCL backend (Xilinx/Intel).☆24Updated 5 years ago
- [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning☆76Updated 2 months ago
- Personal Digest of NAS (Under Construction 🛠)☆25Updated 4 years ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆42Updated last year
- Implement some method of LLM KV Cache Sparsity☆24Updated 5 months ago
- ☆34Updated 2 years ago
- My solutions to the assignments of CMU 10-714 Deep Learning Systems 2022☆34Updated 8 months ago
- ATC23 AE☆43Updated last year
- ☆36Updated 3 months ago
- Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616☆129Updated last year