HaoKang-Timmy / torchanalyse
A PyTorch model profiler that reports MACs, energy, and other metrics
☆13 Updated last year
Alternatives and similar repositories for torchanalyse:
Users who are interested in torchanalyse are comparing it to the libraries listed below.
- [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning ☆83 Updated 7 months ago
- ☆43 Updated 3 years ago
- ☆93 Updated last year
- An analytical framework that models hardware dataflow of tensor applications on spatial architectures using the relation-centric notation… ☆83 Updated 11 months ago
- ☆141 Updated 2 years ago
- ☆26 Updated 3 months ago
- [ICASSP'20] DNN-Chip Predictor: An Analytical Performance Predictor for DNN Accelerators with Various Dataflows and Hardware Architecture… ☆23 Updated 2 years ago
- Linux docker for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop ☆51 Updated 3 weeks ago
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware. ☆107 Updated 4 months ago
- EDA toolchain for processing-in-memory architectures, including an architecture synthesizer, a compiler, and a simulator ☆11 Updated 4 months ago
- ☆132 Updated 9 months ago
- ☆23 Updated last week
- ☆70 Updated 5 years ago
- ☆39 Updated 9 months ago
- Torch2Chip (MLSys, 2024) ☆51 Updated this week
- Tender: Accelerating Large Language Models via Tensor Decomposition and Runtime Requantization (ISCA'24) ☆14 Updated 9 months ago
- ViTALiTy (HPCA'23) Code Repository ☆21 Updated 2 years ago
- PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization ☆27 Updated last year
- ☆27 Updated 2 years ago
- An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences ☆25 Updated last year
- ☆40 Updated 5 months ago
- The official implementation of the DAC 2024 paper GQA-LUT ☆16 Updated 3 months ago
- ☆53 Updated last year
- ☆29 Updated last year
- ☆33 Updated 3 years ago
- ☆17 Updated 4 years ago
- FRAME: Fast Roofline Analytical Modeling and Estimation ☆34 Updated last year
- [HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design ☆104 Updated last year
- Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?" ☆19 Updated last year
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning" ☆24 Updated 2 years ago