HaoKang-Timmy / torchanalyse
A PyTorch model profiler that reports MACs, energy, etc.
☆13Updated last year
Alternatives and similar repositories for torchanalyse
Users that are interested in torchanalyse are comparing it to the libraries listed below
- ☆150Updated last year
- ☆175Updated last year
- PyTorch extension for emulating FP8 data formats on standard FP32 Xeon/GPU hardware.☆110Updated 8 months ago
- ☆67Updated last year
- Repository for artifact evaluation of ASPLOS 2023 paper "SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning"☆26Updated 2 years ago
- ☆15Updated 2 years ago
- ☆154Updated 2 years ago
- ☆117Updated 2 weeks ago
- ☆107Updated last year
- Torch2Chip (MLSys, 2024)☆53Updated 4 months ago
- some docs for rookies in nics-efc☆22Updated 3 years ago
- [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning☆100Updated 11 months ago
- This is a list of awesome edgeAI inference related papers.☆97Updated last year
- LLM Inference analyzer for different hardware platforms☆83Updated last month
- An analytical framework that models hardware dataflow of tensor applications on spatial architectures using the relation-centric notation…☆86Updated last year
- H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference☆46Updated 3 months ago
- Code Repository of Evaluating Quantized Large Language Models☆130Updated 11 months ago
- LLM serving cluster simulator☆108Updated last year
- MAGIS: Memory Optimization via Coordinated Graph Transformation and Scheduling for DNN (ASPLOS'24)☆53Updated last year
- SparseTIR: Sparse Tensor Compiler for Deep Learning☆137Updated 2 years ago
- ☆29Updated this week
- A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores☆52Updated last year
- PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization☆32Updated last year
- ☆49Updated 3 years ago
- ☆19Updated 10 months ago
- TileFlow is a performance analysis tool based on Timeloop for fusion dataflows☆61Updated last year
- PyTorch-Based Fast and Efficient Processing for Various Machine Learning Applications with Diverse Sparsity☆114Updated 3 weeks ago
- Automatic Mapping Generation, Verification, and Exploration for ISA-based Spatial Accelerators☆115Updated 2 years ago
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.☆91Updated last year
- PyTorch emulation library for Microscaling (MX)-compatible data formats☆277Updated last month