pytorch-profiler
☆49Jun 1, 2023Updated 2 years ago
Alternatives and similar repositories for flops-profiler
Users that are interested in flops-profiler are comparing it to the libraries listed below
- ☆15Nov 12, 2023Updated 2 years ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆152Mar 9, 2026Updated last week
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- a high performance system for customized-precision distributed deep learning☆12Dec 10, 2020Updated 5 years ago
- ☆14Mar 3, 2026Updated 2 weeks ago
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- Hi-Speed DNN Training with Espresso: Unleashing the Full Potential of Gradient Compression with Near-Optimal Usage Strategies (EuroSys '2…☆15Sep 21, 2023Updated 2 years ago
- ☆12Jan 10, 2016Updated 10 years ago
- Sequence-level 1F1B schedule for LLMs.☆38Aug 26, 2025Updated 6 months ago
- ☆18Apr 21, 2024Updated last year
- PiCAS executor + ROS 2 Real-Time Working Group's reference system☆12Oct 4, 2023Updated 2 years ago
- ☆17Jan 15, 2026Updated 2 months ago
- ☆25Nov 10, 2025Updated 4 months ago
- Microsoft Collective Communication Library☆66Nov 23, 2024Updated last year
- Deduplication over dis-aggregated memory for Serverless Computing☆14Mar 21, 2022Updated 3 years ago
- ☆65Apr 26, 2025Updated 10 months ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆39Dec 22, 2025Updated 2 months ago
- Example code that appears in lectures☆22Jan 11, 2022Updated 4 years ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- A library to analyze PyTorch traces.☆474Updated this week
- Estimating neural network runtime characteristics☆12Mar 25, 2023Updated 2 years ago
- 🌈 The Bangumi extension for VSCode. Her data source came from Bilibili. [Maintenance phase]☆12Oct 7, 2023Updated 2 years ago
- ☆15Apr 18, 2023Updated 2 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆102Nov 2, 2020Updated 5 years ago
- A comic-style image generation tool based on AnimeGAN2 + serverless + NAS storage (demo no longer available)☆12May 11, 2022Updated 3 years ago
- Python package for automatic contraction of tensor networks.☆13Feb 17, 2020Updated 6 years ago
- Torch Distributed Experimental☆117Aug 5, 2024Updated last year
- easyanimate generate videos with ExLlamaV2 quantization LLM prompt☆13Jun 26, 2024Updated last year
- SmartTLS is the project introduced at the paper "A Case for SmartNIC-accelerated Private Communication" (APNET 20). It accelerates web se…☆17Feb 20, 2025Updated last year
- UT Campus Object Dataset (CODa): Models for 3D Object Detection☆17Feb 4, 2025Updated last year
- [Nature Communications 2023] "Wearable in-sensor reservoir computing using optoelectronic polymers with through-space charge-transport ch…☆16Jan 11, 2023Updated 3 years ago
- Odysseus: Playground of LLM Sequence Parallelism☆79Jun 17, 2024Updated last year
- Implementation of the unary leapfrog join for efficient intersection of sorted sets.☆10Dec 4, 2019Updated 6 years ago
- Repository for the paper "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆15Aug 26, 2020Updated 5 years ago
- Effective Attention Sheds Light On Interpretability - Findings of ACL2021☆11May 16, 2021Updated 4 years ago
- Know Your Enemy To Save Cloud Energy: Energy-Performance Characterization of Machine Learning Serving (HPCA '23)☆14Jun 20, 2025Updated 9 months ago
- Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and …☆37Aug 29, 2025Updated 6 months ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆42Jan 15, 2024Updated 2 years ago
- C++17 implementation of einops for libtorch - clear and reliable tensor manipulations with einstein-like notation☆11Oct 16, 2023Updated 2 years ago