pytorch-profiler
☆49Jun 1, 2023Updated 2 years ago
Alternatives and similar repositories for flops-profiler
Users that are interested in flops-profiler are comparing it to the libraries listed below.
- [ICLR 2025] Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception☆14Jul 4, 2025Updated 9 months ago
- ☆15Nov 12, 2023Updated 2 years ago
- PArametrized Recommendation and Ai Model benchmark is a repository for development of numerous uBenchmarks as well as end to end nets for…☆153Apr 1, 2026Updated last week
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models.☆21Jul 13, 2022Updated 3 years ago
- Official implementation for the paper Lancet: Accelerating Mixture-of-Experts Training via Whole Graph Computation-Communication Overlapp…☆14Nov 17, 2025Updated 4 months ago
- a high performance system for customized-precision distributed deep learning☆12Dec 10, 2020Updated 5 years ago
- ☆14Mar 3, 2026Updated last month
- Quantized Attention on GPU☆44Nov 22, 2024Updated last year
- ☆24Oct 30, 2024Updated last year
- ☆18Apr 21, 2024Updated last year
- PiCAS executor + ROS 2 Real-Time Working Group's reference system☆12Oct 4, 2023Updated 2 years ago
- ☆44Jul 4, 2024Updated last year
- Microsoft Collective Communication Library☆66Nov 23, 2024Updated last year
- Deduplication over dis-aggregated memory for Serverless Computing☆14Mar 21, 2022Updated 4 years ago
- ☆65Apr 26, 2025Updated 11 months ago
- Example codes appears in lectures☆22Jan 11, 2022Updated 4 years ago
- Nsight Compute In Docker☆13Dec 21, 2023Updated 2 years ago
- ☆11Apr 18, 2021Updated 4 years ago
- Estimating neural network runtime characteristics☆12Mar 25, 2023Updated 3 years ago
- torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.☆25Mar 29, 2024Updated 2 years ago
- ☆15Apr 18, 2023Updated 2 years ago
- Official Pytorch Implementation of Length-Adaptive Transformer (ACL 2021)☆102Nov 2, 2020Updated 5 years ago
- Python package for automatic contraction of tensor networks.☆13Feb 17, 2020Updated 6 years ago
- SmartTLS is the project introduced at the paper "A Case for SmartNIC-accelerated Private Communication" (APNET 20). It accelerates web se…☆17Feb 20, 2025Updated last year
- An experimental implementation of compiler-driven automatic sharding of models across a given device mesh.☆59Updated this week
- Homepage for the Data Interaction Group at CMU☆13Updated this week
- UT Campus Object Dataset (CODa): Models for 3D Object Detection☆17Feb 4, 2025Updated last year
- Parallel construction of binary radix trees, implemented from an nVidia paper.☆20May 12, 2020Updated 5 years ago
- Odysseus: Playground of LLM Sequence Parallelism☆78Jun 17, 2024Updated last year
- Effective Attention Sheds Light On Interpretability - Findings of ACL2021☆11May 16, 2021Updated 4 years ago
- zombie game☆11Apr 19, 2019Updated 6 years ago
- ☆28Mar 12, 2026Updated 3 weeks ago
- Know Your Enemy To Save Cloud Energy: Energy-Performance Characterization of Machine Learning Serving (HPCA '23)☆14Jun 20, 2025Updated 9 months ago
- Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and …☆37Aug 29, 2025Updated 7 months ago
- Official pytorch implementation for CVPR2022 paper "Bootstrapping ViTs: Towards Liberating Vision Transformers from Pre-training"☆18Apr 11, 2022Updated 3 years ago
- ☆55Apr 2, 2026Updated last week
- ☆12Oct 9, 2023Updated 2 years ago
- An auxiliary project analysis of the characteristics of KV in DiT Attention.☆34Nov 29, 2024Updated last year
- CUPTI GPU Profiler☆40Feb 26, 2019Updated 7 years ago