LukasHedegaard / pytorch-benchmarkView external linksLinks
Easily benchmark PyTorch model FLOPs, latency, throughput, allocated gpu memory and energy consumption
☆109Aug 25, 2023Updated 2 years ago
Alternatives and similar repositories for pytorch-benchmark
Users that are interested in pytorch-benchmark are comparing it to the libraries listed below
Sorting:
- [CF ’20] Verified Instruction-Level Energy Consumption Measurement for NVIDIA GPUs☆15Dec 11, 2020Updated 5 years ago
- Benchmarking PyTorch 2.0 different models☆20Mar 19, 2023Updated 2 years ago
- Complete code for linear and non-linear unmixing Hyperspectral images in Python.☆14Aug 1, 2021Updated 4 years ago
- Examples for interactive distributed machine learning on the Cori supercomputer☆11Nov 6, 2019Updated 6 years ago
- [WACV 2025] I Dream My Painting: Connecting MLLMs and Diffusion Models via Prompt Generation for Text-Guided Multi-Mask Inpainting☆16Dec 29, 2025Updated last month
- Jackson serializers for JTS Geometry objects☆14Jan 29, 2026Updated 2 weeks ago
- Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…☆17Nov 11, 2024Updated last year
- Page for the CVPR 2023 Tutorial - Efficient Neural Networks: From Algorithm Design to Practical Mobile Deployments☆12Jun 30, 2023Updated 2 years ago
- A GPU performance prediction toolkit for CUDA programs☆18Mar 25, 2019Updated 6 years ago
- ☆19Mar 18, 2021Updated 4 years ago
- Codebase for the paper "Beyond BatchNorm: Towards a Unified Understanding of Normalization in Deep Learning"☆17Jul 12, 2021Updated 4 years ago
- Python package for NN generation from physics☆15Mar 25, 2023Updated 2 years ago
- Spatial Spectral Machine Learning☆14Oct 15, 2025Updated 4 months ago
- A Leaderboard for Certifiable Robustness against Adversarial Patch Attacks☆20Oct 30, 2023Updated 2 years ago
- [TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA☆17Jul 7, 2022Updated 3 years ago
- ☆20Nov 29, 2021Updated 4 years ago
- PyCUDA based PyTorch Extension Made Easy☆26Mar 22, 2024Updated last year
- Code for the paper "Function-Space Learning Rates"☆25Jun 3, 2025Updated 8 months ago
- Official implementation of RDST. A residual dense swin transformer for medical image super-resolution☆19Mar 14, 2023Updated 2 years ago
- Performant parser for textual data (CSV parser)☆34Oct 28, 2018Updated 7 years ago
- Forest fire prediction using finetuning on CPU with MODIS and NAIP aerial photos and resnet with acceleration using Intel Extensions for …☆26Jun 17, 2024Updated last year
- Adaptive Local Implicit Image Function for Arbitrary-scale Super-resolution, accepted by the International Conference on Image Processing…☆21Nov 2, 2022Updated 3 years ago
- A library to benchmark CUDA code, similar to google benchmark.☆30Apr 18, 2021Updated 4 years ago
- LogicCircuit is a program that helps build/simulate simple circuits using logic gates. It is meant to teach people the basics of how logi…☆10Jan 22, 2025Updated last year
- Pytorch code for paper QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models☆25Sep 27, 2023Updated 2 years ago
- ☆23Oct 21, 2021Updated 4 years ago
- ☆35Apr 8, 2025Updated 10 months ago
- An efficient GPU support for LLM inference with x-bit quantization (e.g. FP6,FP5).☆276Jul 16, 2025Updated 7 months ago
- A database of over 1.4 billion 3x3 convolution filters extracted from hundreds of diverse CNN models with relevant meta information (CVPR…☆34Jun 28, 2023Updated 2 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆32Apr 2, 2025Updated 10 months ago
- GEMM and Winograd based convolutions using CUTLASS☆28Jul 15, 2020Updated 5 years ago
- DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training (ICLR 2023)☆32Apr 8, 2023Updated 2 years ago
- The package 'data-driven density estimation x' (dddex) turns any standard point forecasting model into an estimator of the underlying con…☆10Dec 1, 2025Updated 2 months ago
- ☆14Jun 24, 2024Updated last year
- 基于GSConv+SlimNeck的YOLOv5的消防通道占用检测系统☆10Nov 24, 2023Updated 2 years ago
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆40Mar 5, 2024Updated last year
- Physical Downlink Shared Channel (PDSCH) in 5G New Radio.☆12Jan 29, 2024Updated 2 years ago
- 本文提出了一种基于多视图卷积神经网络的三维物体识别算法,以实现三维物体的准确识别。首先实现一个标准的卷积神经网络架构,该架构经过训练可以独立地识别形状的渲染视图,以实现即使从单一视图中也可以识别出一个三维形状。随后使用该三维物体多个角度的二维视图通过卷积神经网络识别的结果进…☆11May 16, 2022Updated 3 years ago
- How to export PyTorch models with unsupported layers to ONNX and then to Intel OpenVINO☆28Feb 20, 2025Updated 11 months ago