Context Manager to profile the forward and backward times of PyTorch's nn.Module
☆83 · Oct 10, 2023 · Updated 2 years ago
Alternatives and similar repositories for torchnnprofiler
Users that are interested in torchnnprofiler are comparing it to the libraries listed below.
- Reproduction of the CVPR'21 paper "Distilling Knowledge via Knowledge Review" for the ML Reproducibility Challenge 2021 ☆11 · Apr 16, 2022 · Updated 4 years ago
- Our submission for the Microsoft membership inference competition at SaTML 2023 ☆15 · Apr 5, 2023 · Updated 3 years ago
- This is a question bank for practicing Machine Learning for Interviews. ☆33 · Oct 7, 2022 · Updated 3 years ago
- Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5 ☆16 · Sep 19, 2024 · Updated last year
- Winning solution of work done on model extraction over Vision Transformers such as Video-Swin-T and MoViNeT-A2-Base on Video Action-Recog… ☆21 · Mar 30, 2022 · Updated 4 years ago
- 📥 🎯 (1,4/4) an MLIR-based toolchain with Vitis HLS LLVM input/output targeting FPGAs. ☆15 · Nov 15, 2022 · Updated 3 years ago
- Code of "What Images are More Memorable to Machines?" ☆15 · Feb 13, 2023 · Updated 3 years ago
- Benchmark PyTorch Custom Operators ☆14 · Jul 6, 2023 · Updated 2 years ago
- The official evaluation suite and dynamic data release for MixEval. ☆11 · Sep 23, 2024 · Updated last year
- Code for paper "FuSeConv: Fully Separable Convolutions for Fast Inference on Systolic Arrays" published at DATE 2021 ☆18 · Aug 23, 2021 · Updated 4 years ago
- An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE ☆17 · Aug 5, 2022 · Updated 3 years ago
- ☆25 · Apr 3, 2023 · Updated 3 years ago
- SOTA Learning-augmented Systems ☆37 · May 21, 2022 · Updated 4 years ago
- ☆144 · Jan 30, 2025 · Updated last year
- Official PyTorch implementation of the paper: "Locally Shifted Attention With Early Global Integration" ☆15 · Dec 20, 2021 · Updated 4 years ago
- Differentiable Combinatorial Scheduling at Scale (ICML'24). Mingju Liu, Yingjie Li, Jiaqi Yin, Zhiru Zhang, Cunxi Yu. ☆22 · Oct 31, 2024 · Updated last year
- A PyTorch Dataset that caches samples in shared memory, accessible globally to all processes ☆25 · May 11, 2022 · Updated 4 years ago
- PyTorch compilation tutorial covering TorchScript, torch.fx, and Slapo ☆17 · Mar 13, 2023 · Updated 3 years ago
- ☆34 · May 28, 2023 · Updated 3 years ago
- Code accompanying ICCC 2019 Creative Submission paper - "ChordAL: A Chord-Based Approach for Music Generation using Bi-LSTMs". ☆19 · Mar 4, 2022 · Updated 4 years ago
- A simple framework for distributed reinforcement learning in PyTorch. ☆16 · Apr 24, 2020 · Updated 6 years ago
- A Python library for automatically solving Abstraction and Reasoning Corpus (ARC) challenges using Claude and object-centric modeling. ☆28 · Jan 6, 2025 · Updated last year
- ☆17 · Jul 23, 2025 · Updated 10 months ago
- The repository for the Deep Learning practicals of the ASCI Computer Vision by Learning course 2022 ☆14 · May 11, 2022 · Updated 4 years ago
- Wanwu models release, code will be released soon ☆24 · Aug 24, 2022 · Updated 3 years ago
- Computer Architecture - VLSI - Verilog Codes - Xilinx - Irsim ☆13 · May 8, 2021 · Updated 5 years ago
- An easy general acc. ☆18 · Mar 22, 2021 · Updated 5 years ago
- An extension library of WMMA API (Tensor Core API) ☆113 · Jul 12, 2024 · Updated last year
- ☆52 · May 19, 2025 · Updated last year
- Presents an optimized Apache Beam pipeline for generating sentence embeddings (runnable on Cloud Dataflow). ☆20 · Mar 7, 2022 · Updated 4 years ago
- ☆17 · Jul 2, 2021 · Updated 4 years ago
- ☆24 · Nov 18, 2025 · Updated 6 months ago
- ☆14 · Aug 29, 2023 · Updated 2 years ago
- A GPU performance profiling tool for PyTorch models ☆22 · Jul 5, 2022 · Updated 3 years ago
- Implementation of the Remixer Block from the Remixer paper, in PyTorch ☆36 · Sep 27, 2021 · Updated 4 years ago
- ☆14 · Jan 25, 2026 · Updated 4 months ago
- A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters. ☆960 · Updated this week
- DAC System Design Contest 2020 ☆29 · Jun 11, 2020 · Updated 6 years ago
- Python implementation of the random-walk inductive classification algorithm Modified Adsorption from P. Talukdar ☆15 · Jul 30, 2014 · Updated 11 years ago