Simple PyTorch profiler that combines DeepSpeed Flops Profiler and TorchInfo
☆11Feb 12, 2023Updated 3 years ago
Alternatives and similar repositories for torch_profiler
Users that are interested in torch_profiler are comparing it to the libraries listed below
- DeepSZ: A Novel Framework to Compress Deep Neural Networks by Using Error-Bounded Lossy Compression☆11Oct 7, 2020Updated 5 years ago
- Repo for Performance Interfaces for Hardware Accelerators.☆16Aug 19, 2025Updated 6 months ago
- ☆13Feb 22, 2023Updated 3 years ago
- Artifact for "Shockwave: Fair and Efficient Cluster Scheduling for Dynamic Adaptation in Machine Learning" [NSDI '23]☆47Nov 24, 2022Updated 3 years ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- ☆18Mar 15, 2020Updated 5 years ago
- Linux on RISC-V on FPGA (LOROF): RV64GC Sv39 Quad-Core Superscalar Out-of-Order Virtual Memory CPU☆15Feb 23, 2026Updated last week
- ☆22Nov 7, 2018Updated 7 years ago
- A collection of different PyTorch wrappers for training neural networks and reinforcement algorithms☆13Dec 15, 2022Updated 3 years ago
- Tutorial: Writing R and Python Packages with Multithreaded C++ Code using BLAS, AVX2/AVX512, OpenMP, C++11 Threads and Cuda GPU accelerat…☆13Nov 27, 2022Updated 3 years ago
- ☆12Aug 12, 2022Updated 3 years ago
- A Doom Source Port based on SDL2☆29Jan 30, 2024Updated 2 years ago
- Notes on Advanced Placement Physics C: Electricity and Magnetism☆12May 13, 2019Updated 6 years ago
- A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup☆35Jan 9, 2023Updated 3 years ago
- ML Input Data Processing as a Service. This repository contains the source code for Cachew (built on top of TensorFlow).☆40Sep 10, 2024Updated last year
- Some microbenchmarks and design docs before commencement☆12Feb 1, 2021Updated 5 years ago
- A just-for-fun riscv64 emulator that boots Linux.☆41Dec 14, 2022Updated 3 years ago
- Anchored Diffusion Language Model (NeurIPS 2025)☆27Oct 13, 2025Updated 4 months ago
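- 🚀 Train your own VLA end to end: train a 26M-parameter vision-language model (VLM) from scratch in just 1 hour!☆27Oct 16, 2025Updated 4 months ago
- 🌿 Quickly generate a folder's directory structure; supports configuring the directory depth and writing the output to a Markdown file.☆13Oct 19, 2022Updated 3 years ago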
- Simulator for a superscalar processor with dynamic scheduling and branch prediction☆15Nov 23, 2018Updated 7 years ago
- A classic embedded OS: fully annotated source of ucos-II version 2.52, for study and discussion only.☆12Oct 16, 2019Updated 6 years ago
- ☆12Jan 12, 2024Updated 2 years ago
- A Texas Holdem poker framework written in C++ 20.☆11Apr 23, 2023Updated 2 years ago
- A QA system for k8s-specific knowledge, built on ChatGLM2-6B and served by Ray.☆10Sep 14, 2023Updated 2 years ago
- Alpha64 R10000 Two-Way Superscalar Processor☆11May 6, 2019Updated 6 years ago
- Reinforcement Learning (PPO) applied to a multiplayer simple card game (Witches)☆10Jun 7, 2020Updated 5 years ago
- A Filesystem Semi-Microkernel.☆46Oct 24, 2023Updated 2 years ago
- Advanced_Data_Integration_Project☆11Jul 31, 2018Updated 7 years ago
- ☆13Jul 14, 2025Updated 7 months ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- Works for Applied Deep Learning / Machine Learning and Having It Deep and Structured (2017 FALL) @ NTU☆11Aug 14, 2018Updated 7 years ago
- A simple 4-bit CPU built with 74-series chips, originally designed by Kaoru Tonami in his book “How to build a CPU”☆14Oct 19, 2024Updated last year
- ☆10Apr 14, 2020Updated 5 years ago
- ☆12Mar 1, 2024Updated 2 years ago
- This is the implementation of our paper entitled "Local Patch Network with Global Attention for Infrared Small Target Detection"☆10May 16, 2022Updated 3 years ago
- ☆16May 8, 2020Updated 5 years ago
- A Flexible Cache Architectural Simulator☆16Sep 16, 2025Updated 5 months ago
- Implementation of: Kristiadi, Agustinus, and Asja Fischer. "Predictive Uncertainty Quantification with Compound Density Networks." (2019)…☆16May 26, 2022Updated 3 years ago