bertmaher/tf32_gemm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bertmaher/tf32_gemm)

bertmaher / tf32_gemm

Example of binding a TF32 CUTLASS GEMM kernel to PyTorch

☆12

Alternatives and similar repositories for tf32_gemm

Users that are interested in tf32_gemm are comparing it to the libraries listed below

Sorting:

bertmaher / llama2.so
View on GitHub
Inference Llama 2 with a model compiled to native code by TorchInductor
☆14Feb 8, 2024Updated 2 years ago
manishucsd / py-codegen
View on GitHub
☆16Sep 24, 2024Updated last year
cchan / tccl
View on GitHub
extensible collectives library in triton
☆95Mar 31, 2025Updated 11 months ago
hrvach / yubikill
View on GitHub
Disable YubiKey output on MacOS without a modifier key pressed
☆10Aug 10, 2022Updated 3 years ago
h-ohsaki / chatgpt-el
View on GitHub
access ChatGPT/Gemini/Claude from Emacs without APIs
☆10Dec 25, 2025Updated 2 months ago
meta-pytorch / kraken
View on GitHub
Triton-based Symmetric Memory operators and examples
☆85Jan 15, 2026Updated last month
Mogball / triton_lite
View on GitHub
☆20May 24, 2025Updated 9 months ago
InterDigitalInc / NeoRadium
View on GitHub
A Python Library for the 3GPP physical layer
☆14Dec 18, 2025Updated 2 months ago
UCL / composite-actions
View on GitHub
Templates for commonly used GitHub actions steps
☆13Dec 13, 2024Updated last year
ananyahjha93 / libself
View on GitHub
PyTorch Lightning based framework to run experiments for self-supervised learning tasks.
☆10Feb 14, 2020Updated 6 years ago
kyolebu / janestreet-gpumode-hackathon
View on GitHub
1st Place Team Crane: @aswinkumar1999 @rathull @kyolebu
☆29Sep 8, 2025Updated 5 months ago
James-QiuHaoran / Tools
View on GitHub
This repository consists of useful tools or guides for system software development or anything interesting.
☆11Updated this week
malfet / llm_experiments
View on GitHub
☆12Aug 26, 2025Updated 6 months ago
PASAUCMerced / Sentinel
View on GitHub
Efficient-Tensor-Management-on-HM-for-Deep-Learning
☆10Nov 15, 2021Updated 4 years ago
WJMatthew / MovieLens-EDA
View on GitHub
Analysis of the MovieLens dataset of movie ratings and reviews.
☆11Sep 2, 2018Updated 7 years ago
nicksypark / rope-triton
View on GitHub
☆15Mar 30, 2024Updated last year
pocke / xrandr-mirror
View on GitHub
Mirroring displays on Linux
☆13Aug 22, 2016Updated 9 years ago
loopspace / mathgrep
View on GitHub
A perl script for searching and replacing in mathematics in LaTeX documents.
☆13Jul 21, 2021Updated 4 years ago
natolambert / job-search-viz
View on GitHub
A tool for visualization of complex job searches.
☆13Jul 8, 2022Updated 3 years ago
Qualcomm-AI-research / dynamic-sparsity
View on GitHub
☆15Mar 26, 2025Updated 11 months ago
thomasahle / kanmlps
View on GitHub
KANs and MLPs
☆12Jun 7, 2024Updated last year
ecrcentral / postdoc-funding-schemes
View on GitHub
Funding schemes and travel grant opportunities for postdocs
☆10Jun 2, 2018Updated 7 years ago
beamform / pacmap-rs
View on GitHub
Pairwise Controlled Manifold Approximation (PaCMAP) for dimensionality reduction
☆20Feb 3, 2026Updated 3 weeks ago
Keiku / PyTorch-Lightning-CIFAR10
View on GitHub
"Not too complicated" training code for CIFAR-10 by PyTorch Lightning
☆12Jun 5, 2022Updated 3 years ago
nadavrot / fast_log
View on GitHub
A fast implementation of log() and exp()
☆57Dec 14, 2022Updated 3 years ago
nakulbhati / creative-profile-readme
View on GitHub
A Collection of GitHub Profiles with awesome readme
☆14Aug 17, 2023Updated 2 years ago
swt-user / WWW_2023_code
View on GitHub
☆16Jun 15, 2023Updated 2 years ago
therealnidhin / CRF-image-segmentation
View on GitHub
☆12Jul 6, 2017Updated 8 years ago
ebecht / DR_benchmark
View on GitHub
☆16Feb 23, 2021Updated 5 years ago
RuifMaxx / Paper-List-of-cloud-resource-management
View on GitHub
☆15Dec 29, 2022Updated 3 years ago
smoser / pdftk
View on GitHub
Unofficial mirror of pdftk - imported using git-ubuntu
☆10Aug 20, 2018Updated 7 years ago
Borda / pyRepoStats
View on GitHub
Simple repository contribution statistics
☆15Updated this week
lhb8125 / Megatron-LM
View on GitHub
Ongoing research training transformer models at scale
☆18Updated this week
GeomScale / gsoc20
View on GitHub
GeomScale in Google Summer of Code 2020
☆13Jan 14, 2020Updated 6 years ago
PasaLab / Liquid
View on GitHub
Intelligent Resource Requirement Estimation and Scheduling for Deep Learning Jobs on Distributed GPU Clusters
☆15Nov 18, 2021Updated 4 years ago
tchaton / pytorch2lightning
View on GitHub
☆15Aug 3, 2021Updated 4 years ago
hpdps-group / ICS23-GPULZ
View on GitHub
GPULZ: Optimizing LZSS Lossless Compression for Multi-byte Data on Modern GPUs
☆16Apr 18, 2025Updated 10 months ago
openteams-ai / torch-build
View on GitHub
Collection of scripts to build PyTorch and the domain libraries from source.
☆13Feb 4, 2026Updated 3 weeks ago
grondo / io-watchdog
View on GitHub
Monitor processes and parallel workloads for hangs
☆16Dec 27, 2019Updated 6 years ago