kulinseth / pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
☆19 Updated 7 months ago
Alternatives and similar repositories for pytorch
Users who are interested in pytorch are comparing it to the libraries listed below.
- ☆79 Updated last year
- ☆17 Updated last year
- A block oriented training approach for inference time optimization. ☆34 Updated last year
- Simple large-scale training of stable diffusion with multi-node support. ☆133 Updated 2 years ago
- [ACL 2023] The official implementation of "CAME: Confidence-guided Adaptive Memory Optimization" ☆96 Updated 10 months ago
- ☆39 Updated last year
- Making Flux go brrr on GPUs. ☆161 Updated last month
- CATransformers is a framework for joint neural network and hardware architecture search. ☆20 Updated 9 months ago
- FID computation in Jax/Flax. ☆29 Updated last year
- ☆52 Updated 2 years ago
- 🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× … ☆101 Updated 5 months ago
- Focused on fast experimentation and simplicity ☆80 Updated last year
- Tiny AutoEncoder for Stable Diffusion Videos ☆36 Updated last year
- PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight ☆153 Updated 2 years ago
- ☆30 Updated last year
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu ☆78 Updated last year
- TerDiT: Ternary Diffusion Models with Transformers ☆74 Updated last year
- Patch convolution to avoid large GPU memory usage of Conv2D ☆95 Updated last year
- Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested. ☆55 Updated 2 years ago
- Experimental CUDA kernel framework unifying typed dimensions, NVRTC JIT specialization, and ML‑guided tuning. ☆46 Updated last week
- ☆48 Updated 11 months ago
- This repository contains the experimental PyTorch native float8 training UX ☆227 Updated last year
- Code for the paper "Interpreting and Improving Diffusion Models from an Optimization Perspective", appearing in ICML 2024 ☆14 Updated last year
- ☆53 Updated 2 years ago
- Faster generation with text-to-image diffusion models. ☆230 Updated 7 months ago
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆47 Updated 6 months ago
- A demo for the Direct Ascent Synthesis: Hidden Generative Capabilities in Discriminative Models paper (https://arxiv.org/abs/2502.07753) ☆41 Updated 11 months ago
- faster parallel inference of mochi-1 video generation model ☆125 Updated 11 months ago
- ☆307 Updated this week
- Official implementation for Training LLMs with MXFP4 ☆118 Updated 9 months ago