kulinseth / pytorchLinks
Tensors and Dynamic neural networks in Python with strong GPU acceleration
☆19Updated 7 months ago
Alternatives and similar repositories for pytorch
Users that are interested in pytorch are comparing it to the libraries listed below
Sorting:
- ☆79Updated last year
- Making Flux go brrr on GPUs.☆161Updated last month
- ☆27Updated last year
- [ACL 2023] The official implementation of "CAME: Confidence-guided Adaptive Memory Optimization"☆96Updated 10 months ago
- Writing FLUX in Triton☆41Updated last year
- ☆39Updated last year
- Simple large-scale training of stable diffusion with multi-node support.☆133Updated 2 years ago
- Triton kernels for Flux☆22Updated 7 months ago
- A block oriented training approach for inference time optimization.☆34Updated last year
- Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested.☆55Updated 2 years ago
- Focused on fast experimentation and simplicity☆80Updated last year
- PyTorch implementation for "Parallel Sampling of Diffusion Models", NeurIPS 2023 Spotlight☆153Updated 2 years ago
- Patch convolution to avoid large GPU memory usage of Conv2D☆95Updated last year
- 🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× …☆101Updated 5 months ago
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu☆78Updated last year
- research impl of Native Sparse Attention (2502.11089)☆63Updated 11 months ago
- ☆91Updated 2 years ago
- Official repository for VQDM:Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization paper☆34Updated last year
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch☆50Updated 3 years ago
- TerDiT: Ternary Diffusion Models with Transformers☆74Updated last year
- [ICLR 2025] Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆104Updated last year
- ☆52Updated 2 years ago
- Official implementation for SSDD Single-Step Diffusion Decoder for Efficient Image Tokenization.☆53Updated 2 months ago
- FID computation in Jax/Flax.☆29Updated last year
- Just another reasonably minimal repo for class-conditional training of pixel-space diffusion transformers.☆143Updated 8 months ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆85Updated last year
- The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models☆103Updated last year
- ☆91Updated last year
- [arXiv] On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices☆131Updated 2 months ago
- ☆53Updated 2 years ago