betacord / PSI
☆13Updated 2 months ago
Alternatives and similar repositories for PSI:
Users that are interested in PSI are comparing it to the libraries listed below
- Kick-off repository for starting with Kaggle!☆14Updated last month
- ☆29Updated last month
- Efficient optimizers☆154Updated this week
- Focused on fast experimentation and simplicity☆65Updated last month
- Train VAE like a boss☆254Updated 3 months ago
- supporting pytorch FSDP for optimizers☆75Updated last month
- LoRA and DoRA from Scratch Implementations☆195Updated 10 months ago
- Faster generation with text-to-image diffusion models.☆208Updated 3 months ago
- ☆125Updated last month
- ☆42Updated 2 weeks ago
- A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]☆279Updated 6 months ago
- ☆296Updated 7 months ago
- Code to reproduce "Transformers Can Do Arithmetic with the Right Embeddings", McLeish et al (NeurIPS 2024)☆183Updated 8 months ago
- A repository for log-time feedforward networks☆217Updated 9 months ago
- Pytorch code for paper QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models☆25Updated last year
- ☆192Updated last month
- The repository for the code of the UltraFastBERT paper☆514Updated 10 months ago
- Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees"☆358Updated 11 months ago
- Text to Image Latent Diffusion using a Transformer core☆160Updated 5 months ago
- ☆150Updated last month
- Implementation of DoRA☆288Updated 7 months ago
- Official implementation of Würstchen: Efficient Pretraining of Text-to-Image Models☆538Updated 9 months ago
- 94% on CIFAR-10 in 2.6 seconds 💨 96% in 27 seconds☆197Updated 2 months ago
- Implementation of Diffusion Transformer (DiT) in JAX☆261Updated 7 months ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024)☆64Updated 9 months ago
- Official PyTorch implementation of QA-LoRA☆122Updated 10 months ago
- Annotated version of the Mamba paper☆471Updated 11 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆406Updated last year
- Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead☆220Updated 3 weeks ago
- A pytorch quantization backend for optimum☆870Updated 3 weeks ago