betacord / PSILinks
☆10Updated 8 months ago
Alternatives and similar repositories for PSI
Users that are interested in PSI are comparing it to the libraries listed below
Sorting:
- Kick-off repository for starting with Kaggle!☆12Updated 8 months ago
- ☆27Updated 7 months ago
- Efficient optimizers☆254Updated 2 weeks ago
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆603Updated last year
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆560Updated last year
- ☆47Updated 5 months ago
- Official repository for our work on micro-budget training of large-scale diffusion models.☆1,505Updated 7 months ago
- Text to Image Latent Diffusion using a Transformer core☆198Updated 11 months ago
- supporting pytorch FSDP for optimizers☆84Updated 8 months ago
- Naively combining transformers and Kolmogorov-Arnold Networks to learn and experiment☆36Updated last year
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI☆289Updated 2 months ago
- UNet diffusion model in pure CUDA☆615Updated last year
- Getting crystal-like representations with harmonic loss☆194Updated 4 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds☆278Updated 3 weeks ago
- Train VAE like a boss☆292Updated 9 months ago
- The repository for the code of the UltraFastBERT paper☆517Updated last year
- Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793☆434Updated 3 months ago
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate"☆428Updated 8 months ago
- Sparsify transformers with SAEs and transcoders☆604Updated this week
- ☆14Updated last year
- My annotated papers and meeting recordings for the EleutherAI ML Performance research paper reading group☆19Updated last month
- ☆128Updated last year
- Annotated version of the Mamba paper☆487Updated last year
- A Compressed Stable Diffusion for Efficient Text-to-Image Generation [ECCV'24]☆295Updated last year
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models.☆812Updated 2 weeks ago
- Implementation of Stable Diffusion with PyTorch☆347Updated 5 months ago
- Huggingface-compatible SDXL Unet implementation that is readily hackable☆425Updated 2 years ago
- DeMo: Decoupled Momentum Optimization☆190Updated 8 months ago
- When it comes to optimizers, it's always better to be safe than sorry☆356Updated this week
- WIP☆94Updated last year