betacord / PSI
☆10 · Updated 10 months ago
Alternatives and similar repositories for PSI
Users who are interested in PSI are comparing it to the repositories listed below.
- Kick-off repository for starting with Kaggle! ☆12 · Updated 9 months ago
- ☆27 · Updated 9 months ago
- Efficient optimizers ☆261 · Updated this week
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach ☆612 · Updated last year
- Tiny AutoEncoder for Stable Diffusion ☆781 · Updated 5 months ago
- Supporting PyTorch FSDP for optimizers ☆84 · Updated 9 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds ☆301 · Updated 2 months ago
- Annotated version of the Mamba paper ☆490 · Updated last year
- UNet diffusion model in pure CUDA ☆647 · Updated last year
- Implementation of Diffusion Transformer (DiT) in JAX ☆291 · Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand ☆194 · Updated 3 months ago
- Minimalistic, extremely fast, and hackable researcher's toolbench for GPT models in 307 lines of code. Reaches <3.8 validation loss on wi… ☆349 · Updated last year
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆576 · Updated last month
- Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript ☆601 · Updated last year
- This repository's goal is to precompile all past presentations of the Huggingface reading group ☆48 · Updated last year
- Simple Transformer in Jax ☆139 · Updated last year
- The Prodigy optimizer and its variants for training neural networks. ☆417 · Updated 8 months ago
- Deep learning for dummies. All the practical details and useful utilities that go into working with real models. ☆813 · Updated last month
- Official Implementation of "ADOPT: Modified Adam Can Converge with Any β2 with the Optimal Rate" ☆426 · Updated 9 months ago
- ☆36 · Updated 7 months ago
- A repository for log-time feedforward networks ☆223 · Updated last year
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients" ☆563 · Updated last year
- A simple implementation of Bayesian Flow Networks (BFN) ☆240 · Updated last year
- Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793 ☆437 · Updated 4 months ago
- RWKV, in easy to read code ☆71 · Updated 6 months ago
- ☆262 · Updated this week
- The AdEMAMix Optimizer: Better, Faster, Older. ☆186 · Updated last year
- ☆14 · Updated last year
- Text to Image Latent Diffusion using a Transformer core ☆208 · Updated last year
- Train VAE like a boss ☆292 · Updated 11 months ago