☆33Nov 4, 2024Updated last year
Alternatives and similar repositories for repa-rf
Users that are interested in repa-rf are comparing it to the libraries listed below
Sorting:
- ☆48Feb 23, 2025Updated last year
- research impl of Native Sparse Attention (2502.11089)☆63Feb 19, 2025Updated last year
- Focused on fast experimentation and simplicity☆80Dec 24, 2024Updated last year
- ☆23Jun 18, 2024Updated last year
- A spoken version of the textual story cloze benchmark☆20Aug 6, 2023Updated 2 years ago
- ☆30Dec 2, 2024Updated last year
- ☆28Oct 7, 2025Updated 5 months ago
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆18Jul 24, 2025Updated 7 months ago
- Tiny AutoEncoder for Stable Diffusion Videos☆36Oct 5, 2024Updated last year
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆86Jul 28, 2024Updated last year
- ☆53Jan 6, 2024Updated 2 years ago
- Minimal implementation of scalable rectified flow transformers, based on SD3's approach☆635Jul 1, 2024Updated last year
- ☆34Sep 10, 2024Updated last year
- These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning☆48Sep 3, 2023Updated 2 years ago
- WIP☆94Aug 13, 2024Updated last year
- PyTorch interface for TrueGrad Optimizers☆43Aug 8, 2023Updated 2 years ago
- [ICCV 2023] Going Beyond Nouns With Vision & Language Models Using Synthetic Data☆13Sep 30, 2023Updated 2 years ago
- ☆19Dec 4, 2025Updated 3 months ago
- Train VAE like a boss☆313Oct 21, 2024Updated last year
- LCM Full Cycle Trainer for Ostris - Ai Toolkit☆16Aug 20, 2024Updated last year
- Train transformer language models with reinforcement learning.☆20Dec 26, 2023Updated 2 years ago
- ☆30Oct 7, 2024Updated last year
- Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?☆17Jun 3, 2025Updated 9 months ago
- Efficient optimizers☆294Updated this week
- ☆92Jul 5, 2024Updated last year
- ☆18Aug 24, 2024Updated last year
- supporting pytorch FSDP for optimizers☆84Dec 8, 2024Updated last year
- ☆23Oct 15, 2024Updated last year
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- ☆16Jul 8, 2024Updated last year
- An official PyTorch implementation for CLIPPR☆30Jul 22, 2023Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Sep 11, 2025Updated 6 months ago
- JAX port of FLUX.1 models using flax.nnx☆24Sep 28, 2024Updated last year
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"☆132Feb 4, 2026Updated last month
- Official Code for MIMETIC^2☆13Nov 19, 2024Updated last year
- Writing FLUX in Triton☆42Sep 22, 2024Updated last year
- Some minimal implementation of some Diffusion Models. Try to use as less code and as simple arch as possible