RE-N-Y / saeLinks
☆17Updated 8 months ago
Alternatives and similar repositories for sae
Users that are interested in sae are comparing it to the libraries listed below
Sorting:
- ☆32Updated 9 months ago
- Sparse Autoencoders for Stable Diffusion XL models.☆69Updated 2 weeks ago
- ☆33Updated 7 months ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆55Updated 5 months ago
- ☆27Updated last year
- Minimal Differentiable Image Reward Functions☆80Updated 2 weeks ago
- ☆23Updated last year
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆52Updated 6 months ago
- Synthetic Alphabet Dataset☆19Updated 4 months ago
- Official codebase for Margin-aware Preference Optimization for Aligning Diffusion Models without Reference (MaPO).☆79Updated last year
- [ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction☆56Updated 2 months ago
- Implementation of the proposed MaskBit from Bytedance AI☆82Updated 8 months ago
- ☆53Updated last year
- RS-IMLE☆41Updated 8 months ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆82Updated 7 months ago
- ☆28Updated last year
- ☆41Updated 2 weeks ago
- ☆39Updated last year
- Recaption large (Web)Datasets with vllm and save the artifacts.☆52Updated 8 months ago
- ☆24Updated last year
- NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when n…☆43Updated 9 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆30Updated 3 months ago
- Focused on fast experimentation and simplicity☆76Updated 7 months ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆18Updated 9 months ago
- ☆24Updated 3 months ago
- ☆13Updated last year
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆36Updated 5 months ago
- A demo for the Direct Ascent Synthesis: Hidden Generative Capabilities in Discriminative Models paper (https://arxiv.org/abs/2502.07753)☆41Updated 5 months ago
- ☆10Updated last year
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆114Updated 4 months ago