Adamdad / neumeta
NeuMeta transforms neural networks by allowing a single model to adapt on the fly to different sizes, generating the right weights when needed.
☆42Updated 5 months ago
Alternatives and similar repositories for neumeta:
Users who are interested in neumeta are comparing it to the libraries listed below.
- The official repo of continuous speculative decoding☆26Updated last month
- ☆70Updated 5 months ago
- RS-IMLE☆38Updated 5 months ago
- ☆31Updated 3 months ago
- Official implementation of ECCV24 paper: POA☆24Updated 9 months ago
- ☆33Updated 6 months ago
- 🦾 EvalGIM (pronounced as "EvalGym") is an evaluation library for generative image models. It enables easy-to-use, reproducible automatic…☆73Updated 4 months ago
- Sparse Autoencoders for Stable Diffusion XL models.☆55Updated last month
- Minimal Implementation of Visual Autoregressive Modelling (VAR)☆32Updated last month
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"☆31Updated 3 weeks ago
- Official implementation of the paper The Hidden Language of Diffusion Models☆72Updated last year
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆21Updated 4 months ago
- Remasking Discrete Diffusion Models with Inference-Time Scaling☆18Updated last month
- ☆51Updated 10 months ago
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆137Updated 2 months ago
- ☆27Updated last year
- A big_vision-inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling.☆35Updated 10 months ago
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆36Updated last month
- Code for NOLA, an implementation of "NOLA: Compressing LoRA using Linear Combination of Random Basis"☆54Updated 8 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆22Updated this week
- ☆22Updated 10 months ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆94Updated last month
- ☆28Updated 9 months ago
- [ICLR 2025] Official PyTorch implementation of "Forgetting Transformer: Softmax Attention with a Forget Gate"☆97Updated 3 weeks ago
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆51Updated 3 months ago
- [ICLR 2025] Official PyTorch implementation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆101Updated last year
- Focused on fast experimentation and simplicity☆72Updated 4 months ago
- [ICLR 2025] Source code for paper "A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegr…☆75Updated 4 months ago
- Unofficial Implementation of Selective Attention Transformer☆16Updated 6 months ago
- ☆37Updated last year