NUS-HPC-AI-Lab / Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
☆844Updated last month
Alternatives and similar repositories for Neural-Network-Diffusion:
Users that are interested in Neural-Network-Diffusion are comparing it to the libraries listed below
- [ICLR 2025] Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,220Updated last week
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838☆1,285Updated 4 months ago
- PyTorch implementation of RCG https://arxiv.org/abs/2312.03701☆902Updated 4 months ago
- Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think (ICL…☆832Updated 3 weeks ago
- This repo contains the code for 1D tokenizer and generator☆691Updated last week
- SEED-Voken: A Series of Powerful Visual Tokenizers☆830Updated this week
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning☆327Updated 4 months ago
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)☆547Updated 9 months ago
- Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI☆950Updated 2 weeks ago
- The paper collections for the autoregressive models in vision.☆406Updated this week
- Official Implementation of Rectified Flow (ICLR2023 Spotlight)☆1,096Updated 7 months ago
- Implementation of MagViT2 Tokenizer in Pytorch☆590Updated last month
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆764Updated 11 months ago
- [NeurIPS 2024]OmniTokenizer: one model and one weight for image-video joint tokenization.☆279Updated 7 months ago
- Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis☆968Updated this week
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content☆561Updated 4 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,576Updated 6 months ago
- Eagle Family: Exploring Model Designs, Data Recipes and Training Strategies for Frontier-Class Multimodal LLMs☆607Updated 3 weeks ago
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".☆969Updated last year
- [ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for …☆1,325Updated last year
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆243Updated 2 months ago
- [ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding☆903Updated 7 months ago
- code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆730Updated this week
- Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.☆2,116Updated this week
- [ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation☆717Updated 4 months ago
- [ECCV 2024] The official code of paper "Open-Vocabulary SAM".☆932Updated 6 months ago
- [Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)☆318Updated 4 months ago
- The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision M…☆495Updated 11 months ago
- Implementation of Autoregressive Diffusion in Pytorch☆356Updated 3 months ago