NUS-HPC-AI-Lab / Neural-Network-Parameter-Diffusion
We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters
☆836Updated this week
Related projects ⓘ
Alternatives and complementary repositories for Neural-Network-Parameter-Diffusion
- PyTorch implementation of RCG https://arxiv.org/abs/2312.03701☆839Updated last month
- Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.☆1,033Updated this week
- PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838☆1,029Updated last month
- The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision M…☆491Updated 8 months ago
- Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI☆734Updated this week
- Open-MAGVIT2: Democratizing Autoregressive Visual Generation☆705Updated last month
- Lossless Training Speed Up by Unbiased Dynamic Data Pruning☆318Updated last month
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,327Updated 3 months ago
- A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".☆923Updated last year
- Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think☆674Updated last week
- This repo contains the code for 1D tokenizer and generator☆554Updated this week
- Implementation of MagViT2 Tokenizer in Pytorch☆564Updated last month
- Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)☆528Updated 7 months ago
- Collection of AWESOME vision-language models for vision tasks☆2,513Updated 3 weeks ago
- [CVPR 2024] OneLLM: One Framework to Align All Modalities with Language☆592Updated last month
- Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"☆692Updated 8 months ago
- VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks☆367Updated 4 months ago
- [ECCV 2024] The official code of paper "Open-Vocabulary SAM".☆950Updated 3 months ago
- A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis☆531Updated last year
- Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States☆1,048Updated 4 months ago
- Diffusion Model-Based Image Editing: A Survey (arXiv)☆487Updated this week
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content☆535Updated last month
- Collection of papers on state-space models☆556Updated 2 weeks ago
- [ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation☆459Updated 3 weeks ago
- Next-Token Prediction is All You Need☆1,832Updated last month
- [NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scala…☆4,281Updated last month
- Official Implementation of Rectified Flow (ICLR2023 Spotlight)☆960Updated 4 months ago
- [ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding☆842Updated 4 months ago
- [CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want☆706Updated 3 months ago
- [Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)☆303Updated last month