NUS-HPC-AI-Lab / Neural-Network-Parameter-Diffusion

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

☆836

Related projects ⓘ

Alternatives and complementary repositories for Neural-Network-Parameter-Diffusion

LTH14 / rcg
PyTorch implementation of RCG https://arxiv.org/abs/2312.03701
☆839Updated last month
showlab / Show-o
Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.
☆1,033Updated this week
LTH14 / mar
PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838
☆1,029Updated last month
lichao-sun / SoraReview
The official GitHub page for the review paper "Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision M…
☆491Updated 8 months ago
lucidrains / transfusion-pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
☆734Updated this week
TencentARC / Open-MAGVIT2
Open-MAGVIT2: Democratizing Autoregressive Visual Generation
☆705Updated last month
NUS-HPC-AI-Lab / InfoBatch
Lossless Training Speed Up by Unbiased Dynamic Data Pruning
☆318Updated last month
FoundationVision / LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,327Updated 3 months ago
baofff / U-ViT
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
☆923Updated last year
sihyun-yu / REPA
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆674Updated last week
bytedance / 1d-tokenizer
This repo contains the code for 1D tokenizer and generator
☆554Updated this week
lucidrains / magvit2-pytorch
Implementation of MagViT2 Tokenizer in Pytorch
☆564Updated last month
sail-sg / MDT
Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)
☆528Updated 7 months ago
jingyi0000 / VLM_survey
Collection of AWESOME vision-language models for vision tasks
☆2,513Updated 3 weeks ago
csuhan / OneLLM
[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language
☆592Updated last month
willisma / SiT
Official PyTorch Implementation of "SiT: Exploring Flow and Diffusion-based Generative Models with Scalable Interpolant Transformers"
☆692Updated 8 months ago
Meituan-AutoML / VisionLLaMA
VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks
☆367Updated 4 months ago
HarborYuan / ovsam
[ECCV 2024] The official code of paper "Open-Vocabulary SAM".
☆950Updated 3 months ago
LTH14 / mage
A PyTorch implementation of MAGE: MAsked Generative Encoder to Unify Representation Learning and Image Synthesis
☆531Updated last year
test-time-training / ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
☆1,048Updated 4 months ago
SiatMMLab / Awesome-Diffusion-Model-Based-Image-Editing-Methods
Diffusion Model-Based Image Editing: A Survey (arXiv)
☆487Updated this week
jy0205 / LaVIT
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
☆535Updated last month
radarFudan / Awesome-state-space-models
Collection of papers on state-space models
☆556Updated 2 weeks ago
NVlabs / DiffiT
[ECCV 2024] Official Repository for DiffiT: Diffusion Vision Transformers for Image Generation
☆459Updated 3 weeks ago
baaivision / Emu3
Next-Token Prediction is All You Need
☆1,832Updated last month
FoundationVision / VAR
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scala…
☆4,281Updated last month
gnobitab / RectifiedFlow
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
☆960Updated 4 months ago
OpenGVLab / VideoMamba
[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding
☆842Updated 4 months ago
SunzeY / AlphaCLIP
[CVPR 2024] Alpha-CLIP: A CLIP Model Focusing on Wherever You Want
☆706Updated 3 months ago
Lupin1998 / Awesome-MIM
[Survey] Masked Modeling for Self-supervised Representation Learning on Vision and Beyond (https://arxiv.org/abs/2401.00897)
☆303Updated last month