DuNGEOnmassster / awesome-customized-generative-AI
Papers and codes collection for customized, personalized and editable generative models
☆25Updated 4 months ago
Alternatives and similar repositories for awesome-customized-generative-AI:
Users that are interested in awesome-customized-generative-AI are comparing it to the libraries listed below
- [AAAI 2025] SeFAR: Semi-supervised Fine-grained Action Recognition with Temporal Perturbation and Learning Stabilization☆16Updated last month
- ☆11Updated 2 months ago
- This is the official implementation for ControlVAR.☆94Updated 2 months ago
- You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.☆290Updated last month
- Implements VAR+CLIP for text-to-image (T2I) generation☆119Updated 3 weeks ago
- [ICLR25] High-performance Image Tokenizers for VAR and AR☆194Updated this week
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆23Updated 8 months ago
- Training-Free Condition-Guided Text-to-Video Generation☆62Updated last year
- 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆249Updated last month
- 【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".☆31Updated 2 months ago
- The code for Fine-grained HBOE | AAAI 2024 (official version and optimized version).☆16Updated 10 months ago
- My implement of InstantBooth☆9Updated last year
- Accepted by CVPR 2024☆31Updated 9 months ago
- This is a repo to track the latest autoregressive visual generation papers.☆137Updated this week
- CAR: Controllable AutoRegressive Modeling for Visual Generation☆99Updated 2 months ago
- The paper collections for the autoregressive models in vision.☆396Updated this week
- [NAACL 2024] LaDiC: Are Diffusion Models Really Inferior to Autoregressive Counterparts for Image-to-text Generation?☆37Updated 8 months ago
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models☆154Updated 4 months ago
- VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆278Updated 3 weeks ago
- Official implementation for BroadWay: Boost Your Text-to-Video Generation Model in a Training-free Way☆26Updated 4 months ago
- The code and data of Paper: Towards World Simulator: Crafting Physical Commonsense-Based Benchmark for Video Generation☆84Updated 3 months ago
- A collection of vision foundation models unifying understanding and generation.☆40Updated last month
- “FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with an…☆86Updated last month
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆382Updated last week
- [CVPR 2024] Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners☆137Updated 7 months ago
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper☆142Updated 9 months ago
- ☆160Updated 7 months ago
- [CVPR 2024] On the Content Bias in Fréchet Video Distance☆102Updated 4 months ago
- 🔥stable, simple, state-of-the-art VQVAE toolkit & cookbook☆74Updated 7 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator☆94Updated 10 months ago