liujf69 / Classic-Generative-ModelLinks
Simple code demos about classic AIGC models/Compilation of blogs and papers on classic AIGC models.
☆72Updated 5 months ago
Alternatives and similar repositories for Classic-Generative-Model
Users that are interested in Classic-Generative-Model are comparing it to the libraries listed below
Sorting:
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆171Updated 2 weeks ago
- Inference pipeline for some Text-to-Image metrics.☆42Updated last week
- [TCSVT 2024] Official PyTorch implementation of the paper "MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Hum…☆24Updated 10 months ago
- 【ICME2025 Oral】Offical Pytorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"☆10Updated 2 months ago
- [ECCV 2024] Tuning-Free Image Customization with Image and Text Guidance☆141Updated 4 months ago
- The repository for the paper "Image Inversion: A Survey from GANs to Diffusion and Beyond".☆73Updated last month
- ☆133Updated 5 months ago
- SpecRef: A Fast Training-free Baseline of Specific Reference-Condition Real Image Editing☆35Updated last year
- [NeurIPS 2024] Referring Human Pose and Mask Estimation In the Wild☆43Updated 5 months ago
- ☆22Updated last week
- [AAAI 2025] Code for paper:Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation☆3Updated 4 months ago
- [ECCV 2024] InterFusion: Text-Driven Generation of 3D Human-Object Interaction☆54Updated 5 months ago
- [ICML 2025] Official repository for paper "Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation"☆146Updated 3 weeks ago
- Official code of CVPR 2023 Highlight paper CVT-SLR☆122Updated last year
- [IJCV 2024] RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations☆36Updated 7 months ago
- Image and video Tokenizer/VAE selection guide, text and face reconstruction evaluation.☆35Updated this week
- OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation☆103Updated this week
- [ICRA 2025]AVD2: Accident Video Diffusion for Accident Video Description☆82Updated 3 weeks ago
- SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation☆114Updated 7 months ago
- The official project website of "ScaleKD: Strong Vision Transformers Could Be Excellent Teachers" (ScaleKD for short, accepted to NeurIPS…☆59Updated 4 months ago
- [CVPR 2024] Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation☆113Updated last year
- Official implementation of "SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning"☆64Updated last month
- Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]☆113Updated last year
- Official Implementation of NeurIPS 2023 Contextually Affinitive Neighborhood Refinery for Deep Clustering☆46Updated last year
- [IJCAI 2023] A Large-scale Film Style Dataset for Learning Multi-frequency Driven Film Enhancement☆51Updated 10 months ago
- Official implementation of "Generating images with 3D annotations using diffusion models".☆48Updated 9 months ago
- [ICLR 2025] Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration☆97Updated 4 months ago
- [CVPR24] Official Implementation of 'A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video …☆98Updated 11 months ago
- Visualization of DiT self attention features☆211Updated 9 months ago
- [ICLR 2025] BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities☆144Updated 4 months ago