liujf69 / Classic-Generative-ModelLinks
Simple code demos about classic AIGC models/Compilation of blogs and papers on classic AIGC models.
☆72Updated 6 months ago
Alternatives and similar repositories for Classic-Generative-Model
Users that are interested in Classic-Generative-Model are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] Tuning-Free Image Customization with Image and Text Guidance☆146Updated 5 months ago
- Inference pipeline for some Text-to-Image metrics.☆42Updated 2 weeks ago
- ☆133Updated 6 months ago
- [NeurIPS 2024] Referring Human Pose and Mask Estimation In the Wild☆43Updated 5 months ago
- Official implementation of "Generating images with 3D annotations using diffusion models".☆49Updated 10 months ago
- [TCSVT 2024] Official PyTorch implementation of the paper "MLP: Motion Label Prior for Temporal Sentence Localization in Untrimmed 3D Hum…☆26Updated 11 months ago
- ☆25Updated last month
- [ICML 2025] Official repository for paper "Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation"☆158Updated last month
- Image and video Tokenizer/VAE selection guide, text and face reconstruction evaluation.☆82Updated last month
- SpecRef: A Fast Training-free Baseline of Specific Reference-Condition Real Image Editing☆35Updated last year
- [AAAI 2025] Code for paper:Enhancing Multimodal Large Language Models Complex Reasoning via Similarity Computation☆3Updated 5 months ago
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆233Updated last month
- [ECCV 2024] InterFusion: Text-Driven Generation of 3D Human-Object Interaction☆54Updated 6 months ago
- The repository for the paper "Image Inversion: A Survey from GANs to Diffusion and Beyond".☆74Updated 2 months ago
- Official implementation of "SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning"☆65Updated last week
- Rethinking Video-Text Understanding Retrieval from Counterfactually Augmented Data☆39Updated 11 months ago
- Multi-granularity Correspondence Learning from Long-term Noisy Videos [ICLR 2024, Oral]☆114Updated last year
- The official generation code and toolkits of VDW dataset (ICCV 2023)☆35Updated 11 months ago
- [CVPR 2024] Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation☆113Updated last year
- [ICRA 2025]AVD2: Accident Video Diffusion for Accident Video Description☆82Updated last month
- [ICCV 2025] Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer☆120Updated last week
- Official Implementation for "Mask-based modeling for Neural Radiance Fields" (ICLR 2024)☆37Updated last year
- [ICLR 2025] BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities☆144Updated 5 months ago
- [MM'24 Oral] Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval☆126Updated 10 months ago
- The official project website of "ScaleKD: Strong Vision Transformers Could Be Excellent Teachers" (ScaleKD for short, accepted to NeurIPS…☆62Updated 5 months ago
- [MMAsia 2023] Official PyTorch implementation of the paper " Cross-Modal Retrieval for Motion and Text via DropTriple Loss "☆36Updated 7 months ago
- Code release for "UniVS: Unified and Universal Video Segmentation with Prompts as Queries" (CVPR2024)☆185Updated 7 months ago
- RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations☆36Updated 8 months ago
- 【ICME2025 Oral】Offical Pytorch Code for "Fraesormer: Learning Adaptive Sparse Transformer for Efficient Food Recognition"☆10Updated 3 months ago
- [Neurips 2023] dynpoint: dynamic neural point for view synthesis☆52Updated last year