yonseivnl / cmota
☆10 Updated 10 months ago
Alternatives and similar repositories for cmota
Users that are interested in cmota are comparing it to the libraries listed below
- ☆30 Updated last year
- Visual Instruction Inversion: Image Editing via Visual Prompting (NeurIPS 2023) ☆91 Updated last year
- ☆78 Updated last year
- [CVPR 2024] On the Content Bias in Fréchet Video Distance ☆117 Updated 9 months ago
- Official GitHub repository for the Text-Guided Video Editing (TGVE) competition of LOVEU Workshop @ CVPR'23. ☆76 Updated last year
- [ICLR 2024] LLM-grounded Video Diffusion Models (LVD): official implementation for the LVD paper ☆154 Updated last year
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization ☆64 Updated last year
- PyTorch implementation of InstructAny2Pix: Flexible Visual Editing via Multimodal Instruction Following ☆30 Updated 5 months ago
- Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation ☆109 Updated 3 months ago
- [NeurIPS 2024] Motion Consistency Model: Accelerating Video Diffusion with Disentangled Motion-Appearance Distillation ☆66 Updated 8 months ago
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator ☆94 Updated last year
- [CVPR2024] Official PyTorch implementation of "Contrastive Denoising Score(CDS) for Text-guided Latent Diffusion Image Editing" ☆115 Updated 8 months ago
- ☆126 Updated last year
- official repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024] ☆92 Updated 5 months ago
- [ECCV 2024] Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning ☆51 Updated last month
- FQGAN: Factorized Visual Tokenization and Generation ☆50 Updated 3 months ago
- Official Implementation of VideoDPO ☆121 Updated last month
- ☆12 Updated 2 years ago
- [CVPR`2024, Oral] Attention Calibration for Disentangled Text-to-Image Personalization ☆103 Updated last year
- ☆80 Updated 7 months ago
- Training-Free Condition-Guided Text-to-Video Generation ☆61 Updated 3 months ago
- Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models (ICLR 2024) ☆139 Updated last year
- ☆111 Updated 5 months ago
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection ☆42 Updated 3 weeks ago
- [CVPR 2025, Highlight] The official implementation of the paper "Unleashing In-context Learning of Autoregressive Models for Few-shot Ima… ☆21 Updated last month
- [CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models ☆170 Updated 9 months ago
- CVPR-24 | Official codebase for ZONE: Zero-shot InstructiON-guided Local Editing ☆78 Updated 7 months ago
- MC$^2$: Multi-concept Guidance for Customized Multi-concept Generation ☆27 Updated last year
- Code Release for the paper "Make-A-Story: Visual Memory Conditioned Consistent Story Generation" in CVPR 2023 ☆40 Updated 2 years ago
- [NeurIPS 2024 Spotlight] The official implement of research paper "MotionBooth: Motion-Aware Customized Text-to-Video Generation" ☆134 Updated 9 months ago