revelio-diffusion / revelio
☆14Updated last month
Alternatives and similar repositories for revelio:
Users that are interested in revelio are comparing it to the libraries listed below
- [ICLR 2025] SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image and Video Generation☆36Updated 3 months ago
- Rare-to-Frequent (R2F), ICLR'25, Spotlight☆41Updated this week
- [ICLR 2025] Causal Graphical Models for Vision-Language Compositional Understanding☆8Updated last week
- Code for the paper - ConceptPrune: Concept Editing in Diffusion Models via Skilled Neuron Pruning☆18Updated 8 months ago
- ☆41Updated 5 months ago
- Official Repository of Personalized Visual Instruct Tuning☆28Updated last month
- (arXiv.2405.18406) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives☆36Updated 5 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆33Updated last month
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆19Updated last year
- ☆11Updated 6 months ago
- A benchmark dataset and simple code examples for measuring the perception and reasoning of multi-sensor Vision Language models.☆18Updated 4 months ago
- Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement"☆46Updated 4 months ago
- The official repo of continuous speculative decoding☆24Updated last month
- The official implementation of our paper "CoRe^2: Collect, Reflect and Refine to Generate Better and Faster".☆22Updated last month
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective☆66Updated 5 months ago
- Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"☆28Updated 6 months ago
- Official PyTorch implementation of "Generalized Consistency Trajectory Models for Image Manipulation"☆37Updated last year
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation☆57Updated 2 months ago
- Official code for the paper 'DCTdiff: Intriguing Properties of Image Generative Modeling in the DCT Space'☆25Updated 4 months ago
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data☆34Updated last year
- NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆46Updated this week
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis☆65Updated 2 months ago
- A curated list of Awesome Personalized Large Multimodal Models resources☆19Updated last month
- ☆11Updated 3 months ago
- ☆38Updated 7 months ago
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts"☆40Updated 9 months ago
- ☆22Updated 10 months ago
- ☆21Updated last year
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign)☆70Updated this week
- CLIP-MoE: Mixture of Experts for CLIP☆31Updated 6 months ago