Exploring Representation-Aligned Latent Space for Better Generation
☆18Mar 17, 2026Updated last week
Alternatives and similar repositories for ReaLS
Users that are interested in ReaLS are comparing it to the libraries listed below.
- Official code for VINCIE: Unlocking In-context Image Editing from Video☆52Updated this week
- Gaussian Splatting 2D implemented in Triton☆11Mar 19, 2024Updated 2 years ago
- Official implementation for our paper: Rethinking Video Tokenization: A Conditioned Diffusion-based Approach☆14Apr 2, 2025Updated 11 months ago
- [ICML'25] EQ-VAE: Equivariance Regularized Latent Space for Improved Generative Image Modeling.☆175Updated this week
- DDS: Delta Denoising Score PyTorch implementation☆19Sep 2, 2023Updated 2 years ago
- ☆20Jan 1, 2026Updated 2 months ago
- [TPAMI 2026] Enhancing MMDiT-Based Text-to-Image Models for Similar Subject Generation☆11Mar 7, 2026Updated 2 weeks ago
- ☆40Dec 16, 2025Updated 3 months ago
- (NeurIPS 2025) Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation☆66Oct 14, 2025Updated 5 months ago
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction☆44Aug 9, 2025Updated 7 months ago
- Code for Self-Cross Diffusion Guidance for Text-to-Image Synthesis of Similar Subjects☆11Mar 5, 2026Updated 2 weeks ago
- ☆46Mar 12, 2026Updated last week
- [ICML 2025] Differentiable Solver Search for Fast Diffusion Sampling☆21Jul 7, 2025Updated 8 months ago
- StreetSurfGS: Scalable Large Scene Surface Reconstruction with Gaussian Splatting for Urban Street Scenes☆22Jun 12, 2024Updated last year
- ☆11Aug 23, 2024Updated last year
- PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation☆56Jan 5, 2026Updated 2 months ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆25Apr 14, 2025Updated 11 months ago
- Code repository for GCT634 Musical Applications of Machine Learning (Spring 2024)☆11May 19, 2024Updated last year
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"☆62Jul 1, 2025Updated 8 months ago
- [ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers☆41Jul 23, 2025Updated 8 months ago
- An efficient distillation method for flow matching models☆25Feb 1, 2026Updated last month
- [ICCV 2025] Distilling Parallel Gradients for Fast ODE Solvers of Diffusion Models☆35Updated this week
- [AAAI26] “VTinker: Guided Flow Upsampling and Texture Mapping for High-Resolution Video Frame Interpolation”☆17Mar 9, 2026Updated 2 weeks ago
- Code for Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? [COLM 2024]☆24Aug 13, 2024Updated last year
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning☆56Apr 1, 2025Updated 11 months ago
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆38Oct 26, 2025Updated 4 months ago
- ☆19May 19, 2025Updated 10 months ago
- An official implementation of Coefficients-Preserving Sampling for Reinforcement Learning with Flow Matching☆69Sep 11, 2025Updated 6 months ago
- [ECCV2024] "SlimFlow: Training Smaller One-Step Diffusion Models with Rectified Flow", Yuanzhi Zhu, Xingchao Liu, Qiang Liu☆60Nov 26, 2024Updated last year
- Code for the paper: Probabilistic Forecasting with Stochastic Interpolants and Follmer Processes☆18Aug 18, 2024Updated last year
- Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Transformer Train…☆28Updated this week
- [ICCV 2025] Official implementation of the paper: REPA-E: Unlocking VAE for End-to-End Tuning of Latent Diffusion Transformers☆473Dec 6, 2025Updated 3 months ago
- ☆20Mar 23, 2025Updated last year
- [AAAI2026] Bring Your Dreams to Life: Continual Text-to-Video Customization☆36Dec 9, 2025Updated 3 months ago
- [NeurIPS'25] Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"☆97Dec 3, 2025Updated 3 months ago
- Implementation and experiment of the MusGConv paper.☆15Sep 6, 2024Updated last year
- [NeurIPS 2024] Exploring DCN-like Architectures for Fast Image Generation with Arbitrary Resolution☆34Dec 23, 2024Updated last year
- OpenAI CLIP based image generator with complex config file controlled transformation and training pipelines☆19Jan 4, 2022Updated 4 years ago
- Implementation of "Analyzing and Improving the Training Dynamics of Diffusion Models"☆97Feb 12, 2024Updated 2 years ago