ZYH-Lightyear / LVAS
LVAS-Agent Code Base
☆15, updated last month
Alternatives and similar repositories for LVAS
Users who are interested in LVAS are comparing it to the repositories listed below.
- The official implementation of OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows ☆60, updated 2 months ago
- Music production for silent film clips. ☆22, updated 2 weeks ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner" ☆68, updated 7 months ago
- ☆69, updated last week
- An official implementation of SwapAnyone. ☆60, updated 2 months ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening ☆58, updated 2 months ago
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis ☆51, updated 2 weeks ago
- ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer ☆33, updated 4 months ago
- The official PyTorch implementation for Improving Long-Text Alignment for Text-to-Image Diffusion Models (LongAlign) ☆73, updated 3 weeks ago
- Implementation of the proposed MaskBit from Bytedance AI ☆76, updated 6 months ago
- [WACV 2025] - EmoVOCA: Speech-Driven Emotional 3D Talking Heads ☆20, updated 2 months ago
- HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation ☆57, updated 2 months ago
- A big_vision-inspired repo that implements a generic Auto-Encoder class capable of representation learning and generative modeling. ☆35, updated 10 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think! ☆109, updated 2 months ago
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a… ☆95, updated last month
- MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation ☆23, updated 2 months ago
- ☆14, updated 2 months ago
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis. ☆45, updated 8 months ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis" ☆19, updated 3 months ago
- Learning Motion from Low-Rank Adaptation ☆45, updated 11 months ago
- [CVPR 2025] Official implementation of ByTheWay: Boost Your Text-to-Video Generation Model to Higher Quality in a Training-free Way ☆36, updated 3 weeks ago
- Collection of scripts to build small-scale datasets for fine-tuning video generation models. ☆55, updated last month
- [NeurIPS 2024] Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective ☆68, updated 6 months ago
- [CVPR 2024] The official implementation of the paper "Relation Rectification in Diffusion Model" ☆47, updated 8 months ago
- Distilling Diversity and Control in Diffusion Models ☆39, updated 2 weeks ago
- [ICLR 2025] CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion ☆44, updated 3 months ago
- Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation" ☆145, updated 3 weeks ago
- Official Implementation of GrounDiT (NeurIPS 2024) ☆53, updated 5 months ago
- ☆33, updated 6 months ago
- Official implementation of "JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization" ☆52, updated last month