XavierJiezou / Face-MoGLELinks
Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation
☆26Updated last month
Alternatives and similar repositories for Face-MoGLE
Users that are interested in Face-MoGLE are comparing it to the libraries listed below
Sorting:
- ☆78Updated 8 months ago
- An official implementation of SwapAnyone.☆72Updated 9 months ago
- ☆33Updated 2 months ago
- Music production for silent film clips.☆31Updated 8 months ago
- LVAS-Agent Code Base☆21Updated 8 months ago
- EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs☆42Updated 3 months ago
- MV-RAG combines retrieval with multi-view generation to create accurate 3D-consistent visuals. By retrieving reference images and text, i…☆21Updated last month
- ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation☆109Updated last month
- Animate Any Character in Any World☆86Updated this week
- ☆38Updated last month
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆45Updated last year
- Official implementation of "Art-Free Generative Models: Art Creation Without Graphic Art Knowledge"☆32Updated last month
- ☆18Updated 6 months ago
- [NeurIPS 2024] Official PyTorch Implementation of "FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner"☆70Updated 2 months ago
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models".☆28Updated last week
- Official Repository of paper: "MotionEdit: Benchmarking and Learning Motion-Centric Image Editing"☆51Updated this week
- [NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"☆55Updated last week
- Official implementation of the paper "Bind-Your-Avatar: Multi-Talking-Character Video Generation with Dynamic 3D-mask-based Embedding Rou…☆29Updated 3 months ago
- ☆29Updated 9 months ago
- This project is the official implementation of 'DreamOmni3: Scribble-based Editing and Generation''☆24Updated last week
- Make self forcing endless. Add cache purging. Add prompt controllability.☆68Updated 4 months ago
- VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning☆60Updated 2 months ago
- [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization☆53Updated last year
- CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning☆32Updated 4 months ago
- Official Implementation of ReCo: Region-Constraint In-Context Generation for Instructional Video Editing☆115Updated last week
- [ICML 2025] Official implementation of the paper "Compressed Image Generation with Denoising Diffusion Codebook Models"☆79Updated 5 months ago
- DACVAE☆184Updated 3 weeks ago
- LIA-X: Interpretable Latent Portrait Animator☆93Updated 3 months ago
- video-SALMONN 2 is a powerful audio-visual large language model (LLM) that generates high-quality audio-visual video captions, which is d…☆136Updated 3 weeks ago
- Official implementation of "VSTAR: Generative Temporal Nursing for Longer Dynamic Video Synthesis"☆20Updated 11 months ago