W2GenAI-Lab / LucidFluxLinks
☆1,159Updated 2 months ago
Alternatives and similar repositories for LucidFlux
Users that are interested in LucidFlux are comparing it to the libraries listed below
Sorting:
- [ICCV2025 Highlight] DicFace: Dirichlet-Constrained Variational Codebook Learning for Temporally Coherent Video Face Restoration☆447Updated 6 months ago
- Efficient controlnet for DiTs☆382Updated 8 months ago
- [NeurIPS2025 spotlight★] Official implementation for "RepLDM: Reprogramming Pretrained Latent Diffusion Models for High-Quality, High-Eff…☆220Updated last month
- [AAAI 2026] Playmate2: Training-Free Multi-Character Audio-Driven Animation via Diffusion Transformer with Reward Feedback☆294Updated 2 months ago
- ☆389Updated 6 months ago
- ☆99Updated last year
- Efficient DiT architecture for text2any tasks, ICLR2025☆447Updated 8 months ago
- [CVPR 2025] The code and model for our paper "Shadow Generation Using Diffusion Model with Geometry Prior", CVPR, 2025.☆141Updated 2 months ago
- Official implementation of UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy☆186Updated last week
- ☆119Updated this week
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆215Updated 3 months ago
- ☆164Updated last year
- GigaTrain: An Efficient and Scalable Training Framework for AI Models☆1,003Updated 2 months ago
- ☆69Updated last week
- This is the project for the paper of "Boosting Image Restoration via Priors from Pre-trained Models" in CVPR2024☆95Updated 7 months ago
- Inspiring the Next Generation of Segment Anything Models: Comprehensively Evaluate SAM and SAM 2 with Diverse Prompts Towards Context-Dep…☆574Updated 5 months ago
- GigaModels: A Comprehensive Repository and Platform for Multi-modal, Generative, and Perceptual Models☆924Updated 2 months ago
- A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which …☆533Updated this week
- (ECCV 2024) Open-Vocabulary Camouflaged Object Segmentation☆274Updated 5 months ago
- We introduce the Audio Logical Reasoning (ALR) dataset, consisting of 6,446 text-audio annotated samples specifically designed for comple…☆1,105Updated 2 months ago
- A Compositional Operation Toolbox for Gradient-based Bi-Level Optimization☆1,060Updated last month
- Official Code for “Unveiling Hidden Details: A RAW Data-Enhanced Paradigm for Real-World Super-Resolution”☆152Updated last year
- Dynamic human image animation with strong identity preservation, heterogeneous character driving, and controllable backgrounds.☆140Updated 8 months ago
- [NeurIPS 2025] Codes for paper Foundation Cures Personalization: Improving Personalized Models' Prompt Consistency via Hidden Foundation …☆133Updated 4 months ago
- MTLA: Multi-head Temporal Latent Attention☆760Updated 4 months ago
- Fat-Cat: A document-centric context management Agent. Making context as simple as reading chat history.☆515Updated 3 weeks ago
- [NeurIPS'2025] Official repository for "LiveStar: Live Streaming Assistant for Real-World Online Video Understanding"☆108Updated 2 months ago
- 「ICCV25 highlight」 Official implementation of “Feature Purification Matters: Suppressing Outlier Propagation for Training-Free Open-Vocab…☆48Updated this week
- GigaDatasets: A Unified and Lightweight Framework for Data Processing, Curation, and Visualization☆574Updated last month
- A real-time interactive Omni Avatar built on LiveKit, which allows you to seamlessly integrate with any open source Avatar components (re…☆559Updated last week