RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards.
☆317 · Dec 29, 2025 · Updated 2 months ago
Alternatives and similar repositories for RealGen
Users who are interested in RealGen are comparing it to the libraries listed below.
- ☆88 · Dec 12, 2025 · Updated 2 months ago
- [ICLR 2026] Draw-In-Mind: Rebalancing Designer-Painter Roles in Unified Multimodal Models Benefits Image Editing ☆25 · Jan 27, 2026 · Updated last month
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback… ☆10 · Sep 17, 2025 · Updated 5 months ago
- ☆553 · Updated this week
- ☆33 · Aug 9, 2024 · Updated last year
- A Unified Visual Generator with Interleaved OmniModal Context ☆185 · Feb 10, 2026 · Updated 2 weeks ago
- "MoCA: Mixture-of-Components Attention for Scalable Compositional 3D Generation" ☆173 · Dec 9, 2025 · Updated 2 months ago
- Torch 7 + Android port of Neural style algorithm ☆10 · May 10, 2016 · Updated 9 years ago
- Open Source Text-to-Speech GUI Tool running on TalkNet ☆11 · Dec 24, 2022 · Updated 3 years ago
- ☆31 · Dec 7, 2025 · Updated 2 months ago
- InsertAnywhere: Bridging 4D Scene Geometry and Diffusion Models for Realistic Video Object Insertion ☆82 · Dec 27, 2025 · Updated 2 months ago
- ☆197 · Feb 3, 2026 · Updated last month
- ☆130 · Dec 24, 2025 · Updated 2 months ago
- Scaling Zero-Shot Reference-to-Video Generation ☆62 · Dec 11, 2025 · Updated 2 months ago
- EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026] ☆123 · Feb 6, 2026 · Updated 3 weeks ago
- Official implementation for "Story2Board: A Training‑Free Approach for Expressive Storyboard Generation" ☆233 · Aug 22, 2025 · Updated 6 months ago
- This is a ComfyUI custom node implementation of 'PersonaLive: Expressive Portrait Image Animation for Live Streaming'. ☆101 · Jan 25, 2026 · Updated last month
- [CVPR 2026] Official repo of "MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing" ☆76 · Feb 22, 2026 · Updated last week
- ☆19 · Apr 28, 2025 · Updated 10 months ago
- Listen, transcribe, reply - Voice Assistant using OpenAI & ElevenLabs APIs ☆14 · Jun 24, 2023 · Updated 2 years ago
- SigLIP-based Aesthetic Score Predictor ☆383 · Dec 18, 2024 · Updated last year
- Official implementation of the paper: "FlowEdit: Inversion-Free Text-Based Editing Using Pre-Trained Flow Models" ☆942 · Dec 23, 2025 · Updated 2 months ago
- ☆55 · Dec 8, 2025 · Updated 2 months ago
- ☆14 · Oct 16, 2023 · Updated 2 years ago
- Official implementation for "DyPE: Dynamic Position Extrapolation for Ultra High Resolution Diffusion". ☆340 · Updated this week
- State-of-the-art framework for fast, large-scale training and inference of diffusion models ☆32 · Updated this week
- [ECCV 2024] Be-Your-Outpainter https://arxiv.org/abs/2403.13745 ☆255 · Apr 19, 2025 · Updated 10 months ago
- Dreambooth for colab ☆31 · Dec 25, 2023 · Updated 2 years ago
- YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals ☆40 · Feb 9, 2025 · Updated last year
- stochastic bfloat16 based optimizer library ☆21 · Dec 4, 2024 · Updated last year
- Official repository for the paper "Audio ControlNet for Fine-Grained Audio Generation and Editing". ☆63 · Feb 7, 2026 · Updated 3 weeks ago
- Pytorch implementation of Self-Refining Video Sampling ☆146 · Feb 6, 2026 · Updated 3 weeks ago
- [ACM MM24] Official implementation of ACM MM 2024 paper: "ZePo: Zero-Shot Portrait Stylization with Faster Sampling" ☆44 · Aug 22, 2024 · Updated last year
- Official implementation of DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation ☆119 · Jan 23, 2025 · Updated last year
- [NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up". ☆215 · Sep 27, 2025 · Updated 5 months ago
- Rectified Flow Inversion (RF-Inversion) - ICLR 2025 ☆469 · Mar 19, 2025 · Updated 11 months ago
- Code for 'JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion' ☆199 · Feb 10, 2026 · Updated 2 weeks ago
- ☆123 · Oct 14, 2024 · Updated last year
- ☆46 · Nov 20, 2025 · Updated 3 months ago