Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing"
☆524Nov 24, 2025Updated 7 months ago
Alternatives and similar repositories for FluxText
Users that are interested in FluxText are comparing it to the libraries listed below.
- [ICLR26] NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation Models☆113Jul 28, 2025Updated 11 months ago
- A comprehensive benchmark specifically designed to evaluate the interactive response capabilities of world models in 4D settings.☆107Mar 24, 2026Updated 3 months ago
- RepText: Rendering Visual Text via Replicating 🔥☆140Jun 7, 2025Updated last year
- ☆17Mar 25, 2025Updated last year
- Use the SHMT method to apply makeup to characters in ComfyUI☆29Jan 9, 2025Updated last year
- Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model.☆86Jun 30, 2025Updated last year
- [ICCV25] USP: Unified Self-Supervised Pretraining for Image Generation and Understanding☆97Oct 11, 2025Updated 8 months ago
- [ICCV 25] VMBench: A Benchmark for Perception-Aligned Video Motion Generation☆75Oct 10, 2025Updated 8 months ago
- [ICLR'26] Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework☆541Jan 27, 2026Updated 5 months ago
- TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis☆96Sep 18, 2025Updated 9 months ago
- [AAAI2026] Implementation Code for Omni-Effects☆175Dec 9, 2025Updated 6 months ago
- ☆148Dec 14, 2025Updated 6 months ago
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆430May 12, 2026Updated last month
- A node for ComfyUI that performs GPEN face restoration on the input image(s). Significantly faster than other implementations of GPEN.☆67Apr 15, 2025Updated last year
- HiGoalVita is a modular, layered, production ready AI RAG suite.☆252May 22, 2025Updated last year
- Calligrapher: Freestyle Text Image Customization☆296Sep 3, 2025Updated 9 months ago
- [KDD 2026 Oral] MobilityBench: A Scalable Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios☆153Jun 10, 2026Updated 2 weeks ago
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆274Sep 22, 2025Updated 9 months ago
- ☆342Jul 4, 2025Updated 11 months ago
- ☆132Feb 15, 2025Updated last year
- [ICLR26]GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning☆182Jan 29, 2026Updated 5 months ago
- [T-PAMI 2025] Official implementation for "SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation" https://arxiv…☆451Dec 13, 2024Updated last year
- Flux Kontext Inpainting ComfyUI Implementation☆401Jul 1, 2025Updated 11 months ago
- Official implementation code of the paper <AnyText2: Visual Text Generation and Editing With Customizable Attributes>☆204Nov 26, 2025Updated 7 months ago
- GENERanno: A Genomic Foundation Model for Metagenomic Annotation☆314Jun 15, 2026Updated 2 weeks ago
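- A ComfyUI extension node that adds a variety of polished artistic text effects to your images, with support for a rich set of text styles and effects.☆30Mar 21, 2025Updated last year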
- A ComfyUI extension for BAGEL (Unified Model for Multimodal Understanding and Generation)☆188Oct 13, 2025Updated 8 months ago
- This repository implements Yolo functionality using TensorRT and CUDA acceleration on Nvidia Jetson devices and the ROS framework.☆205Aug 14, 2025Updated 10 months ago
- (Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators☆643Jun 1, 2026Updated 3 weeks ago
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…☆2,233Apr 29, 2026Updated 2 months ago
- Practice Code for text to image trainer☆560Feb 27, 2026Updated 4 months ago
- PosterMaker [CVPR 2025] https://poster-maker.github.io/☆159Nov 12, 2025Updated 7 months ago
- ☆405Aug 31, 2022Updated 3 years ago
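- 2025 tech sharing (FullStack Frontend Focus), covering commonly used knowledge points. Code is typed entirely by hand and verified with AI; quality content only!☆153Jul 2, 2025Updated 11 months ago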
- ☆163Nov 16, 2025Updated 7 months ago
- Official code of ICML 2025 paper "NTPP: Generative Speech Language Modeling for Dual-Channel Spoken Dialogue via Next-Token-Pair Predicti…☆134Oct 27, 2025Updated 8 months ago
- ☆1,367Apr 21, 2025Updated last year
- Advanced Vision Model Loader for Comfy UI☆261Mar 6, 2025Updated last year
- ☆114Aug 10, 2025Updated 10 months ago