Implementation of "FLUX-Text: A Simple and Advanced Diffusion Transformer Baseline for Scene Text Editing"
☆448Nov 24, 2025Updated 5 months ago
Alternatives and similar repositories for FluxText
Users that are interested in FluxText are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR26] NarrLV: Towards a Comprehensive Narrative-Centric Evaluation for Long Video Generation Models☆112Jul 28, 2025Updated 9 months ago
- A comprehensive benchmark specifically designed to evaluate the interactive response capabilities of world models in 4D settings.☆107Mar 24, 2026Updated last month
- RepText: Rendering Visual Text via Replicating 🔥☆140Jun 7, 2025Updated 10 months ago
- ☆17Mar 25, 2025Updated last year
- You can use SHMT method to apply makeup to the characters when use ComfyUI☆30Jan 9, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model.☆84Jun 30, 2025Updated 10 months ago
- [ICCV25] USP: Unified Self-Supervised Pretraining for Image Generation and Understanding☆93Oct 11, 2025Updated 6 months ago
- [ICCV 25] VMBench: A Benchmark for Perception-Aligned Video Motion Generation☆71Oct 10, 2025Updated 6 months ago
- [ICLR'26] Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework☆537Jan 27, 2026Updated 3 months ago
- TextFlux: An OCR-Free DiT Model for High-Fidelity Multilingual Scene Text Synthesis☆92Sep 18, 2025Updated 7 months ago
- [AAAI2026] Implementation Code for Omni-Effects☆176Dec 9, 2025Updated 4 months ago
- ☆146Dec 14, 2025Updated 4 months ago
- EvaLearn is a pioneering benchmark designed to evaluate large language models (LLMs) on their learning capability and efficiency in chall…☆432Jan 18, 2026Updated 3 months ago
- A node for ComfyUI that performs GPEN face restoration on the input image(s). Significantly faster than other implementations of GPEN.☆68Apr 15, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- HiGoalVita is a modular, layered, production ready AI RAG suite.☆253May 22, 2025Updated 11 months ago
- Calligrapher: Freestyle Text Image Customization☆296Sep 3, 2025Updated 7 months ago
- MobilityBench: A Scalable Benchmark for Evaluating Route-Planning Agents in Real-World Mobility Scenarios☆137Mar 4, 2026Updated last month
- World Simulator Assistant for Physics-Aware Text-to-Video Generation☆273Sep 22, 2025Updated 7 months ago
- ☆343Jul 4, 2025Updated 9 months ago
- ☆133Feb 15, 2025Updated last year
- [ICLR26]GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning☆183Jan 29, 2026Updated 3 months ago
- [T-PAMI 2025] Official implementation for "SVGDreamer++: Advancing Editability and Diversity in Text-Guided SVG Generation" https://arxiv…☆448Dec 13, 2024Updated last year
- [ICLR2026] Video-STAR: Reinforcing Open-Vocabulary Action Recognition with Tools☆56Apr 17, 2026Updated last week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official implementation code of the paper <AnyText2: Visual Text Generation and Editing With Customizable Attributes>☆190Nov 26, 2025Updated 5 months ago
- Flux Kontext Inpainting ComfyUI Implementation☆394Jul 1, 2025Updated 9 months ago
- GENERanno: A Genomic Foundation Model for Metagenomic Annotation☆310Feb 27, 2026Updated 2 months ago
- 一款ComfyUI扩展节点,能够为您的图像添加各种精美的艺术文字效果,支持丰富的文字样式和特效。☆30Mar 21, 2025Updated last year
- A ComfyUI extention for BAGEL(Unified Model for Multimodal Understanding and Generation)☆189Oct 13, 2025Updated 6 months ago
- This repository implements Yolo functionality using TensorRT and CUDA acceleration on Nvidia Jetson devices and the ROS framework.☆205Aug 14, 2025Updated 8 months ago
- (Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators☆643Nov 10, 2025Updated 5 months ago
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…☆2,186Dec 29, 2025Updated 4 months ago
- PosterMaker [CVPR 2025] https://poster-maker.github.io/☆153Nov 12, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Practice Code for text to image trainer☆562Feb 27, 2026Updated 2 months ago
- ☆404Aug 31, 2022Updated 3 years ago
- 2025技术分享(FullStack Frontend Focus),分享常用知识点。代码纯手打+AI验证,只做精品!!!☆153Jul 2, 2025Updated 9 months ago
- ☆164Nov 16, 2025Updated 5 months ago
- Official code of ICML 2025 paper "NTPP: Generative Speech Language Modeling for Dual-Channel Spoken Dialogue via Next-Token-Pair Predicti…☆134Oct 27, 2025Updated 6 months ago
- ☆1,365Apr 21, 2025Updated last year
- Advanced Vision Model Loader for Comfy UI☆261Mar 6, 2025Updated last year