[NeurIPS 24] PromptFix: You Prompt and We Fix the Photo
☆891Oct 4, 2024Updated last year
Alternatives and similar repositories for PromptFix
Users that are interested in PromptFix are comparing it to the libraries listed below
Sorting:
- gradio WebUI for AdvancedLivePortrait☆527Mar 13, 2025Updated 11 months ago
- [NeurIPS 2024] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation☆1,191Mar 21, 2025Updated 11 months ago
- The fastest digital human algorithm, now on your desktop.☆570Sep 29, 2025Updated 5 months ago
- Official repository of In-Context LoRA for Diffusion Transformers☆2,058Dec 20, 2024Updated last year
- Clapper.app, a video synthesizer and sequencer designed for the age of AI cinema☆2,313Aug 1, 2025Updated 7 months ago
- MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code☆814Oct 16, 2024Updated last year
- SEED-Story: Multimodal Long Story Generation with Large Language Model☆884Oct 11, 2024Updated last year
- [ICCV 2025] Code Implementation of "ArtEditor: Learning Customized Instructional Image Editor from Few-Shot Examples"☆433Apr 23, 2025Updated 10 months ago
- [ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer☆1,903Jul 3, 2025Updated 8 months ago
- This is a study aim to transfer the single concept by using DIT model self-attention capablity☆786Nov 20, 2024Updated last year
- Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…☆1,132Feb 7, 2025Updated last year
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆2,983Dec 8, 2025Updated 2 months ago
- [CVPR'25] Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System☆3,666Dec 3, 2025Updated 3 months ago
- ☆790Nov 22, 2024Updated last year
- Arbitrary-steps Image Super-resolution via Diffusion Inversion (CVPR 2025)☆1,379Feb 7, 2026Updated 3 weeks ago
- A powerful Python tool that leverages Claude 3.5 Sonnet Vision API to detect and visualize objects in images. The script automatically dr…☆221Nov 3, 2024Updated last year
- Official implementations for paper: Zero-shot Image Editing with Reference Imitation☆1,305Jun 15, 2024Updated last year
- [under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"☆587Sep 3, 2025Updated 6 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆568Nov 20, 2025Updated 3 months ago
- StoryMaker: Towards consistent characters in text-to-image generation☆721Dec 2, 2024Updated last year
- Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切…☆16,048May 18, 2025Updated 9 months ago
- [CVPR 2025] Learning Flow Fields in Attention for Controllable Person Image Generation☆1,649Sep 12, 2025Updated 5 months ago
- 一键将视频转换为优质小红书笔记,自动优化内容和配图☆1,707Oct 30, 2025Updated 4 months ago
- [CVPR 2025] Official implementation of "AnyDressing: Customizable Multi-Garment Virtual Dressing via Latent Diffusion Models"☆328Apr 9, 2025Updated 10 months ago
- [CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation☆4,490Feb 23, 2026Updated last week
- [CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos…☆1,411Sep 21, 2025Updated 5 months ago
- ☆899Dec 11, 2024Updated last year
- Welcome to the 'Open-Alteryx-Macro' project. This project is aimed at providing an open-source solution for managing and updating Alteryx…☆156May 25, 2024Updated last year
- Official Implementation of AttentionShift: Iteratively Estimated Part-based Attention Map for Pointly Supervised Instance Segmentation☆155Oct 18, 2024Updated last year
- [NeurIPS 2024] SHMT: Self-supervised Hierarchical Makeup Transfer via Latent Diffusion Models☆207Jan 24, 2025Updated last year
- A free + OSS logo generator powered by Flux on Together AI☆6,210Dec 12, 2025Updated 2 months ago
- [ICLR 2025 Oral] TANGO: Co-Speech Gesture Video Reenactment with Hierarchical Audio-Motion Embedding and Diffusion Interpolation☆1,144Aug 24, 2025Updated 6 months ago
- ⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。☆20,759Jul 12, 2025Updated 7 months ago
- Kolors Team☆4,603Nov 13, 2024Updated last year
- Solution for checking file if contain NSFW content.☆603Jan 30, 2026Updated last month
- (Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators☆640Nov 10, 2025Updated 3 months ago
- The official HelloMeme GitHub site☆627Jun 27, 2025Updated 8 months ago
- [AAAI 2025]👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing. It enables customizable human image generation …☆1,328Sep 30, 2025Updated 5 months ago
- Official implementation of OneDiffusion paper (CVPR 2025)☆664Dec 14, 2024Updated last year