☆208Apr 23, 2026Updated 2 months ago
Alternatives and similar repositories for RefineAnything
Users that are interested in RefineAnything are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation for Detector Guidance for Multi-Object Text-to-Image Generation (DG)☆20Feb 7, 2024Updated 2 years ago
- [ICCV2025] Official code for Fine-structure Preserved Real-world Image Super-resolution via Transfer VAE Training☆127Jan 6, 2026Updated 5 months ago
- Code Implementation of “RelationAdapter: Learning and Transferring Visual Relation with Diffusion Transformers”☆32Apr 13, 2026Updated 2 months ago
- A framework for camera-controllable image editing using unified geometric guidance and video models.☆66Jun 25, 2026Updated last week
- ☆49Apr 17, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance (CVPR 2026 Highlight)☆59Feb 23, 2026Updated 4 months ago
- [SIGGRAPH Asia 2025] Official Implementation of "ConsistEdit: Highly Consistent and Precise Training-free Visual Editing"☆73Apr 8, 2026Updated 2 months ago
- [ICLR 2026] ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation☆80Apr 19, 2026Updated 2 months ago
- [ICLR 2025 spotlight] 3DIS: Depth-Driven Decoupled Instance Synthesis for Text-to-Image Generation☆259Jun 3, 2025Updated last year
- [CVPR 2026]☆57May 29, 2026Updated last month
- [ICLR 2026 Oral] Reasoning as Representation: Rethinking Visual Reinforcement Learning in Image Quality Assessment☆38Feb 14, 2026Updated 4 months ago
- ☆37Mar 21, 2025Updated last year
- (AAAI2024) Controllable 3D Face Generation with Conditional Style Code Diffusion☆40Apr 17, 2024Updated 2 years ago
- Unofficial Implementation of Training-free Diffusion Model Adaptation for Variable-Sized Text-to-Image Synthesis☆16Sep 27, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICCV2025] Official implementation of "IFAdapter: Instance Feature Control for Grounded Text-to-Image Generation".☆62Jun 27, 2025Updated last year
- [NIPS 2025] Seg2Any: Open-set Segmentation-Mask-to-Image Generation with Precise Shape and Semantic Control☆48Apr 1, 2026Updated 3 months ago
- Simulate virus pandemic in your browser. Change virus settings to visualize how different settings affect the spread of the disease.☆12Jan 5, 2023Updated 3 years ago
- This node is base on VisualCloze method, A Universal Image Generation Framework via Visual In-Context Learning☆11May 21, 2025Updated last year
- A PyTorch implementation of computing mean average precision in parallel☆16Jul 7, 2022Updated 3 years ago
- A script to draw attention heat map with matplotlib☆14May 7, 2019Updated 7 years ago
- Official implementation of "OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes".☆100Mar 31, 2026Updated 3 months ago
- Example scripts for using [my] fine-tuned CLIP models with HuggingFace 🤗☆13Sep 24, 2024Updated last year
- ☆15May 13, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Using LLM and Joy tag pipeline to tag your image(s folder), it's suitable for train FLUX LoRA and also sdxl. Load images in order!☆18Oct 24, 2025Updated 8 months ago
- Extension for Forge-based UIs (Forge, reForge, etc) and ComfyUI to replace CFG with Negative Rejection Steering☆16May 16, 2026Updated last month
- 本项目包含一个 Python 脚本,用于分离双人(或多人)对话播客音频文件中的不同说话人语音。它利用 `pyannote.audio` 库进行说话人日志分析(Speaker Diarization),找出“谁在什么时候说话”,并将每个说话人的语音片段提取到单独的音轨中。☆15Apr 30, 2025Updated last year
- UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios☆143Apr 9, 2026Updated 2 months ago
- A real-time caption translation tool based on VOSK speech recognition and machine translation, which supports transcribing audio into tar…☆10Mar 12, 2025Updated last year
- Official Code for "Intelligent Painter: Picture Composition With Resampling Diffusion Model" (ICIP 2023)☆16Jun 23, 2023Updated 3 years ago
- [CVPR 2024 Highlight] Official repository for paper "SIFU: Side-view Conditioned Implicit Function for Real-world Usable Clothed Human Re…☆272Sep 21, 2025Updated 9 months ago
- Code for full fintuing Mochi model with FSDP (and CP)☆29Jul 15, 2025Updated 11 months ago
- Convert 2D videos to 3D VR format using AI depth estimation.☆37Jan 20, 2026Updated 5 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [AAAI 2025] Effective Diffusion Transformer Architecture for Image Super-Resolution☆74May 15, 2025Updated last year
- [arXiv 2026] Official PyTorch Repository for "Coarse-Guided Visual Generation via Weighted h-Transform Sampling"☆41May 8, 2026Updated last month
- 3D Telecommunications project utilizing Holoportation technology to provide live volumetric capture. Used in one case to increase the re…☆22Apr 15, 2026Updated 2 months ago
- [SIGGRAPH Asia 2025] "ASIA: Adaptive 3D Segmentation using Few Image Annotations ".☆26Feb 14, 2026Updated 4 months ago
- ☆34Mar 18, 2025Updated last year
- Code for 'Single-Image 3D Human Reconstruction with 3D-Aware Diffusion Priors and Facial Enhancement [Siggraph Asia 2025]'☆21Feb 1, 2026Updated 5 months ago
- [ACCV 2024 Poster] official code for "VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model"☆10Sep 28, 2024Updated last year