☆104Feb 4, 2026Updated 4 months ago
Alternatives and similar repositories for VAREdit
Users that are interested in VAREdit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository is an offical PyTorch implementation of SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.☆13Mar 18, 2024Updated 2 years ago
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202…☆37Mar 10, 2026Updated 3 months ago
- Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations☆22Dec 24, 2025Updated 5 months ago
- [ICCV 2025] LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal☆30Oct 20, 2025Updated 7 months ago
- PICABench: How Far Are We from Physically Realistic Image Editing?☆38Nov 5, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICLR 2026] A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design☆150Mar 12, 2026Updated 3 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆165Jun 26, 2025Updated 11 months ago
- Official implementation of CVPR 2025 paper "ID-Patch: Robust ID Association for Group Photo Personalization". This work proposed propose …☆77Apr 29, 2025Updated last year
- [AAAI 2025] Explore In-Context Segmentation via Latent Diffusion Models☆22Mar 25, 2025Updated last year
- ☆119Apr 25, 2025Updated last year
- ☆34Mar 18, 2025Updated last year
- Jittor挑战赛,骨骼绑定赛题☆15Oct 9, 2025Updated 8 months ago
- FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.☆69May 15, 2026Updated last month
- Official implementation for "Single Image Reflection Separation via Interactive Dual-Stream Transformers"☆25Dec 24, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation☆126May 20, 2026Updated 3 weeks ago
- BitDance custom nodes for ComfyUI with unified loader, text encode, sampler, and VAE nodes.☆33Feb 26, 2026Updated 3 months ago
- ☆98Nov 6, 2025Updated 7 months ago
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆45Jul 10, 2025Updated 11 months ago
- A ComfyUI custom node implementation of TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows.☆44Mar 6, 2026Updated 3 months ago
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆38Feb 11, 2025Updated last year
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Aug 12, 2025Updated 10 months ago
- Code release for AccDiffusionV2 (TPAMI)☆34Nov 4, 2025Updated 7 months ago
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆118Sep 27, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- RepText: Rendering Visual Text via Replicating 🔥☆140Jun 7, 2025Updated last year
- A Python implementation of an agent swarm system that works with local LLM servers. The system allows you to create multiple agents that …☆13Nov 20, 2024Updated last year
- ☆22Nov 22, 2024Updated last year
- Official Implementation for "ReMOVE: A Reference-free Metric for Object Erasure"☆25Apr 30, 2024Updated 2 years ago
- [ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench…☆99Jan 26, 2026Updated 4 months ago
- [CVPR 2026 Highlight] GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering☆101Apr 9, 2026Updated 2 months ago
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation☆236Aug 18, 2025Updated 10 months ago
- 🚀 [ICLR 2026] SenseFlow: Scaling Distribution Matching for Flow-based Text-to-Image Distillation☆95Mar 14, 2026Updated 3 months ago
- ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback☆123Sep 20, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework☆152Aug 6, 2025Updated 10 months ago
- Official implementation of BLIP3o-Series☆1,658Nov 29, 2025Updated 6 months ago
- [🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …☆688Feb 27, 2026Updated 3 months ago
- Code and data for UniEgoMotion (ICCV 2025)☆57Apr 18, 2026Updated 2 months ago