☆99Feb 4, 2026Updated 2 months ago
Alternatives and similar repositories for VAREdit
Users that are interested in VAREdit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository is an offical PyTorch implementation of SD-GAN: Semantic Decomposition for Face Image Synthesis with Discrete Attribute.☆13Mar 18, 2024Updated 2 years ago
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202…☆34Mar 10, 2026Updated last month
- Learning from Next-Frame Prediction: Autoregressive Video Modeling Encodes Effective Representations☆22Dec 24, 2025Updated 3 months ago
- [ICCV 2025] LightsOut: Diffusion-based Outpainting for Enhanced Lens Flare Removal☆27Oct 20, 2025Updated 5 months ago
- [ICLR 2026] A Unified Multi-Conditional Diffusion Transformer for Creative Graphic Design☆149Mar 12, 2026Updated last month
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PICABench: How Far Are We from Physically Realistic Image Editing?☆36Nov 5, 2025Updated 5 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing☆164Jun 26, 2025Updated 9 months ago
- [AAAI 2025] Explore In-Context Segmentation via Latent Diffusion Models☆22Mar 25, 2025Updated last year
- Official implementation of CVPR 2025 paper "ID-Patch: Robust ID Association for Group Photo Personalization". This work proposed propose …☆77Apr 29, 2025Updated 11 months ago
- ☆115Apr 25, 2025Updated 11 months ago
- ☆34Mar 18, 2025Updated last year
- Jittor挑战赛,骨骼绑定赛题☆15Oct 9, 2025Updated 6 months ago
- FLM-Audio is a audio-language subversion of RoboEgo/FLM-Ego -- an omnimodal model with native full duplexity.☆66Dec 9, 2025Updated 4 months ago
- Official implementation for "Single Image Reflection Separation via Interactive Dual-Stream Transformers"☆24Dec 24, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ArcFlow: Unleashing 2-Step Text-to-Image Generation via High-Precision Non-Linear Flow Distillation☆117Feb 17, 2026Updated 2 months ago
- BitDance custom nodes for ComfyUI with unified loader, text encode, sampler, and VAE nodes.☆33Feb 26, 2026Updated last month
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Aug 12, 2025Updated 8 months ago
- A ComfyUI custom node implementation of TwinFlow: Realizing One-step Generation on Large Models with Self-adversarial Flows.☆43Mar 6, 2026Updated last month
- Codes accompanying the paper "Toward Guidance-Free AR Visual Generation via Condition Contrastive Alignment"☆36Feb 11, 2025Updated last year
- Code release for AccDiffusionV2 (TPAMI)☆34Nov 4, 2025Updated 5 months ago
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆116Sep 27, 2025Updated 6 months ago
- RepText: Rendering Visual Text via Replicating 🔥☆140Jun 7, 2025Updated 10 months ago
- A Python implementation of an agent swarm system that works with local LLM servers. The system allows you to create multiple agents that …☆13Nov 20, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆22Nov 22, 2024Updated last year
- Official Implementation for "ReMOVE: A Reference-free Metric for Object Erasure"☆25Apr 30, 2024Updated last year
- Official Repo For AAAI 2026 Accepted Paper "Rethinking the Spatio-Temporal Alignment of End-to-End 3D Perception"☆30Mar 25, 2026Updated 3 weeks ago
- [ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench…☆92Jan 26, 2026Updated 2 months ago
- TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation☆236Aug 18, 2025Updated 8 months ago
- [CVPR 2026 Highlight] GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering☆96Apr 9, 2026Updated last week
- 🚀 [ICLR 2026] SenseFlow: Scaling Distribution Matching for Flow-based Text-to-Image Distillation☆83Mar 14, 2026Updated last month
- ComfyMind: Toward General-Purpose Generation via Tree-Based Planning and Reactive Feedback☆121Sep 20, 2025Updated 6 months ago
- Official implementation of BLIP3o-Series☆1,648Nov 29, 2025Updated 4 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [🚀 ICLR 2026 Oral] NextStep-1: SOTA Autogressive Image Generation with Continuous Tokens. A research project developed by the StepFun’s …☆660Feb 27, 2026Updated last month
- Code and data for UniEgoMotion (ICCV 2025)☆48Nov 11, 2025Updated 5 months ago
- ☆19Dec 8, 2024Updated last year
- ☆13Jul 28, 2024Updated last year
- Consistency Distillation with Target Timestep Selection and Decoupled Guidance☆105Jan 4, 2025Updated last year
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- Scalable DBSCAN and OPTICS for clustering high-dimensional datasets using random projections☆14Nov 1, 2024Updated last year