☆85Oct 10, 2025Updated 4 months ago
Alternatives and similar repositories for InstructX
Users that are interested in InstructX are comparing it to the libraries listed below
Sorting:
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- ☆42Jan 19, 2026Updated last month
- ☆13Mar 8, 2024Updated last year
- OpenVE-3M: A Large-Scale High-Quality Dataset for Instruction-Guided Video Editing☆38Jan 9, 2026Updated last month
- ☆14Jul 5, 2024Updated last year
- [ICLR 2026] IVEBench - Benchmark for Instruction-Guided Video Editing☆70Jan 28, 2026Updated last month
- [ICCV2025] VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation☆33Aug 18, 2025Updated 6 months ago
- ☆28Jun 9, 2022Updated 3 years ago
- ThinkGen: Generalized Thinking for Visual Generation☆51Dec 30, 2025Updated 2 months ago
- A free and open-source focus stacking software that supports multi-focus image alignment and fusion.☆19Feb 5, 2026Updated 3 weeks ago
- InstantUnify: Integrates Multimodal LLM into Diffusion Models 🔥☆40Aug 8, 2024Updated last year
- This is the official repository of CVPR 2025 Paper: Dynamic Motion Blending for Versatile Motion Editing.☆47Mar 29, 2025Updated 11 months ago
- [CVPR2026] One-Step Diffusion Transformer for Controllable Real-World Image Super-Resolution☆92Feb 21, 2026Updated last week
- DREAM: Diffusion Rectification and Estimation-Adaptive Models (CVPR 2024)☆41Feb 3, 2025Updated last year
- ☆80Jan 3, 2024Updated 2 years ago
- A simple demo project of cmake and google protocol buffer.☆10Dec 3, 2013Updated 12 years ago
- [ICLR 2026] SpatialLadder: Progressive Training for Spatial Reasoning in Vision-Language Models☆74Jan 29, 2026Updated last month
- OFER: Occluded Face Expression Reconstruction. A 3D face reconstruction method producing diverse plausible expressive faces from a single…☆14Jan 9, 2026Updated last month
- This is the official repository for the paper "Modeling Human Gaze Behavior with Diffusion Models for Unified Scanpath Prediction". ICCV …☆24Dec 4, 2025Updated 2 months ago
- ☆10Dec 8, 2025Updated 2 months ago
- ☆43Dec 1, 2025Updated 3 months ago
- [ISBI 2024] Official PyTorch implementation of Towards Cross-Domain Single Blood Cell Image Classification via Large-Scale LoRA-based Seg…☆11Aug 12, 2024Updated last year
- [arXiv'25] AnyCharV: Bootstrap Controllable Character Video Generation with Fine-to-Coarse Guidance☆41Feb 19, 2025Updated last year
- DreamStyle: A Unified Framework for Video Stylization☆109Jan 7, 2026Updated last month
- G-Buffer-Conditioned Diffusion for Neural Forward Frame Rendering.☆23Jan 31, 2026Updated last month
- CVPR 2025 Accepted Papers☆23Dec 20, 2025Updated 2 months ago
- Code of Rags2riches☆20May 26, 2025Updated 9 months ago
- codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients☆10May 27, 2021Updated 4 years ago
- Official project page for "From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing" (X-Dub).☆29Jan 31, 2026Updated last month
- ☆10Jul 13, 2024Updated last year
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆36Oct 29, 2025Updated 4 months ago
- 🕵️♂️🔊 Automatically update Audio Deepfake Detection (ADD) papers daily using GitHub Actions (updates every 12 hours)☆17Feb 13, 2026Updated 2 weeks ago
- [CVPR 2026] Official Implementation of "Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models".☆15Feb 23, 2026Updated last week
- Reinforcing Text-Rich Video Reasoning with Visual Rumination☆27Nov 24, 2025Updated 3 months ago
- This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms".☆12Jan 30, 2021Updated 5 years ago
- ☆14May 18, 2023Updated 2 years ago
- ComfyUI MiniMax Remover Node☆60Jun 24, 2025Updated 8 months ago
- The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder☆18Oct 19, 2025Updated 4 months ago
- ☆39Oct 29, 2025Updated 4 months ago