[NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perception
☆295Sep 21, 2025Updated 5 months ago
Alternatives and similar repositories for Diception
Users who are interested in Diception are comparing it to the libraries listed below
- ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO☆78Nov 17, 2025Updated 3 months ago
- One-shot and Few-shot 3D Editing without Per-Scene Optimization☆164Aug 21, 2025Updated 6 months ago
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models☆219Jan 24, 2025Updated last year
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)☆48Apr 14, 2025Updated 10 months ago
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples☆65Oct 29, 2024Updated last year
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆62Feb 6, 2026Updated 3 weeks ago
- [NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration☆114Dec 3, 2025Updated 2 months ago
- SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization (ICCV 2025)☆153Oct 16, 2025Updated 4 months ago
- [ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion☆505Oct 25, 2025Updated 4 months ago
- [CVPR 2025] Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation☆365Feb 5, 2026Updated 3 weeks ago
- [SIGGRAPH'25] Official repository of LayerFlow: A Unified Model for Layer-aware Video Generation☆86Aug 18, 2025Updated 6 months ago
- [ICLR 2025 Spotlight] Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions☆43Mar 10, 2025Updated 11 months ago
- Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction☆783Nov 28, 2025Updated 3 months ago
- [ICLR 2024 Spotlight] The official repo for the paper "De novo Protein Design using Geometric Vector Field Networks".☆30Aug 23, 2024Updated last year
- Pippo: High-Resolution Multi-View Humans from a Single Image☆632Apr 4, 2025Updated 10 months ago
- [ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video☆1,748Nov 28, 2025Updated 3 months ago
- [CVPR 2025] Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion☆43Mar 21, 2025Updated 11 months ago
- [3DV 2026] Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting☆158Dec 9, 2025Updated 2 months ago
- Official implementation of OneDiffusion paper (CVPR 2025)☆664Dec 14, 2024Updated last year
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think☆499Jan 26, 2026Updated last month
- [ICCV 2025, Highlight] ZIM: Zero-Shot Image Matting for Anything☆400Aug 28, 2025Updated 6 months ago
- [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion☆296Jul 15, 2025Updated 7 months ago
- Repo of HawkLlama.☆16Jan 2, 2025Updated last year
- Mobius: Text to Seamless Looping Video Generation via Latent Shift☆174May 8, 2025Updated 9 months ago
- ☆70Oct 19, 2023Updated 2 years ago
- 3D In-the-Wild Human Dataset Generation with Diffusion Models☆48Apr 3, 2024Updated last year
- [ICLR'25] Official PyTorch implementation of "Framer: Interactive Frame Interpolation".☆502Jan 9, 2025Updated last year
- Unleashing Vecset Diffusion Model for Fast Shape Generation / within 1 Second (ICCV'25 Highlight)☆312Oct 7, 2025Updated 4 months ago
- [CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation☆3,086Dec 10, 2025Updated 2 months ago
- SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting☆56Jul 21, 2025Updated 7 months ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling☆573Oct 26, 2025Updated 4 months ago
- Official implementation of LaVin-DiT☆53Jan 27, 2025Updated last year
- SkyReels-A2: Compose anything in video diffusion transformers☆704Jun 3, 2025Updated 8 months ago
- High-resolution models for human tasks.☆5,291Nov 18, 2024Updated last year
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"☆604Jan 17, 2026Updated last month
- Training, optimization and deployment of Object Detection model with dinov2 backbone for efficient inference on NVIDIA Jetson☆13Jul 26, 2025Updated 7 months ago
- [SIGGRAPH 2025] Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control☆807Jun 9, 2025Updated 8 months ago
- Official implementation of BLIP3o-Series☆1,637Nov 29, 2025Updated 3 months ago
- [SIGGRAPH Asia 2025] InfiniHuman: Infinite 3D Human Creation with Precise Control☆84Oct 14, 2025Updated 4 months ago