[NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perception
☆298Sep 21, 2025Updated 6 months ago
Alternatives and similar repositories for Diception
Users that are interested in Diception are comparing it to the libraries listed below
Sorting:
- [ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".☆63Mar 5, 2026Updated 2 weeks ago
- ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO☆79Nov 17, 2025Updated 4 months ago
- One-shot and Few-shot 3D Editing without Per-Scene Optimization☆168Aug 21, 2025Updated 7 months ago
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models☆220Jan 24, 2025Updated last year
- [NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples☆65Oct 29, 2024Updated last year
- [NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration☆116Dec 3, 2025Updated 3 months ago
- [NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)☆49Apr 14, 2025Updated 11 months ago
- SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization (ICCV 2025)☆154Oct 16, 2025Updated 5 months ago
- [CVPR 2025] Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation☆367Feb 5, 2026Updated last month
- [ICLR 2024 Spotlight] The official repo for the paper "De novo Protein Design using Geometric Vector Field Networks".☆30Aug 23, 2024Updated last year
- SurfaceSplat: Connecting Surface Reconstruction and Gaussian Splatting☆57Jul 21, 2025Updated 8 months ago
- [ICLR 2025 Spotlight] Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions☆44Mar 10, 2025Updated last year
- Repo of HawkLlama.☆16Jan 2, 2025Updated last year
- Official implementation of OneDiffusion paper (CVPR 2025)☆665Dec 14, 2024Updated last year
- Official implementation of Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction☆788Nov 28, 2025Updated 3 months ago
- [ICLR'25] Official PyTorch implementation of "Framer: Interactive Frame Interpolation".☆502Jan 9, 2025Updated last year
- [3DV 2026] Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting☆158Dec 9, 2025Updated 3 months ago
- [SIGGRAGH'25] Official repository of LayerFlow: A Unified Model for Layer-aware Video Generation☆88Aug 18, 2025Updated 7 months ago
- [ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion☆507Oct 25, 2025Updated 4 months ago
- [ICML 2024] Floating Anchor Diffusion Model for Multi-motif Scaffolding☆31Aug 23, 2024Updated last year
- [ICCV 2025, Highlight] ZIM: Zero-Shot Image Matting for Anything☆405Aug 28, 2025Updated 6 months ago
- [ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences☆322Aug 10, 2024Updated last year
- Mobius: Text to Seamless Looping Video Generation via Latent Shift☆173May 8, 2025Updated 10 months ago
- Training, optimization and deployment of Object Detection model with dinov2 backbone for efficient inference on NVIDIA Jetson☆13Jul 26, 2025Updated 7 months ago
- ☆20Mar 4, 2025Updated last year
- [ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video☆1,767Nov 28, 2025Updated 3 months ago
- [CVPR 2026] High-Precision Dichotomous Image Segmentation via Depth Integrity-Prior and Fine-Grained Patch Strategy☆34Mar 7, 2026Updated 2 weeks ago
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think☆504Jan 26, 2026Updated last month
- [ICLR 2026] 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation☆107Feb 6, 2026Updated last month
- [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion☆297Jul 15, 2025Updated 8 months ago
- [ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"☆88Aug 23, 2024Updated last year
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆510Dec 4, 2024Updated last year
- Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"☆614Jan 17, 2026Updated 2 months ago
- ☆70Oct 19, 2023Updated 2 years ago
- [CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories☆94Aug 8, 2025Updated 7 months ago
- Track4World: Feedforward World-centric Dense 3D Tracking of All Pixels☆174Mar 11, 2026Updated last week
- [SIGGRAPH2025] Generative Video Matting☆71Aug 12, 2025Updated 7 months ago
- 3D In-the-Wild Human Dataset Generation with Diffusion Models☆48Apr 3, 2024Updated last year
- Pippo: High-Resolution Multi-View Humans from a Single Image☆634Apr 4, 2025Updated 11 months ago