aim-uofa/Diception

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/aim-uofa/Diception)

aim-uofa / Diception

[NeurIPS 2025 Spotlight] A Generalist Diffusion Model for Vision Perception

☆318

Alternatives and similar repositories for Diception

Users that are interested in Diception are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

aim-uofa / Active-o3
View on GitHub
[ICML2026] ACTIVE-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO
☆83Apr 30, 2026Updated 2 months ago
aim-uofa / dLLM-MidTruth
View on GitHub
[ICLR'26] Official PyTorch implementation of "Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models".
☆66Mar 5, 2026Updated 4 months ago
aim-uofa / TVRBench
View on GitHub
TVRBench: Target Viewpoint Reproduction Benchmark for Active Spatial Intelligence
☆25Jun 2, 2026Updated last month
aim-uofa / GSI-Bench
View on GitHub
[CVPR2026] Exploring Spatial Intelligence from a Generative Perspective
☆30Jun 3, 2026Updated last month
aim-uofa / Tinker
View on GitHub
One-shot and Few-shot 3D Editing without Per-Scene Optimization
☆175Aug 21, 2025Updated 11 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
aim-uofa / Omni-R1
View on GitHub
[NeurIPS 2025] Official Repo of Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration
☆126Dec 3, 2025Updated 7 months ago
aim-uofa / OmniJigsaw
View on GitHub
☆34Apr 10, 2026Updated 3 months ago
aim-uofa / STAIR
View on GitHub
☆18Jun 13, 2026Updated last month
aim-uofa / SINE
View on GitHub
[NeurIPS'24] A Simple Image Segmentation Framework via In-Context Examples
☆68Oct 29, 2024Updated last year
aim-uofa / GenPercept
View on GitHub
[ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models
☆229Jan 24, 2025Updated last year
aim-uofa / StaMo
View on GitHub
Unsupervised Learning of Generalizable Robot Motion from Compact State Representation
☆40Jun 10, 2026Updated last month
aim-uofa / DiffewS
View on GitHub
[NeurIPS'24] Unleashing the Potential of the Diffusion Model in Few-shot Semantic Segmentation (Diffews)
☆51Apr 14, 2025Updated last year
aim-uofa / AGILE
View on GitHub
☆46May 6, 2026Updated 2 months ago
aim-uofa / COSINE
View on GitHub
[ICCV'25] Unified Open-World Segmentation with Multi-Modal Prompts
☆16Jun 16, 2026Updated last month
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
aim-uofa / EvoTokenDLM
View on GitHub
[ACL'26] EvoToken-DLM (Beyond Hard Masks: Progressive Token Evolution for Diffusion Language)
☆48Apr 7, 2026Updated 3 months ago
nupurkmr9 / syncd
View on GitHub
SynCD: Generating Multi-Image Synthetic Data for Text-to-Image Customization (ICCV 2025)
☆155May 24, 2026Updated last month
aim-uofa / PM-Loss
View on GitHub
[3DV 2026] Revisiting Depth Representations for Feed-Forward 3D Gaussian Splatting
☆162Dec 9, 2025Updated 7 months ago
KupynOrest / s3od
View on GitHub
[ICLR 2026] Official repo for S3OD: Towards Generalizable Salient Object Detection with Synthetic Data
☆43Jun 3, 2026Updated last month
microsoft / art-msra
View on GitHub
[CVPR 2025] Official repo for ART:Anonymous Region Transformer for Variable Multi-Layer Transparent Image Generation
☆373Apr 8, 2026Updated 3 months ago
aim-uofa / VFN
View on GitHub
[ICLR 2024 Spotlight] The official repo for the paper "De novo Protein Design using Geometric Vector Field Networks".
☆31Aug 23, 2024Updated last year
aim-uofa / BA-DDG
View on GitHub
[ICLR 2025 Spotlight] Boltzmann-Aligned Inverse Folding Model as a Predictor of Mutational Effects on Protein-Protein Interactions
☆45Mar 10, 2025Updated last year
SihuiJi / LayerFlow
View on GitHub
[SIGGRAGH'25] Official repository of LayerFlow: A Unified Model for Layer-aware Video Generation
☆95Aug 18, 2025Updated 11 months ago
baaivision / URSA
View on GitHub
[ICLR 2026] 🐻 Uniform Discrete Diffusion with Metric Path for Video Generation
☆123May 20, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
aim-uofa / VLModel
View on GitHub
Repo of HawkLlama.
☆16Jan 2, 2025Updated last year
lehduong / OneDiffusion
View on GitHub
Official implementation of OneDiffusion paper (CVPR 2025)
☆662Dec 14, 2024Updated last year
aim-uofa / SegAgent
View on GitHub
[CVPR2025] SegAgent: Exploring Pixel Understanding Capabilities in MLLMs by Imitating Human Annotator Trajectories
☆106Aug 8, 2025Updated 11 months ago
aim-uofa / Framer
View on GitHub
[ICLR'25] Official PyTorch implementation of "Framer: Interactive Frame Interpolation".
☆498Jan 9, 2025Updated last year
EnVision-Research / Lotus
View on GitHub
Official implementation of "Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction"
☆812Nov 28, 2025Updated 7 months ago
HKU-MMLab / Macro
View on GitHub
The official repo of "MACRO: Advancing Multi-Reference Image Generation with Structured Long-Context Data"
☆66Mar 27, 2026Updated 3 months ago
naver-ai / ZIM
View on GitHub
[ICCV 2025, Highlight] ZIM: Zero-Shot Image Matting for Anything
☆418Aug 28, 2025Updated 10 months ago
bcmi / Light-A-Video
View on GitHub
[ICCV 2025] Light-A-Video: Training-free Video Relighting via Progressive Light Fusion
☆517Oct 25, 2025Updated 8 months ago
aim-uofa / FADiff
View on GitHub
[ICML 2024] Floating Anchor Diffusion Model for Multi-motif Scaffolding
☆34Aug 23, 2024Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
aim-uofa / MovieDreamer
View on GitHub
[ICLR'25] MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequences
☆323Aug 10, 2024Updated last year
dgcnz / edge
View on GitHub
Training, optimization and deployment of Object Detection model with dinov2 backbone for efficient inference on NVIDIA Jetson
☆14Jul 26, 2025Updated 11 months ago
yangzhou24 / OmniWorld
View on GitHub
[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
☆485Apr 16, 2026Updated 3 months ago
KlingAIResearch / ReCamMaster
View on GitHub
[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
☆1,830Nov 28, 2025Updated 7 months ago
VisualComputingInstitute / diffusion-e2e-ft
View on GitHub
[WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think
☆519Jul 9, 2026Updated last week
YisuiTT / Mobius
View on GitHub
Mobius: Text to Seamless Looping Video Generation via Latent Shift
☆178May 8, 2025Updated last year
Yaoyaolingbro / notebook
View on GitHub
☆20Mar 4, 2025Updated last year