[NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation
☆33Oct 17, 2025Updated 4 months ago
Alternatives and similar repositories for ORIGEN
Users that are interested in ORIGEN are comparing it to the libraries listed below
Sorting:
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection☆22Feb 5, 2026Updated 3 weeks ago
- Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat…☆21Jun 24, 2025Updated 8 months ago
- ☆48Feb 9, 2026Updated 2 weeks ago
- code release for HouseCrafter (ICCV 2025 Highlight)☆68Oct 23, 2025Updated 4 months ago
- [NeurIPS 2024] Official Implementation of GrounDiT☆59Dec 12, 2024Updated last year
- [NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing☆72Oct 12, 2025Updated 4 months ago
- [EMNLP 2025 Findings] 3D-Aware Vision-Language Models Fine-Tuning with Geometric Distillation☆31Jun 12, 2025Updated 8 months ago
- Training recipe for SpatialReasoner☆38Sep 21, 2025Updated 5 months ago
- Official implementation of SyncTweedies: A General Generative Framework Based on Synchronized Diffusions (NeurIPS 2024)☆70Aug 4, 2024Updated last year
- Rare-to-Frequent (R2F), ICLR'25, Spotlight☆53Apr 23, 2025Updated 10 months ago
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆50Mar 20, 2025Updated 11 months ago
- Implementation for "Text2Control3D: Controllable 3D Avatar Generation in Neural Radiance Fields using Geometry-Guided Text-to-Image Diffu…☆13Sep 8, 2023Updated 2 years ago
- ☆14Sep 11, 2025Updated 5 months ago
- Official implementation of PartSTAD: 2D-to-3D Part Segmentation Task Adaptation (ECCV 2024).☆55Nov 7, 2024Updated last year
- ☆33Aug 9, 2024Updated last year
- official code for "3D Question Answering via only 2D Vision-Language Models"☆23Jan 15, 2026Updated last month
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆16Apr 18, 2024Updated last year
- ComfyUI node for modular, human‑like Kani TTS. Generate natural, high‑quality speech from text☆38Oct 17, 2025Updated 4 months ago
- Official Implementation of Nabla-GFlowNet (ICLR 2025)☆28May 3, 2025Updated 9 months ago
- [CVPR2025] Official repository for "VideoGuide: Improving Video Diffusion Models without Training Through a Teacher's Guide"☆28May 27, 2025Updated 9 months ago
- [CVPR 2024] "Taming Mode Collapse in Score Distillation for Text-to-3D Generation" by Peihao Wang, Dejia Xu, Zhiwen Fan, Dilin Wang, Srey…☆51Feb 2, 2024Updated 2 years ago
- Official implementation of the paper "Smoothed Energy Guidance: Guiding Diffusion Models with Reduced Energy Curvature of Attention" (Neu…☆136Oct 3, 2024Updated last year
- [ICCV'25] ScenePainter: Semantically Consistent Perpetual 3D Scene Generation with Concept Relation Alignment☆36Oct 5, 2025Updated 4 months ago
- ☆17Jul 30, 2024Updated last year
- Official repository for the paper "Orientation Matters: Making 3D Generative Models Orientation-Aligned" (NeurIPS 2025)☆113Nov 27, 2025Updated 3 months ago
- ☆93Sep 22, 2024Updated last year
- animatediff prompt travel☆19Jan 27, 2024Updated 2 years ago
- ☆18Oct 21, 2024Updated last year
- Dungeon procedural generator similar to whatabou's "One Page Dungeon"☆48Jan 4, 2026Updated last month
- A precise and stable CFG for negative prompts, derived via guided sampling with contrastive loss.☆14Dec 27, 2024Updated last year
- ☆21Nov 21, 2024Updated last year
- Official PyTorch Implementation of "Minority-Focused Text-to-Image Generation via Prompt Optimization" (CVPR 2025 Oral)☆27Apr 8, 2025Updated 10 months ago
- [ICLR 2025 Oral] Official Implementation for "Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Un…☆21Oct 24, 2024Updated last year
- A ComfyUI plugin that provides a user interface of StableStudio☆22Aug 15, 2025Updated 6 months ago
- [SIGGRAPH Asia 2024] TrailBlazer: Trajectory Control for Diffusion-Based Video Generation☆100May 31, 2024Updated last year
- Environment light tools.☆68Jan 19, 2024Updated 2 years ago
- ControlNet control image preprocess library☆15Feb 27, 2023Updated 3 years ago
- [NeurIPS'25] Official implementation of "Emergent Temporal Correspondences from Video Diffusion Models"☆96Dec 3, 2025Updated 2 months ago
- ☆16Apr 23, 2024Updated last year