PlayerOne: Egocentric World Simulator
☆185Jun 12, 2025Updated 9 months ago
Alternatives and similar repositories for PlayerOne
Users that are interested in PlayerOne are comparing it to the libraries listed below.
- Official Implementation of DRA-Ctrl (Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis)☆118Aug 15, 2025Updated 7 months ago
- [SIGGRAPH'25] Official repository of LayerFlow: A Unified Model for Layer-aware Video Generation☆88Aug 18, 2025Updated 7 months ago
- ☆110Sep 3, 2025Updated 6 months ago
- [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping☆92Nov 30, 2025Updated 3 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation☆39Aug 3, 2025Updated 7 months ago
- PyTorch implementation of "HERO: Human Reaction Generation from Videos (ICCV 2025)"☆31Jan 6, 2026Updated 2 months ago
- DeepVerse: 4D Autoregressive Video Generation as a World Model☆219Aug 11, 2025Updated 7 months ago
- [ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory☆418Jul 25, 2025Updated 8 months ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".☆173Feb 4, 2026Updated last month
- We achieve high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additi…☆327Feb 25, 2026Updated last month
- [ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation☆367Jul 4, 2025Updated 8 months ago
- PyTorch implementation of Self-Refining Video Sampling☆153Feb 6, 2026Updated last month
- ☆40May 9, 2025Updated 10 months ago
- ☆187Jul 31, 2025Updated 7 months ago
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)☆45Nov 24, 2025Updated 4 months ago
- A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca…☆52Jul 24, 2025Updated 8 months ago
- [CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models☆52Feb 21, 2026Updated last month
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).☆423Aug 26, 2025Updated 6 months ago
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions☆20Jul 29, 2025Updated 7 months ago
- ☆141Oct 15, 2025Updated 5 months ago
- [ICLR 2026] Official Code for "The Quest for Generalizable Motion Generation: Data, Model, and Evaluation"☆83Mar 18, 2026Updated last week
- Official code repository for "Self-transcendence: Is External Feature Guidance Indispensable for Accelerating Diffusion Transformer Train…☆28Mar 17, 2026Updated last week
- Code2Worlds: Empowering Coding LLMs for 4D World Generation☆92Feb 26, 2026Updated 3 weeks ago
- The CODE of WaH-NeRF (ACM MM 23).☆11Aug 28, 2023Updated 2 years ago
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long…☆322Mar 30, 2025Updated 11 months ago
- [ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation☆115Oct 7, 2025Updated 5 months ago
- [3DV 2026 Oral] VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space☆218Nov 25, 2025Updated 4 months ago
- Segment Anything preprocessor for ControlNet inside Stable Diffusion WebUI☆16Jul 15, 2024Updated last year
- Vision Bridge Transformer at Scale☆139Dec 1, 2025Updated 3 months ago
- ☆40Dec 19, 2025Updated 3 months ago
- ObjCtrl-2.5D☆58Apr 2, 2025Updated 11 months ago
- [ICML 2025] VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models☆39Jun 14, 2025Updated 9 months ago
- VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control☆337Feb 26, 2026Updated 3 weeks ago
- [ICLR2026] Any-to-Bokeh is a novel one-step video bokeh framework that converts arbitrary input videos into temporally coherent, depth-aw…☆128Feb 4, 2026Updated last month
- [CVPR 2026] Official Implementations for Paper - HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives☆648Nov 26, 2025Updated 3 months ago
- [ICCV 2025] The official implementation for EgoM2P: Egocentric Multimodal Multitask Pretraining.☆35Jan 3, 2026Updated 2 months ago
- ReMoMask: Retrieval-Augmented Masked Motion Generation☆39Feb 14, 2026Updated last month
- Pusa: Thousands Timesteps Video Diffusion Model☆674Feb 13, 2026Updated last month
- Official repository for gathering data of Revisit Human-Scene Interaction via Space Occupancy (ECCV 2024).☆28Sep 29, 2024Updated last year