PlayerOne: Egocentric World Simulator
☆185 Jun 12, 2025 Updated 8 months ago
Alternatives and similar repositories for PlayerOne
Users that are interested in PlayerOne are comparing it to the libraries listed below
- Official Implementation of DRA-Ctrl (Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis) ☆118 Aug 15, 2025 Updated 6 months ago
- [SIGGRAPH'25] Official repository of LayerFlow: A Unified Model for Layer-aware Video Generation ☆86 Aug 18, 2025 Updated 6 months ago
- DanceTogether! Identity-Preserving Multi-Person Interactive Video Generation ☆39 Aug 3, 2025 Updated 7 months ago
- [AAAI 2026] UltraGen ☆77 Feb 1, 2026 Updated last month
- A unified framework for controllable caption generation across images, videos, and audio. Supports multi-modal inputs and customizable ca… ☆52 Jul 24, 2025 Updated 7 months ago
- ☆109 Sep 3, 2025 Updated 6 months ago
- [ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization". ☆172 Feb 4, 2026 Updated last month
- [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping ☆90 Nov 30, 2025 Updated 3 months ago
- [SIGGRAPH Asia 2025] InfiniHuman: Infinite 3D Human Creation with Precise Control ☆84 Oct 14, 2025 Updated 4 months ago
- PyTorch implementation of Self-Refining Video Sampling ☆146 Feb 6, 2026 Updated 3 weeks ago
- [ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation ☆367 Jul 4, 2025 Updated 8 months ago
- [ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory ☆417 Jul 25, 2025 Updated 7 months ago
- EmoCapCLIP: Learning Transferable Facial Emotion Representations from Large-Scale Semantically Rich Captions ☆20 Jul 29, 2025 Updated 7 months ago
- Official implementation of Nemesis: Normalizing the Soft-prompt Vectors of Vision-Language Models (ICLR 2024 Spotlight) ☆15 Dec 27, 2024 Updated last year
- [ICML 2025] LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models ☆17 Nov 4, 2025 Updated 4 months ago
- ☆37 May 9, 2025 Updated 9 months ago
- [ICLR 2026] Official repo for "Spotlight on Token Perception for Multimodal Reinforcement Learning" ☆49 Jan 30, 2026 Updated last month
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs ☆29 Dec 24, 2025 Updated 2 months ago
- Code for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems" ☆13 Dec 13, 2024 Updated last year
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058). ☆420 Aug 26, 2025 Updated 6 months ago
- [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Long… ☆321 Mar 30, 2025 Updated 11 months ago
- We achieve high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additi… ☆324 Feb 25, 2026 Updated last week
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025) ☆44 Nov 24, 2025 Updated 3 months ago
- This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation. ☆116 Nov 26, 2024 Updated last year
- CoV: Chain-of-View Prompting for Spatial Reasoning ☆51 Jan 23, 2026 Updated last month
- ☆16 May 13, 2025 Updated 9 months ago
- PyTorch implementation of "HERO: Human Reaction Generation from Videos (ICCV 2025)" ☆31 Jan 6, 2026 Updated last month
- Official code of "UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models" (WACV 2026) ☆36 Nov 24, 2025 Updated 3 months ago
- [3DV 2026 Oral] VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space ☆211 Nov 25, 2025 Updated 3 months ago
- ☆184 Jul 31, 2025 Updated 7 months ago
- Vision Bridge Transformer at Scale ☆139 Dec 1, 2025 Updated 3 months ago
- ☆141 Oct 15, 2025 Updated 4 months ago
- An End-to-End Model with Adaptive Filtering for Retrieval-Augmented Generation ☆16 Oct 27, 2024 Updated last year
- ☆23 Jul 20, 2025 Updated 7 months ago
- [NeurIPS 2025] Frame In-N-Out: Unbounded Controllable Image-to-Video Generation ☆31 Jan 5, 2026 Updated 2 months ago
- Unofficial implementation of MIMO (MImicking anyone anywhere with complex Motions and Object interactions) ☆10 Nov 22, 2024 Updated last year
- [ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints ☆680 May 23, 2025 Updated 9 months ago
- [CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step ☆346 Jul 4, 2025 Updated 8 months ago
- [CVPR 2026] Official PyTorch implementation of Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation ☆256 Feb 22, 2026 Updated last week