showlab / Exo2Ego-V
☆40 Updated 3 weeks ago
Alternatives and similar repositories for Exo2Ego-V:
Users who are interested in Exo2Ego-V are comparing it to the repositories listed below.
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization ☆11 Updated last week
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556) ☆39 Updated 2 months ago
- FQGAN: Factorized Visual Tokenization and Generation ☆47 Updated 3 weeks ago
- [ECCV 2024, Oral, Best Paper Finalist] This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation … ☆37 Updated last month
- Official code for MotionBench (CVPR 2025) ☆34 Updated last month
- Code for our paper: Learning Camera Movement Control from Real-World Drone Videos ☆27 Updated this week
- A comprehensive list of papers investigating physical cognition in video generation, including papers, codes, and related websites. ☆62 Updated this week
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024 ☆51 Updated last year
- HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video ☆67 Updated last year
- ☆21 Updated 11 months ago
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025) ☆66 Updated last month
- VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation ☆17 Updated last month
- ☆19 Updated this week
- [ECCV 2024] Official Implementation of DragAPart: Learning a Part-Level Motion Prior for Articulated Objects. ☆80 Updated 8 months ago
- MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations ☆33 Updated 6 months ago
- FleVRS: Towards Flexible Visual Relationship Segmentation, NeurIPS 2024 ☆20 Updated 4 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆58 Updated 6 months ago
- A framework named B^2-DiffuRL for RL-based diffusion model fine-tuning. ☆24 Updated 3 weeks ago
- Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion ☆39 Updated 8 months ago
- Open-source video dataset with dynamic scenes and camera movement annotations ☆48 Updated this week
- Video Generation, Physical Commonsense, Semantic Adherence, VideoCon-Physics ☆89 Updated last month
- [arXiv'24] Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space ☆41 Updated 5 months ago
- The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery" ☆23 Updated 8 months ago
- Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation ☆64 Updated last week
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation ☆24 Updated 6 months ago
- Official Implementation of "Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling" ☆15 Updated last year
- Official code for "Amodal Completion via Progressive Mixed Context Diffusion" [CVPR 2024 Highlight] ☆44 Updated 8 months ago
- [arXiv: 2502.05178] QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation ☆69 Updated last month
- EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation ☆102 Updated 5 months ago
- [ICLR'25] AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation ☆60 Updated last month