google-deepmind / wyd-benchmark
☆26Updated 2 months ago
Alternatives and similar repositories for wyd-benchmark
Users that are interested in wyd-benchmark are comparing it to the libraries listed below
Sorting:
- A list of works on video generation towards world model☆58Updated this week
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆20Updated last month
- Official Implementation of "Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling"☆15Updated last year
- Web page for "🍅HumanTOMATO: Text-aligned Whole-body Motion Generation".☆14Updated 11 months ago
- Official repo of "Barbie: Text to Barbie-Style 3D Avatars“☆25Updated 5 months ago
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556)☆43Updated 3 months ago
- ☆11Updated 7 months ago
- Plan, Posture and Go: Towards Open-World Text-to-Motion Generation☆41Updated 5 months ago
- Unofficial Implementation of "Stable Video Diffusion Multi-View"☆79Updated last year
- Official implementation of MTM☆21Updated last year
- MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations☆34Updated 7 months ago
- ☆26Updated 2 weeks ago
- [Neurips 2024] Video Diffusion Models are Training-free Motion Interpreter and Controller☆40Updated last month
- [arXiv'24] Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space☆43Updated 6 months ago
- ☆15Updated last month
- ☆33Updated this week
- ☆14Updated 2 years ago
- TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation☆30Updated 5 months ago
- Code for Paper 'Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach'☆17Updated 7 months ago
- Official implementation of "Reangle-A-Video: 4D Video Generation as Video-to-Video Translation"☆39Updated 2 months ago
- [CVPR 2025] Multi-focal Conditioned Latent Diffusion for Person Image Synthesis☆12Updated last month
- Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)☆67Updated 2 months ago
- Awesome-Text2Motion-Generation☆18Updated last year
- TORE: Token Reduction for Efficient Human Mesh Recovery with Transformer☆47Updated last year
- FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax☆18Updated last year
- ☆27Updated last year
- ☆10Updated 10 months ago
- ☆39Updated last year
- ☆43Updated last month
- Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation☆16Updated last year