Araachie / yodaLinks
Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation. In AAAI, 2024.
☆9Updated 5 months ago
Alternatives and similar repositories for yoda
Users that are interested in yoda are comparing it to the libraries listed below
Sorting:
- ☆18Updated last year
- ☆21Updated 8 months ago
- [ICLR 2024] Seer: Language Instructed Video Prediction with Latent Diffusion Models☆34Updated last year
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration☆50Updated 2 months ago
- ☆19Updated last year
- Code for Stable Control Representations☆25Updated 3 months ago
- Official Implementation of Nabla-GFlowNet (ICLR 2025)☆24Updated 2 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆50Updated last year
- Official implementation of "Self-Improving Video Generation"☆67Updated 2 months ago
- [CVPR 2025] Parallel Sequence Modeling via Generalized Spatial Propagation Network☆100Updated 3 weeks ago
- ☆26Updated 3 weeks ago
- ☆48Updated 4 months ago
- [TMLR 2025] The official repository of the paper "Unsupervised Discovery of Object-Centric Neural Fields"☆17Updated 5 months ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆21Updated 3 months ago
- Code of the paper "FreePCA:Integrating Consistency Information across Long-short Frames in Training-free Long Video Generation via Princi…☆24Updated last month
- Physics-based Zero-Shot Video Generation☆28Updated 9 months ago
- UniSkill: Imitating Human Videos via Cross-Embodiment Skill Representations☆50Updated 2 months ago
- Agent-to-Sim Learning Interactive Behavior from Casual Videos.☆44Updated 9 months ago
- VQVAE for video prediction☆27Updated 3 years ago
- ☆11Updated 2 months ago
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆23Updated last month
- Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces☆75Updated last month
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆64Updated last month
- ☆76Updated 10 months ago
- Implementation of Latent Diffusion Planning (Amber Xie, Oleh Rybkin, Dorsa Sadigh, Chelsea Finn)☆38Updated 2 weeks ago
- KMM: Key Frame Mask Mamba for Extended Motion Generation☆15Updated 3 months ago
- ☆22Updated 5 months ago
- ☆38Updated 9 months ago
- the official repo for "D-AR: Diffusion via Autoregressive Models"☆106Updated 3 weeks ago
- [ICML 2024] A Touch, Vision, and Language Dataset for Multimodal Alignment☆79Updated last month