gisilvs / covaeLinks
☆20Updated 3 months ago
Alternatives and similar repositories for covae
Users that are interested in covae are comparing it to the libraries listed below
Sorting:
- An ML research template with good documentation by Boyuan Chen, an MIT PhD student☆90Updated 7 months ago
- ☆149Updated 9 months ago
- ☆21Updated 11 months ago
- Implementation of Latent Diffusion Planning (Amber Xie, Oleh Rybkin, Dorsa Sadigh, Chelsea Finn)☆51Updated 4 months ago
- Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆85Updated 3 months ago
- Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give…☆158Updated 2 weeks ago
- [ICLR 2025 Spotlight] Grounding Video Models to Actions through Goal Conditioned Exploration☆58Updated 5 months ago
- [ICML 2024] Compositional Image Decomposition with Diffusion Models☆51Updated last year
- ☆18Updated last year
- Implementation of Danijar's latest iteration for his Dreamer line of work☆83Updated last week
- PyTorch implementation of Shortcut Models [Frans, 2025] with little modification☆57Updated 3 months ago
- [ICCV 2025] Official Implementation of Contrastive Flow Matching☆132Updated 4 months ago
- [Preprint] UCGM: Unified Continuous Generative Models☆168Updated 5 months ago
- Official PyTorch Implementation for Dual-Process Image Generation, ICCV 2025☆101Updated 2 months ago
- Official repo of BesiegeField, an interactive and real-time environment for machine construction and simulation (arXiv:2510.14980).☆48Updated this week
- [NeurIPS'25 Spotlight] Boosting Generative Image Modeling via Joint Image-Feature Synthesis☆82Updated last week
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Updated 6 months ago
- ☆54Updated 3 months ago
- ☆109Updated 2 months ago
- [ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models …☆73Updated 4 months ago
- [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆81Updated 3 months ago
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆253Updated this week
- UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…☆130Updated 6 months ago
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆166Updated 4 months ago
- ☆77Updated 5 months ago
- Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning☆47Updated 2 weeks ago
- [NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation☆27Updated last week
- [CVPR 2025 highlight] Generating 6DoF Object Manipulation Trajectories from Action Description in Egocentric Vision☆29Updated last month
- A unified robotic manipulation learning framework☆17Updated last month
- (NeurIPS 2025 D&B Track) OverLayBench: A Benchmark for Layout-to-Image Generation with Dense Overlaps☆20Updated last week