[ICML 2026] Orienting Latent Actions for Video World Modeling
☆108Apr 20, 2026Updated last month
Alternatives and similar repositories for Olaf-World
Users that are interested in Olaf-World are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2024] EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models.☆52Oct 14, 2024Updated last year
- [CVPR 2026] Official pytorch implementation of "ReDirector: Creating Any-Length Video Retakes with Rotary Camera Encoding"☆30Dec 17, 2025Updated 5 months ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆35Jun 7, 2026Updated last week
- Official code release for the PVSM paper: "From Rays to Projections: Better Inputs for Feed-Forward View Synthesis"☆50Jan 9, 2026Updated 5 months ago
- [CVPR 2026] An official implementation of Adv-GRPO. The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image…☆84Feb 26, 2026Updated 3 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [ICCV2025] Training-Free Diffusion Models for Geometric Image Editing☆37Jan 13, 2026Updated 5 months ago
- [NeurIPS 2025] Frame In-N-Out: Unbounded Controllable Image-to-Video Generation☆33May 1, 2026Updated last month
- [CVPR 2026] ShowUI-π: Flow-based Generative Models as GUI Dexterous Hands☆124Apr 22, 2026Updated last month
- [ICCV 2025] VLM4D: Towards Spatiotemporal Awareness in Vision Language Models☆49Nov 20, 2025Updated 6 months ago
- Officail Implementation for "Unified Diffusion-Based Rigid and Non-Rigid Editing with Text and Image Guidance"☆19Jan 25, 2024Updated 2 years ago
- Official implementation of AnchorWeave: World-Consistent Video Generation with Retrieved Local Spatial Memories☆92Feb 17, 2026Updated 3 months ago
- [CVPR 2026] Official Implementation of Edit2Perceive☆44Feb 21, 2026Updated 3 months ago
- Official Implementation of Posterior Distillation Sampling☆94Jul 7, 2025Updated 11 months ago
- Code for: "Long-Context Autoregressive Video Modeling with Next-Frame Prediction"☆308Apr 23, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆66May 25, 2026Updated 3 weeks ago
- [ICCV 2025] Official implementation of "Anchor Token Matching: Implicit Structure Locking for Training-free AR Image Editing"☆28Apr 15, 2025Updated last year
- ☆11Jan 16, 2025Updated last year
- Implementation of Prompting with the Future: Open-World Model Predictive Control with Interactive Digital Twins. [RSS 2025]☆54Oct 21, 2025Updated 7 months ago
- ☆16Mar 7, 2025Updated last year
- [NeurIPS 2025 D&B Track] MLR-Bench: Evaluating AI Agents on Open-Ended Machine Learning Research☆30May 8, 2026Updated last month
- A light-weight, Eigen-based C++ library for trajectory optimization for legged robots.☆13Feb 23, 2021Updated 5 years ago
- implementing the algorithm of fast llf into python☆13Feb 28, 2020Updated 6 years ago
- [ECCV 2024] DGE: Direct Gaussian 3D Editing by Consistent Multi-view Editing☆130Jul 19, 2025Updated 10 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [SIGGRAPH ASIA 2025] PyTorch implementation of "Inbetweening from Two Single-View Images to 4D Generation"☆16Sep 24, 2025Updated 8 months ago
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆367Feb 21, 2026Updated 3 months ago
- DeepVerse: 4D Autoregressive Video Generation as a World Model☆231Aug 11, 2025Updated 10 months ago
- [ICLR2025] Official Implementation of "AniSDF: Fused-Granularity Neural Surfaces with Anisotropic Encoding for High-Fidelity 3D Reconstru…☆89Jun 3, 2025Updated last year
- Code for [ICCV 2025] Efficient Physics Simulation for 3D Scenes via MLLM-Guided Gaussian Splatting☆17Feb 28, 2026Updated 3 months ago
- Official Repository of **CaPa**: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation☆165Aug 18, 2025Updated 9 months ago
- ☆17Apr 21, 2026Updated last month
- Code for [CVPR 2025] ROICtrl: Boosting Instance Control for Visual Generation☆111Apr 16, 2025Updated last year
- Accelerating SDF gradient computation in NeuS-like multi-view reconstruction with directional finite difference (DFD) and patch-based sam…☆34Mar 24, 2024Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [SIGGRAPH ASIA 2024] Frankenstein: Generating Semantic-Compositional 3D Scenes in One Tri-Plane☆20Nov 25, 2024Updated last year
- Make 2DGS Great Again!☆63Nov 11, 2024Updated last year
- The official implementation of StereoPilot☆114Dec 19, 2025Updated 5 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆44Aug 9, 2025Updated 10 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model☆25Mar 13, 2025Updated last year
- Official Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training☆190May 5, 2026Updated last month
- Official Implementation of Rethinking Score Distillation as a Bridge Between Image Distributions☆85Mar 26, 2025Updated last year