☆52Dec 13, 2024Updated last year
Alternatives and similar repositories for Owl
Users that are interested in Owl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆33Jul 5, 2024Updated last year
- VideoAuteur: Towards Long Narrative Video Generation☆43Oct 22, 2025Updated 5 months ago
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025☆30Apr 8, 2025Updated last year
- ☆645May 24, 2024Updated last year
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆90May 12, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ☆214Feb 11, 2025Updated last year
- DreamCinema: Cinematic Transfer with Free Camera and 3D Character☆95Jun 13, 2025Updated 9 months ago
- A Video Tokenizer Evaluation Dataset☆153Jan 13, 2025Updated last year
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆113Jan 21, 2025Updated last year
- [NeurIPS 2025] Scene as Superquadrics for 3D Semantic Occupancy Prediction☆66Jul 13, 2025Updated 8 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆86Dec 5, 2024Updated last year
- ☆10Apr 7, 2025Updated last year
- Empowering Unified MLLM with Multi-granular Visual Generation☆130Jan 16, 2025Updated last year
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆27Nov 2, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [CVPR 2025] Gaussian World Model for Streaming 3D Occupancy Prediction☆144Dec 4, 2025Updated 4 months ago
- a family of versatile and state-of-the-art video tokenizers.☆442Sep 1, 2025Updated 7 months ago
- AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers☆157Sep 16, 2025Updated 6 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆312Mar 12, 2025Updated last year
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Oct 9, 2025Updated 6 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆642Oct 29, 2025Updated 5 months ago
- UniCon: A Simple Approach to Unifying Diffusion-based Conditional Generation (ICLR 2025)☆38Jun 21, 2025Updated 9 months ago
- Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"☆33Oct 12, 2024Updated last year
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Mar 31, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Jul 10, 2024Updated last year
- ☆19Nov 18, 2024Updated last year
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.☆200Jul 22, 2024Updated last year
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆649Jul 1, 2025Updated 9 months ago
- Out-of-Distribution Semantic Occupancy Prediction☆21Oct 22, 2025Updated 5 months ago
- [AAAI26] Next Patch Prediction☆132Jan 2, 2025Updated last year
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving☆55Feb 14, 2025Updated last year
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers"☆171Nov 5, 2024Updated last year
- [ICLR 2025] OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework☆59Mar 18, 2026Updated 3 weeks ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆353Feb 21, 2026Updated last month
- ☆21Jan 17, 2025Updated last year
- Official Implementation of wd1☆25Sep 25, 2025Updated 6 months ago
- ☆41Jun 9, 2025Updated 10 months ago
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆191Aug 4, 2024Updated last year
- The official code of "Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling"☆51Feb 26, 2026Updated last month
- ☆387Jun 6, 2024Updated last year