☆53Dec 13, 2024Updated last year
Alternatives and similar repositories for Owl
Users that are interested in Owl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆33Jul 5, 2024Updated last year
- VideoAuteur: Towards Long Narrative Video Generation☆44Oct 22, 2025Updated 7 months ago
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025☆31Apr 8, 2025Updated last year
- ☆654May 24, 2024Updated 2 years ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151☆90May 12, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆216Feb 11, 2025Updated last year
- DreamCinema: Cinematic Transfer with Free Camera and 3D Character☆96Jun 13, 2025Updated 11 months ago
- A Video Tokenizer Evaluation Dataset☆159Jan 13, 2025Updated last year
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆113Jan 21, 2025Updated last year
- [NeurIPS 2025] Scene as Superquadrics for 3D Semantic Occupancy Prediction☆71Jul 13, 2025Updated 10 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆86Dec 5, 2024Updated last year
- ☆10Apr 7, 2025Updated last year
- Empowering Unified MLLM with Multi-granular Visual Generation☆132Jan 16, 2025Updated last year
- [ECCV'24] MaxFusion: Plug & Play multimodal generation in text to image diffusion models☆27Nov 2, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CVPR 2025] Gaussian World Model for Streaming 3D Occupancy Prediction☆156Dec 4, 2025Updated 6 months ago
- a family of versatile and state-of-the-art video tokenizers.☆451Sep 1, 2025Updated 9 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆313Mar 12, 2025Updated last year
- AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers☆163Sep 16, 2025Updated 8 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Oct 9, 2025Updated 8 months ago
- [EMNLP 2024] Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality☆22Oct 8, 2024Updated last year
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆650Oct 29, 2025Updated 7 months ago
- UniCon: A Simple Approach to Unifying Diffusion-based Conditional Generation (ICLR 2025)☆38Jun 21, 2025Updated 11 months ago
- Code for "AVG-LLaVA: A Multimodal Large Model with Adaptive Visual Granularity"☆33Oct 12, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CVPR 2025] Science-T2I: Addressing Scientific Illusions in Image Synthesis☆62Mar 31, 2026Updated 2 months ago
- ☆13Jul 10, 2024Updated last year
- ☆19May 24, 2026Updated 2 weeks ago
- Live2Diff: A Pipeline that processes Live video streams by a uni-directional video Diffusion model.☆199Jul 22, 2024Updated last year
- [AAAI26] Next Patch Prediction☆129Jan 2, 2025Updated last year
- Out-of-Distribution Semantic Occupancy Prediction☆23May 23, 2026Updated 2 weeks ago
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆666Jul 1, 2025Updated 11 months ago
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving☆56Feb 14, 2025Updated last year
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers"☆171Nov 5, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory☆365Feb 21, 2026Updated 3 months ago
- [ICLR 2025] OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework☆59Mar 18, 2026Updated 2 months ago
- ☆21Jan 17, 2025Updated last year
- Official Implementation of wd1☆30Sep 25, 2025Updated 8 months ago
- ☆42Jun 9, 2025Updated 11 months ago
- Video-Infinity generates long videos quickly using multiple GPUs without extra training.☆191Aug 4, 2024Updated last year
- ☆383Jun 6, 2024Updated 2 years ago