[NeurIPS 2024] DEMO: Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning
☆45Nov 1, 2024Updated last year
Alternatives and similar repositories for DEMO
Users that are interested in DEMO are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A ray-based library of Distributed POPulation-based OPtimization for Large-Scale Black-Box Optimization.☆18Feb 23, 2024Updated 2 years ago
- A library for automatically designing metaheuristic optimizers.☆24Nov 9, 2025Updated 7 months ago
- Evolutionary Computation: A Modern Perspective |<...>| This is an online book, which is free-access and actively-updated (1st Edition: fr…☆60Mar 19, 2026Updated 2 months ago
- Implementation of the SIGIR 2020 paper "Automated Embedding Size Search in Deep Recommender Systems"☆28Feb 24, 2021Updated 5 years ago
- Video Diffusion Transformers are In-Context Learners☆37Jan 6, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆48Aug 26, 2025Updated 9 months ago
- This repo aims to customize moving trajectories in a video.☆13Sep 20, 2024Updated last year
- ☆30Sep 4, 2024Updated last year
- [NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models☆58Sep 29, 2025Updated 8 months ago
- [IEEE T-BIOM] FaceXBench: Evaluating Multimodal LLMs on Face Understanding☆20Jan 15, 2026Updated 4 months ago
- [ICLR 2026] OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models☆87Jan 21, 2026Updated 4 months ago
- [AAAI 2026] Zero-to-Hero: Zero-Shot Initialization Empowering Reference-Based Video Appearance Editing☆24Nov 20, 2025Updated 6 months ago
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆19Mar 21, 2024Updated 2 years ago
- Omni Controllable Video Diffusion☆46Dec 22, 2025Updated 5 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [AAAI 2025 Oral] GURecon: Learning Detailed 3D Geometric Uncertainties for Neural Surface Reconstruction☆23Jul 21, 2025Updated 10 months ago
- [ICML2026] Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆50Jun 2, 2025Updated last year
- [CVPR 2024] SimDA: Simple Diffusion Adapter for Efficient Video Generation☆129May 7, 2024Updated 2 years ago
- Colored Kimia Path24 Dataset: Configurations and Benchmarks with Deep Embeddings☆10Jun 6, 2024Updated 2 years ago
- ☆17Feb 20, 2025Updated last year
- ☆11Dec 20, 2024Updated last year
- Enabling pure data parallel training of DLRM via caching and prefetching☆17Oct 29, 2021Updated 4 years ago
- [ICCV 2025] Official Implementation of "Shot-by-Shot: Film-Grammar-Aware Training-Free Audio Description Generation". Junyu Xie, Tengda H…☆21May 16, 2026Updated 3 weeks ago
- [ECCV24] Attention Regulation on T2I Diffusion Models☆19Jul 8, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation☆38Nov 21, 2023Updated 2 years ago
- ☆56Feb 11, 2025Updated last year
- [CVPR 2025] GPS as a Control Signal for Image Generation☆25Mar 18, 2025Updated last year
- The official repo of continuous speculative decoding☆35Mar 28, 2025Updated last year
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆35May 18, 2026Updated 3 weeks ago
- An innovative method designed to augment the capabilities of existing video diffusion models☆22May 10, 2024Updated 2 years ago
- The source codes for D2AGE model. Distance-aware DAG Embedding for Proximity Search on Heterogeneous Graphs.☆12Feb 20, 2018Updated 8 years ago
- 多订阅合并后, 进行本机多端口匹配分流, 适合跨境或者多ip环境的网络设置☆26Feb 28, 2026Updated 3 months ago
- ☆25May 12, 2026Updated last month
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ACM MM 2024 (Oral)] Official PyTorch Implementation of Paper "MovingColor: Seamless Fusion of Fine-grained Video Color Enhancement"☆12Dec 30, 2024Updated last year
- Training-Free Condition-Guided Text-to-Video Generation☆62Oct 23, 2025Updated 7 months ago
- ☆12Oct 10, 2024Updated last year
- Code repository for T2V-Turbo and T2V-Turbo-v2☆312Jan 31, 2025Updated last year
- [CVPR2024] Parameter Efficient Fine-tuning via Cross Block Orchestration for Segment Anything Model☆12Jul 31, 2024Updated last year
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo…☆17Jun 12, 2024Updated 2 years ago
- ☆130Jun 24, 2025Updated 11 months ago