[Arxiv 2024] Official code for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
☆33Feb 6, 2025Updated last year
Alternatives and similar repositories for MMTrail
Users that are interested in MMTrail are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [Arxiv2022] Interpreting Class Conditional GANs with Channel Awareness☆17Apr 4, 2022Updated 4 years ago
- ☆17Dec 12, 2023Updated 2 years ago
- On Path to Multimodal Generalist: General-Level and General-Bench☆18Jul 11, 2025Updated 9 months ago
- ☆24May 23, 2025Updated 11 months ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Music production for silent film clips.☆32Apr 30, 2025Updated last year
- ☆20Aug 11, 2025Updated 8 months ago
- 🕹️ Explore cutting-edge techniques in game generation☆71Mar 16, 2026Updated last month
- [ICML 2022] Region-Based Semantic Factorization in GANs☆71Dec 24, 2022Updated 3 years ago
- [ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆405Jan 19, 2025Updated last year
- A dataset for Audio-Visual Sound Event Detection in Movies☆26Jan 23, 2023Updated 3 years ago
- [IROS 2021] Official code for "Stereo Waterdrop Removal with Row-wise Dilated Attention"☆35Aug 21, 2021Updated 4 years ago
- [ECCV 2022 Oral] 3D-Aware Indoor Scene Synthesis with Depth Priors☆71Nov 24, 2022Updated 3 years ago
- [ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation☆78Mar 29, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS 2025] PyTorch Implementation of "LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding"☆25Oct 27, 2025Updated 6 months ago
- official code for CVPR'24 paper Diff-BGM☆71Oct 12, 2024Updated last year
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- ☆14Oct 16, 2023Updated 2 years ago
- ☆32May 3, 2024Updated 2 years ago
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated 3 months ago
- VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation☆86Sep 12, 2024Updated last year
- The repo host the code and model of MAViL.☆45Jul 24, 2023Updated 2 years ago
- ☆62Jun 15, 2025Updated 10 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- ☆26Nov 26, 2024Updated last year
- Implementation of MathReader, Text-to-Speech for Mathematical Documents☆28Sep 23, 2025Updated 7 months ago
- Memory Oriented Transfer Learning for Semi-Supervised Image Deraining☆28Nov 23, 2023Updated 2 years ago
- Code and dataset release for "PACS: A Dataset for Physical Audiovisual CommonSense Reasoning" (ECCV 2022)☆18Dec 20, 2022Updated 3 years ago
- A Framework for Symbolic MUsic Graph Explanations☆10Jul 30, 2025Updated 9 months ago
- ☆14Oct 7, 2021Updated 4 years ago
- Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning☆44Jul 2, 2025Updated 10 months ago
- ☆12Jul 4, 2024Updated last year
- ☆85Dec 4, 2022Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆129Jun 7, 2025Updated 10 months ago
- ☆12Jun 1, 2024Updated last year
- MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]☆23Dec 10, 2025Updated 4 months ago
- ☆12Feb 2, 2024Updated 2 years ago
- Fast Image Restoration with Multi-bin Trainable Linear Units.☆11Dec 23, 2019Updated 6 years ago
- Official repository for "Structure-Enhanced Pop Music Generation via Harmony-Aware Learning", ACM MM 2022.☆14Mar 22, 2023Updated 3 years ago
- Code Release for the paper "TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation" in NeurIPS…☆14Dec 9, 2021Updated 4 years ago