[Arxiv 2024] Official code for MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions
☆33Feb 6, 2025Updated last year
Alternatives and similar repositories for MMTrail
Users that are interested in MMTrail are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆36Feb 6, 2025Updated last year
- [Arxiv 2025] Official code for T-REX: Mixture-of-Rank-One-Experts with semantic-aware Intuition for Multi-task Large Language Model Finet…☆17May 16, 2025Updated last year
- [Arxiv2022] Interpreting Class Conditional GANs with Channel Awareness☆17Apr 4, 2022Updated 4 years ago
- ☆17Dec 12, 2023Updated 2 years ago
- blender scripts for shapenet☆11Oct 12, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- On Path to Multimodal Generalist: General-Level and General-Bench☆18Jul 11, 2025Updated 10 months ago
- This project is based on the [LTX-Video](https://github.com/Lightricks/LTX-Video) algorithm of the diffusers and optimized and accelerate…☆14Dec 31, 2024Updated last year
- ☆24May 23, 2025Updated last year
- Curriculum Vitae of Quan Wang☆15Dec 13, 2025Updated 5 months ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- [ICLR 2024] ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation☆77Apr 25, 2024Updated 2 years ago
- ☆24Jan 14, 2021Updated 5 years ago
- Model-based Hindsight Experience Replay☆10Jun 8, 2022Updated 3 years ago
- Music production for silent film clips.☆32Apr 30, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆20Aug 11, 2025Updated 9 months ago
- 🕹️ Explore cutting-edge techniques in game generation☆72Mar 16, 2026Updated 2 months ago
- 深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系sc…☆12Jul 5, 2019Updated 6 years ago
- [ICML 2022] Region-Based Semantic Factorization in GANs☆71Dec 24, 2022Updated 3 years ago
- Data and Pytorch implementation of IEEE TMM "EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation"☆30Mar 21, 2024Updated 2 years ago
- [ICCV 2025] VideoVAE+: Large Motion Video Autoencoding with Cross-modal Video VAE☆406Jan 19, 2025Updated last year
- [IROS 2021] Official code for "Stereo Waterdrop Removal with Row-wise Dilated Attention"☆35Aug 21, 2021Updated 4 years ago
- Motion-conditional image animation for video editing☆20Dec 2, 2023Updated 2 years ago
- [ECCV 2022 Oral] 3D-Aware Indoor Scene Synthesis with Depth Priors☆71Nov 24, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICCV 2023] Video Background Music Generation: Dataset, Method and Evaluation☆78Mar 29, 2024Updated 2 years ago
- [NeurIPS 2025] PyTorch Implementation of "LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding"☆25Oct 27, 2025Updated 6 months ago
- Transfering images python from server to client UI python socket☆13Sep 30, 2020Updated 5 years ago
- official code for CVPR'24 paper Diff-BGM☆71Oct 12, 2024Updated last year
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- Implementation of "Reconstruction-based Anomaly Detection with Completely Random Forest," SIAM International Conference on Data Mining (S…☆10Feb 16, 2021Updated 5 years ago
- ☆16Sep 29, 2025Updated 7 months ago
- ☆14Oct 16, 2023Updated 2 years ago
- ☆32May 3, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated 4 months ago
- ☆28Dec 16, 2024Updated last year
- VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation☆86Sep 12, 2024Updated last year
- ☆26Nov 26, 2024Updated last year
- The repo host the code and model of MAViL.☆45Jul 24, 2023Updated 2 years ago
- ☆61Jun 15, 2025Updated 11 months ago
- 此一project是由清华大学医学院的姚非凡与郑家瀚共同开发完成,这里运用了三个目标检测模型,来找到图像里的人脸,以及他们是否有带口罩,是个目标检测+2分类问题。 这一readme.md文件是为了帮助使用者如何正确使用我们的code。我们使用FasterRCNN可达到0.7…☆17Dec 8, 2022Updated 3 years ago