We propose MMAD, a novel automated pipeline for precise AD generation. MMAD introduces ambient music alongside visual and linguistic, enhancing the model's multimodal representation learning through modality encoders and alignment.
☆16Dec 31, 2024Updated last year
Alternatives and similar repositories for MMAD
Users that are interested in MMAD are comparing it to the libraries listed below
Sorting:
- [ACL 2023] VSTAR is a multimodal dialogue dataset with scene and topic transition information☆15Oct 27, 2024Updated last year
- Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object☆18Dec 1, 2024Updated last year
- Ultraman: Single Image 3D Human Reconstruction with Ultra Speed and Detail☆16Jul 5, 2024Updated last year
- Official implementation of "EG4D: Explicit Generation of 4D Object without Score Distillation" (ICLR 2025)☆36Feb 14, 2025Updated last year
- Narrative movie understanding benchmark☆76Jun 11, 2025Updated 8 months ago
- The Social-IQ 2.0 Challenge Release for the Artificial Social Intelligence Workshop at ICCV '23☆36Oct 13, 2023Updated 2 years ago
- Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering☆11Feb 16, 2023Updated 3 years ago
- 智慧园区☆10Aug 3, 2017Updated 8 years ago
- real time face swap and one-click video deepfake with only a single image☆14Sep 10, 2024Updated last year
- HACKATON WINNER!!! (3,5K USD PRIZE). A live code interview tool powered by monaco, using SuperViz SDK.☆13Oct 28, 2024Updated last year
- A Python implementation of Delaunay triangulation☆11Aug 5, 2021Updated 4 years ago
- HyperCUT: Video Sequence from a Single Blurry Image using Unsupervised Ordering (CVPR'23)☆14Nov 4, 2025Updated 3 months ago
- Outpainting-Images-and-Videos-using-GANs☆12Nov 22, 2022Updated 3 years ago
- Add Rain Streak Mask On Unparied Image Using GAN☆10Sep 12, 2020Updated 5 years ago
- ☆18Mar 23, 2025Updated 11 months ago
- Code for our paper: "Building A Coding Assistant via Retrieval-Augmented Language Models"☆10Nov 2, 2024Updated last year
- The official repository of the paper "X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation"☆12Jan 22, 2025Updated last year
- 深度学习课程自己所做答案☆10Apr 23, 2018Updated 7 years ago
- Create your own 3D scene with words anywhere.☆29Updated this week
- huggingface-go : 高速下载 huggingface 的模型和数据集☆50Aug 27, 2025Updated 6 months ago
- [COLING 2025] Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs☆52Jan 22, 2025Updated last year
- 基于rasa的多轮问答学习和测试☆13May 12, 2019Updated 6 years ago
- ☆10Nov 27, 2024Updated last year
- ☆13Mar 11, 2025Updated 11 months ago
- [ICASSP 2026] This is the code repo for our paper: LegalΔ: Enhancing Legal Reasoning in LLMs via Reinforcement Learning with Chain-of-Tho…☆25Aug 20, 2025Updated 6 months ago
- ☆11Dec 11, 2017Updated 8 years ago
- Create a UIView hierarchy from XML☆12Apr 1, 2016Updated 9 years ago
- Automatically replace full publication names in a bibtex database file into official abbreviated names, or reverse. (Support IEEE/ACM/Sci…☆14Jul 30, 2024Updated last year
- No-Reference Image Quality Assessment with Global Statistical Features☆13Dec 3, 2021Updated 4 years ago
- [SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization☆10Jul 13, 2024Updated last year
- ☆12Mar 1, 2023Updated 3 years ago
- ☆13Jun 7, 2021Updated 4 years ago
- This is the code repo for our paper "Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression".☆12Feb 27, 2024Updated 2 years ago
- Python (pip) package for fitting mixtures of Student's t-distributions using either maximum likelihood (EM) or Bayesian methodology (vari…☆11Sep 23, 2025Updated 5 months ago
- Implementation of "HumanReg: Self-supervised Non-rigid Registration of Sparse Human Point Cloud" (3DV 2024)☆15Oct 26, 2024Updated last year
- ☆16Dec 30, 2022Updated 3 years ago
- A Raspberry Pi pipeline viewer☆12Mar 26, 2015Updated 10 years ago
- Internal diffusion for video inpainting☆15May 19, 2025Updated 9 months ago
- 2018年中国联通大数据创新大赛:高端用户离网预测/用户换机时间预测全套代码☆13Nov 23, 2018Updated 7 years ago