lzk901372 / MM-When2SpeakView external linksLinks
☆14May 20, 2025Updated 8 months ago
Alternatives and similar repositories for MM-When2Speak
Users that are interested in MM-When2Speak are comparing it to the libraries listed below
Sorting:
- A dataset of first-person monologue videos/transcript/annotations about "life lessons" in various domains. The main purpose is for multi-…☆17Jan 8, 2025Updated last year
- ☆28Nov 25, 2024Updated last year
- ☆41May 15, 2025Updated 9 months ago
- [NeurIPS 2025] Reward-Instruct: A Reward-Centric Approach to Fast Photo-Realistic Image Generation☆34Oct 24, 2025Updated 3 months ago
- [ECCV'24] Self-training Room Layout Estimation via Geometry-aware Ray-casting☆15Jan 20, 2025Updated last year
- Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)☆38Sep 10, 2024Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Official PyTorch implementation of The Linear Attention Resurrection in Vision Transformer☆15Sep 7, 2024Updated last year