[NeurIPS2024] Official code for (IMA) Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
☆23Oct 15, 2024Updated last year
Alternatives and similar repositories for ima-lmms
Users who are interested in ima-lmms are comparing it to the libraries listed below
- An exploration of LLM steering☆24Jun 15, 2024Updated last year
- Official code of "RoboOmni: Proactive Robot Manipulation in Omni-modal Context"☆89Nov 17, 2025Updated 3 months ago
- Official implementation of "Connect, Collapse, Corrupt: Learning Cross-Modal Tasks with Uni-Modal Data" (ICLR 2024)☆34Oct 16, 2024Updated last year
- [CVPR 2023] Improving Zero-shot Generalization and Robustness of Multi-modal Models☆35Jul 16, 2023Updated 2 years ago
- LLaVA combined with the Magvit image tokenizer, training an MLLM without a vision encoder. Unifies image understanding and generation.☆39Jun 20, 2024Updated last year
- SurgLaVi: Official repository☆27Updated this week
- ☆23Dec 11, 2025Updated 2 months ago
- ☆15Feb 12, 2026Updated 3 weeks ago
- ☆10Jul 2, 2021Updated 4 years ago
- Frequency tracking in time-frequency representations☆13Jan 19, 2021Updated 5 years ago
- [WACV 2025] Exploiting VLM Localizability and Semantics for Open Vocabulary Action Detection☆16Mar 23, 2025Updated 11 months ago
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆22Mar 2, 2026Updated last week
- Simulation of a seven-axis robotic arm☆13Jun 7, 2022Updated 3 years ago
- This repository contains the official code for "Flexible Biometrics Recognition: Bridging the Multimodality Gap through Attention, Alignm…☆11Oct 9, 2024Updated last year
- Adaptive Multimodal Reasoning via Reinforcement Learning☆23Jan 11, 2026Updated last month
- Flutter Project - Cinema Plus - Rive☆11Feb 24, 2020Updated 6 years ago
- Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining☆13Oct 22, 2021Updated 4 years ago
- ICNet in TensorFlow, Real-Time Segmentation☆10Aug 17, 2018Updated 7 years ago
- Code for the paper BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues (EMNLP20)☆11Jun 16, 2025Updated 8 months ago
- ☆26Oct 16, 2025Updated 4 months ago
- A project for converting VMD morph data to FBX☆11Dec 10, 2025Updated 2 months ago
- BFloat16 Fused Adam Operator for PyTorch☆16Nov 16, 2024Updated last year
- ☆11Sep 27, 2023Updated 2 years ago
- Official code repository for Findings of EMNLP 2022 paper: PseudoReasoner: Leveraging Pseudo Labels for Commonsense Knowledge Base Popula…☆11Oct 18, 2022Updated 3 years ago
- ALAS: Autonomous Learning Agent System☆15Aug 14, 2025Updated 6 months ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆13Sep 6, 2024Updated last year
- Class materials, homeworks and videos for probation preparation.☆19Feb 3, 2026Updated last month
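- This project provides XLNet pre-trained models for Chinese, aiming to enrich Chinese natural language processing resources and offer a diverse selection of Chinese pre-trained models. We welcome experts and scholars to download and use them and to help build and develop Chinese-language resources together.☆11May 30, 2023Updated 2 years ago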
- ☆12Oct 17, 2024Updated last year
- Imshow - Flexible and Customizable Image Display with Python☆13Dec 27, 2025Updated 2 months ago
- Time frequency ridge detection based on relevant ridge portions☆11Aug 17, 2023Updated 2 years ago
- Web app for makeup transfer using Stable Diffusion☆10Sep 11, 2023Updated 2 years ago
- Python bindings for NVIDIA CUDA APIs.☆13Mar 2, 2024Updated 2 years ago
- The Gradient Icon package is a powerful Flutter package that enables creating gradient icons effortlessly.☆20May 3, 2024Updated last year
- MLLM-DataEngine: An Iterative Refinement Approach for MLLM☆48May 24, 2024Updated last year
- Official Repository of LatentSeek☆77Jun 6, 2025Updated 9 months ago
- VideoNIAH: A Flexible Synthetic Method for Benchmarking Video MLLMs☆54Mar 9, 2025Updated last year
- Multimodal RewardBench☆62Feb 21, 2025Updated last year