layer6ai-labs / fusemix
Data-Efficient Multimodal Fusion on a Single GPU
☆52Updated 8 months ago
Alternatives and similar repositories for fusemix:
Users that are interested in fusemix are comparing it to the libraries listed below
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆19Updated last month
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models☆60Updated 3 months ago
- Official repository of paper "Subobject-level Image Tokenization"☆64Updated 9 months ago
- LAVIS - A One-stop Library for Language-Vision Intelligence☆47Updated 5 months ago
- UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model☆20Updated 5 months ago