[TMLR 2022] High-Modality Multimodal Transformer
☆116Nov 2, 2024Updated last year
Alternatives and similar repositories for HighMMT
Users that are interested in HighMMT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Aug 20, 2024Updated last year
- [NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning☆628Jan 27, 2024Updated 2 years ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆39Sep 6, 2021Updated 4 years ago
- ☆21Mar 15, 2023Updated 3 years ago
- [ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models☆99Aug 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Holistic evaluation of multimodal foundation models☆48Aug 11, 2024Updated last year
- [ICML 2023] Provable Dynamic Fusion for Low-Quality Multimodal Data☆123Jun 28, 2025Updated 11 months ago
- The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)☆317Sep 22, 2025Updated 8 months ago
- Source materials for CoinFT☆34Jan 23, 2026Updated 4 months ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 3 years ago
- Public repository for "Think Twice: Perspective-Taking Improves Large Language Models’ Theory-of-Mind Capabilities".☆25Aug 16, 2023Updated 2 years ago
- [NeurIPS 2023] Factorized Contrastive Learning: Going Beyond Multi-view Redundancy☆76Nov 13, 2023Updated 2 years ago
- Variational Reinforcement Learning☆17Jul 25, 2024Updated last year
- [Preprint 2022] “Can We Solve 3D Vision Tasks Starting from A 2D Vision Transformer?” by Yi Wang, Zhiwen Fan, Tianlong Chen, Hehe Fan, Zh…☆63Jan 18, 2023Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆15Apr 25, 2024Updated 2 years ago
- ActMAD: Activation Matching to Align Distributions for Test-Time-Training (CVPR 2023)☆21Jun 27, 2023Updated 2 years ago
- Lottery Tickets in Evolutionary Optimization (Lange & Sprekeler, ICML 2023)☆17Jun 2, 2023Updated 3 years ago
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'.☆15Feb 12, 2024Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆12Updated this week
- SMILE: A Multimodal Dataset for Understanding Laughter☆13Jun 15, 2023Updated 2 years ago
- [ICLR 2019] Learning Factorized Multimodal Representations☆69Aug 4, 2020Updated 5 years ago
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024☆62Nov 5, 2024Updated last year
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆53Jul 11, 2025Updated 11 months ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Aug 14, 2022Updated 3 years ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆30Apr 11, 2024Updated 2 years ago
- AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL.☆35Jan 12, 2024Updated 2 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30May 19, 2022Updated 4 years ago
- 【MICCAI 2023 Early Accept & MedIA】EyeMost "Reliable Multimodality Eye Disease Screening via Mixture of Student's t Distributions"☆27Dec 11, 2024Updated last year
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models☆279Apr 17, 2024Updated 2 years ago
- The Social-IQ 2.0 Challenge Release for the Artificial Social Intelligence Workshop at ICCV '23☆38Oct 13, 2023Updated 2 years ago
- Interactively evolve various types of art (pictures, animations, shapes, and sounds) using Compositional Pattern Producing Networks☆21May 9, 2018Updated 8 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for "Holistic Physics Solver: Learning PDEs in a Unified Spectral-Physical Space"☆24Mar 25, 2026Updated 2 months ago
- ☆14Mar 31, 2022Updated 4 years ago
- Building the cognitive-core to solve ARC-AGI-2☆27Feb 2, 2025Updated last year
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Jun 1, 2023Updated 3 years ago
- Pytorch implementation of Multimodal Neural Machine Translation(MNMT).☆12Jan 21, 2021Updated 5 years ago
- Beta-VAE, Conditional-VAE, Total Correlation-VAE, FactorVAE, Relevance Factor-VAE, Multi-Level VAE, (Soft)-IntroVAE (Beta-Version), LVAE,…☆17Aug 19, 2025Updated 9 months ago
- This is the official code for NeurIPS 2023 paper "Learning Unseen Modality Interaction"☆18Jan 22, 2024Updated 2 years ago