[TMLR 2022] High-Modality Multimodal Transformer
☆117 · Nov 2, 2024 · Updated last year
Alternatives and similar repositories for HighMMT
Users who are interested in HighMMT are comparing it to the libraries listed below.
- [NeurIPS 2023, ICMI 2023] Quantifying & Modeling Multimodal Interactions ☆87 · Oct 28, 2024 · Updated last year
- [NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning ☆626 · Jan 27, 2024 · Updated 2 years ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch ☆39 · Sep 6, 2021 · Updated 4 years ago
- [ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models ☆100 · Aug 22, 2024 · Updated last year
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne… ☆116 · Nov 30, 2022 · Updated 3 years ago
- Holistic evaluation of multimodal foundation models ☆49 · Aug 11, 2024 · Updated last year
- [ICML 2023] Provable Dynamic Fusion for Low-Quality Multimodal Data ☆120 · Jun 28, 2025 · Updated 9 months ago
- The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL) ☆313 · Sep 22, 2025 · Updated 6 months ago
- Source materials for CoinFT ☆31 · Jan 23, 2026 · Updated 2 months ago
- Official code for the paper: "Metadata Archaeology" ☆19 · May 10, 2023 · Updated 2 years ago
- Public repository for "Think Twice: Perspective-Taking Improves Large Language Models’ Theory-of-Mind Capabilities". ☆23 · Aug 16, 2023 · Updated 2 years ago
- ☆21 · Jun 4, 2025 · Updated 10 months ago
- Official implementation of the paper The Hidden Language of Diffusion Models ☆77 · Jan 24, 2024 · Updated 2 years ago
- [NeurIPS 2023] Factorized Contrastive Learning: Going Beyond Multi-view Redundancy ☆74 · Nov 13, 2023 · Updated 2 years ago
- Variational Reinforcement Learning ☆17 · Jul 25, 2024 · Updated last year
- ☆14 · May 31, 2022 · Updated 3 years ago
- ActMAD: Activation Matching to Align Distributions for Test-Time-Training (CVPR 2023) ☆21 · Jun 27, 2023 · Updated 2 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon! ☆11 · May 24, 2023 · Updated 2 years ago
- Lottery Tickets in Evolutionary Optimization (Lange & Sprekeler, ICML 2023) ☆17 · Jun 2, 2023 · Updated 2 years ago
- (CVPR Workshop Best Paper Award) Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustn… ☆17 · Nov 4, 2025 · Updated 5 months ago
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving" ☆13 · Jun 17, 2024 · Updated last year
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'. ☆15 · Feb 12, 2024 · Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?" ☆12 · Updated this week
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch ☆76 · Dec 4, 2022 · Updated 3 years ago
- ☆54 · Dec 30, 2024 · Updated last year
- SMILE: A Multimodal Dataset for Understanding Laughter ☆13 · Jun 15, 2023 · Updated 2 years ago
- The repo for "Enhancing Multi-modal Cooperation via Sample-level Modality Valuation", CVPR 2024 ☆61 · Nov 5, 2024 · Updated last year
- Python implementation of CTC beam search decoder + agnostic LM scorer ☆20 · Dec 16, 2020 · Updated 5 years ago
- The Social-IQ 2.0 Challenge Release for the Artificial Social Intelligence Workshop at ICCV '23 ☆37 · Oct 13, 2023 · Updated 2 years ago
- The official repository of the paper "DeepM2CDL: Deep Multi-scale Multi-modal Convolutional Dictionary Learning Network" from IEEE Transa… ☆56 · Apr 1, 2024 · Updated 2 years ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper ☆31 · Apr 11, 2024 · Updated 2 years ago
- AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL. ☆35 · Jan 12, 2024 · Updated 2 years ago
- Official code for the paper: [ICCV 2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation ☆42 · Dec 23, 2023 · Updated 2 years ago
- 【MICCAI 2023 Early Accept & MedIA】EyeMost "Reliable Multimodality Eye Disease Screening via Mixture of Student's t Distributions" ☆27 · Dec 11, 2024 · Updated last year
- [CVPR 2024] VCoder: Versatile Vision Encoders for Multimodal Large Language Models ☆279 · Apr 17, 2024 · Updated last year
- Introducing diverse tasks for NAS ☆52 · Dec 11, 2022 · Updated 3 years ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity ☆26 · Jul 11, 2023 · Updated 2 years ago
- Building the cognitive-core to solve ARC-AGI-2 ☆27 · Feb 2, 2025 · Updated last year
- ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models (ICLR 2024, Official Implementation) ☆16 · Jan 18, 2024 · Updated 2 years ago