[TMLR 2022] High-Modality Multimodal Transformer
☆116Nov 2, 2024Updated last year
Alternatives and similar repositories for HighMMT
Users who are interested in HighMMT are comparing it to the libraries listed below.
- [NeurIPS 2023, ICMI 2023] Quantifying & Modeling Multimodal Interactions☆90Oct 28, 2024Updated last year
- ☆11Aug 20, 2024Updated last year
- [NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning☆631Jan 27, 2024Updated 2 years ago
- Implementation of Perceiver, General Perception with Iterative Attention, in Pytorch☆39Sep 6, 2021Updated 4 years ago
- ☆21Mar 15, 2023Updated 3 years ago
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆116Nov 30, 2022Updated 3 years ago
- Holistic evaluation of multimodal foundation models☆48Aug 11, 2024Updated last year
- [ICML 2023] Provable Dynamic Fusion for Low-Quality Multimodal Data☆124Jun 28, 2025Updated last year
- Interpretability method to find what navigation agents learn☆19Jun 16, 2022Updated 4 years ago
- The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)☆318Sep 22, 2025Updated 9 months ago
- Official code for the paper: "Metadata Archaeology"☆19May 10, 2023Updated 3 years ago
- Public repository for "Think Twice: Perspective-Taking Improves Large Language Models’ Theory-of-Mind Capabilities".☆25Aug 16, 2023Updated 2 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆26Jan 16, 2023Updated 3 years ago
- [NeurIPS 2023] Factorized Contrastive Learning: Going Beyond Multi-view Redundancy☆76Nov 13, 2023Updated 2 years ago
- Variational Reinforcement Learning☆17Jul 25, 2024Updated last year
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆15Apr 25, 2024Updated 2 years ago
- ActMAD: Activation Matching to Align Distributions for Test-Time-Training (CVPR 2023)☆21Jun 27, 2023Updated 3 years ago
- Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!☆11May 24, 2023Updated 3 years ago
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- Rust bindings for CTranslate2☆14Jun 21, 2023Updated 3 years ago
- Implementation for "DeltaPhi: Learning Physical Trajectory Residual for PDE Solving"☆13Jun 17, 2024Updated 2 years ago
- This repository contains the code of our paper 'Skip \n: A simple method to reduce hallucination in Large Vision-Language Models'.☆15Feb 12, 2024Updated 2 years ago
- This is an implementation of the paper "Are We Done with Object-Centric Learning?"☆13Jun 21, 2026Updated last week
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in Pytorch☆81Dec 4, 2022Updated 3 years ago
- ☆54Dec 30, 2024Updated last year
- Improved diffusion generative models with subspaces☆135Jun 1, 2022Updated 4 years ago
- SMILE: A Multimodal Dataset for Understanding Laughter☆13Jun 15, 2023Updated 3 years ago
- PyTorch codes for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"☆240Jan 20, 2023Updated 3 years ago
- [ICLR 2019] Learning Factorized Multimodal Representations☆69Aug 4, 2020Updated 5 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- ✨✨The Curse of Multi-Modalities (CMM): Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio☆54Jul 11, 2025Updated 11 months ago
- Implementation for NATv2.☆23Feb 20, 2021Updated 5 years ago
- ☆13Jul 20, 2024Updated last year
- The official repository of the paper "DeepM2CDL: Deep Multi-scale Multi-modal Convolutional Dictionary Learning Network" from IEEE Transa…☆57Apr 1, 2024Updated 2 years ago
- ☆19Jan 30, 2023Updated 3 years ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Aug 14, 2022Updated 3 years ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆31Apr 11, 2024Updated 2 years ago
- AgentHive provides the primitives and helpers for a seamless usage of robohive within TorchRL.☆36Jan 12, 2024Updated 2 years ago
- Source code for the paper "Policy Architectures for Compositional Generalization in Control"☆30May 19, 2022Updated 4 years ago