An effective multimodal representation and fusion method for multimodal intent recognition
☆18Jun 7, 2024Updated last year
Alternatives and similar repositories for EMRFM
Users that are interested in EMRFM are comparing it to the libraries listed below
Sorting:
- Dual Evidence Enhancement and Text-Image Similarity Awareness for Multimodal Rumor Detection☆23Apr 25, 2025Updated 10 months ago
- Knowledge-Enhanced Dynamic Scene Graph Attention Network for Fake News Video Detection☆21Oct 22, 2025Updated 4 months ago
- A repository for fake news detection.☆19Jun 29, 2023Updated 2 years ago
- The implementation codes of paper: Multimodal Sentiment Analysis with Mutual Information-based Disentangled Representation Learning☆18May 8, 2025Updated 10 months ago
- MIntRec: A New Dataset for Multimodal Intent Recognition (ACM MM 2022)☆128May 2, 2025Updated 10 months ago
- [AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification☆12Mar 10, 2025Updated last year
- [ICASSP2024] Code for paper "SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection"☆15Jul 6, 2024Updated last year
- MIntRec2.0 is the first large-scale dataset for multimodal intent recognition and out-of-scope detection in multi-party conversations (IC…☆72Aug 13, 2025Updated 6 months ago
- This repo contains the codes and steps to perform object detection on stanford drone dataset☆16Dec 27, 2023Updated 2 years ago
- [ACMMM 2023] BMMAL: Towards Balanced Active Learning for Multimodal Classification☆16Sep 25, 2023Updated 2 years ago
- The official code for Improving Multimodal Learning via Imbalanced Learning☆47Dec 9, 2025Updated 3 months ago
- Repo for 'VLM-Auto: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes'☆27Oct 10, 2024Updated last year
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"☆31Dec 23, 2024Updated last year
- ☆27Apr 29, 2025Updated 10 months ago
- This is a realistic 3D scene in Visual Studio with the OpenGL library, which contains multiple 3D objects with lighting, texture effects,…☆38Apr 18, 2018Updated 7 years ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆58Oct 8, 2025Updated 5 months ago
- ☆45May 13, 2025Updated 9 months ago
- Code for dmrnet☆64Jul 16, 2025Updated 7 months ago
- Improving Mamaba performance on Video Understanding task☆45Dec 30, 2025Updated 2 months ago
- Official repository of Uni-AdaFocus (TPAMI 2024).☆61Dec 17, 2024Updated last year
- ☆60Feb 20, 2024Updated 2 years ago
- [NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models☆79Sep 23, 2025Updated 5 months ago
- [NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models☆79Dec 27, 2025Updated 2 months ago
- Official repository of paper titled "CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications…☆87Jan 15, 2026Updated last month
- [CVPR 2025] DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models☆102Nov 22, 2025Updated 3 months ago
- ☆66Mar 27, 2024Updated last year
- [NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model☆80Dec 25, 2024Updated last year
- Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.☆107Jun 29, 2025Updated 8 months ago
- This repo contains the code for paper "LightEMMA: Lightweight End-to-End Multimodal Model for Autonomous Driving"☆135Nov 19, 2025Updated 3 months ago
- End2EndPerception deployment solution based on vision sparse transformer paradigm is open sourced.☆174Jan 12, 2025Updated last year
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆150Dec 22, 2024Updated last year
- Official Code Release of "FusionAD"☆158Jul 9, 2024Updated last year
- [ICCV 2025] Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives☆231Dec 12, 2025Updated 2 months ago
- [CVPR2025] Don’t Shake the Wheel: Momentum-Aware Planning in End-to-End Autonomous Driving☆265Sep 9, 2025Updated 6 months ago
- ☆379Oct 29, 2025Updated 4 months ago
- [ICLR 2025] The official implementation of SSR☆244Mar 23, 2025Updated 11 months ago
- [CVPR 2025, Spotlight] SimLingo (CarLLava): Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment☆363Aug 25, 2025Updated 6 months ago
- Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning☆318Mar 26, 2025Updated 11 months ago
- ☆286Aug 14, 2025Updated 6 months ago