xuejianhuang/EMRFM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xuejianhuang/EMRFM)

xuejianhuang / EMRFM

An effective multimodal representation and fusion method for multimodal intent recognition

☆19

Alternatives and similar repositories for EMRFM

Users that are interested in EMRFM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xuejianhuang / KDSGAT-FNVD
View on GitHub
Knowledge-Enhanced Dynamic Scene Graph Attention Network for Fake News Video Detection
☆22Oct 22, 2025Updated 9 months ago
xuejianhuang / rummordetection_lstm
View on GitHub
基于LSTM的谣言检测(Rumor Detection)
☆61Apr 2, 2024Updated 2 years ago
bill13031 / fakenewsdetection
View on GitHub
A repository for fake news detection.
☆18Jun 29, 2023Updated 3 years ago
JoeYing1019 / SDIF-DA
View on GitHub
[ICASSP2024] Code for paper "SDIF-DA: A Shallow-to-Deep Interaction Framework with Data Augmentation for Multi-modal Intent Detection"
☆16Jul 6, 2024Updated 2 years ago
fufangze / SDR-GNN
View on GitHub
An official pytorch implementation for the paper: SDR-GNN: Spectral Domain Reconstruction Graph Neural Network for incomplete multimodal …
☆18Dec 30, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
FeipengMa6 / PriSA
View on GitHub
[ICME 2023 Oral] Pytorch implementation for Multimodal Sentiment Analysis with Preferential Fusion and Distance-aware Contrastive Learnin…
☆24Jan 2, 2024Updated 2 years ago
JinYuanLi0012 / PGIM
View on GitHub
[EMNLP 2023 Findings] Prompting Chatgpt in MNER: Enhanced Multimodal Named Entity Recognition with Auxiliary Refined Knowledge
☆33Nov 10, 2024Updated last year
thuiar / MIntRec2.0
View on GitHub
MIntRec2.0 is the first large-scale dataset for multimodal intent recognition and out-of-scope detection in multi-party conversations (IC…
☆83Aug 13, 2025Updated 11 months ago
ta012 / DTFAT
View on GitHub
[AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification
☆12Mar 10, 2025Updated last year
drivsaf / MFAN
View on GitHub
☆64Jan 18, 2023Updated 3 years ago
MengShen0709 / bmmal
View on GitHub
[ACMMM 2023] BMMAL: Towards Balanced Active Learning for Multimodal Classification
☆17Sep 25, 2023Updated 2 years ago
AV-Odyssey / AV-Odyssey
View on GitHub
This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?"
☆31Dec 23, 2024Updated last year
OrkhanHI / pytorch_grid_sample_python
View on GitHub
Python function of Pytorch Grid Sample with Zero Padding
☆19Jan 4, 2021Updated 5 years ago
ZionGo6 / VLM-Auto
View on GitHub
Repo for 'VLM-Auto: VLM-based Autonomous Driving Assistant with Human-like Behavior and Understanding for Complex Road Scenes'
☆27Oct 10, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
JiuTian-VL / MoME
View on GitHub
[NeurIPS 2024] MoME: Mixture of Multimodal Experts for Generalist Multimodal Large Language Models
☆85Dec 27, 2025Updated 7 months ago
ruiyu0 / Retracking-by-Prediction
View on GitHub
Code and data for "Towards Robust Human Trajectory Prediction in Raw Videos" IROS 2021
☆28Aug 19, 2021Updated 4 years ago
JongSuk1 / EquiAV
View on GitHub
☆36Jan 20, 2025Updated last year
MINT-SJTU / STI-Bench
View on GitHub
STI-Bench : Are MLLMs Ready for Precise Spatial-Temporal World Understanding?
☆39Jan 12, 2026Updated 6 months ago
hotfinda / VideoMambaPro
View on GitHub
Improving Mamaba performance on Video Understanding task
☆48Dec 30, 2025Updated 6 months ago
LeapLabTHU / Uni-AdaFocus
View on GitHub
Official repository of Uni-AdaFocus (TPAMI 2024).
☆59Dec 17, 2024Updated last year
shicaiwei123 / ECCV2024-DMRNet
View on GitHub
Code for dmrnet
☆45Jul 16, 2025Updated last year
NMS05 / Multimodal-Fusion-with-Attention-Bottlenecks
View on GitHub
☆42Nov 22, 2024Updated last year
stoneMo / DeepAVFusion
View on GitHub
Official codebase for "Unveiling the Power of Audio-Visual Early Fusion Transformers with Dense Interactions through Masked Modeling".
☆43Aug 2, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Robot-K / Hint-AD
View on GitHub
CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving
☆74Oct 30, 2024Updated last year
shicaiwei123 / ICCV2025-ARL
View on GitHub
The official code for Improving Multimodal Learning via Imbalanced Learning
☆41Mar 26, 2026Updated 4 months ago
woxihuanjiangguo / BEVNeXt
View on GitHub
☆66Sep 3, 2024Updated last year
zhangyp15 / GraphAD
View on GitHub
☆68Mar 27, 2024Updated 2 years ago
epic-kitchens / epic-sounds-annotations
View on GitHub
Splits for epic-sounds dataset
☆85Aug 2, 2025Updated 11 months ago
Kguo-cs / TDOR
View on GitHub
☆66Feb 5, 2024Updated 2 years ago
shicaiwei123 / ICCV2025-GDL
View on GitHub
The official code for Boosting Multimodal Learning via Disentangled Gradient Learning
☆48Nov 22, 2025Updated 8 months ago
YuHengsss / MSVMamba
View on GitHub
[NeurIPS2024] Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model
☆84Dec 25, 2024Updated last year
declare-lab / MISA
View on GitHub
MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis
☆294Mar 14, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hustvl / mmMamba
View on GitHub
The first decoder-only multimodal state space model
☆104May 19, 2025Updated last year
daniel-code / TubeViT
View on GitHub
An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"
☆95Jul 15, 2026Updated 2 weeks ago
Theia-4869 / FasterVLM
View on GitHub
Official code for paper: [CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster.
☆114Jun 29, 2025Updated last year
XiandaGuo / Drive-MLLM
View on GitHub
[NeurIPS 2025] SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models
☆84Sep 23, 2025Updated 10 months ago
TonyLianLong / CrossMAE
View on GitHub
Official Implementation of the CrossMAE paper: Rethinking Patch Dependence for Masked Autoencoders
☆135Apr 10, 2025Updated last year
Theia-4869 / VisPruner
View on GitHub
[ICCV 2025] Official code for paper: Beyond Text-Visual Attention: Exploiting Visual Cues for Effective Token Pruning in VLMs
☆84Jul 1, 2025Updated last year
zhijian11 / RoboTron-Drive
View on GitHub
☆108Dec 27, 2024Updated last year