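Multimodal data fusion: as a first step toward multimodal data fusion, this project uses the VGG16 network and the CIFAR-10 dataset to build a multi-input classification network. Starting from VGG16, the first three feature-extraction layers serve as a separate feature extractor for each input; the features are concatenated at an intermediate layer, the subsequent convolutional layers extract fused features, and fully connected layers are added at the end. With minor modifications, the network can take two corresponding images as input at the same time, extract features from each, and fuse them for classification.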
☆102Sep 25, 2020Updated 5 years ago
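The architecture described above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: it assumes PyTorch, reads "the first three layers" as the first three VGG16 convolutional layers, and uses hypothetical names (`TwoInputVGG`, `make_branch`) and a hypothetical 256-unit hidden layer in the classifier.

```python
import torch
import torch.nn as nn


def conv_block(in_ch, out_ch):
    """3x3 convolution followed by ReLU, the basic VGG building unit."""
    return [nn.Conv2d(in_ch, out_ch, kernel_size=3, padding=1), nn.ReLU(inplace=True)]


def make_branch():
    """Per-input extractor (assumption: the first three VGG16 conv layers)."""
    return nn.Sequential(
        *conv_block(3, 64),
        *conv_block(64, 64),
        nn.MaxPool2d(2),
        *conv_block(64, 128),
    )


class TwoInputVGG(nn.Module):
    """Two branches, channel-wise concatenation, then the remaining VGG16 layers."""

    def __init__(self, num_classes=10):
        super().__init__()
        self.branch_a = make_branch()
        self.branch_b = make_branch()
        # Remaining VGG16 conv layers; "M" marks a 2x2 max-pool.
        cfg = [128, "M", 256, 256, 256, "M", 512, 512, 512, "M", 512, 512, 512, "M"]
        layers, in_ch = [], 256  # 128 channels from each branch after concatenation
        for v in cfg:
            if v == "M":
                layers.append(nn.MaxPool2d(2))
            else:
                layers += conv_block(in_ch, v)
                in_ch = v
        self.fused = nn.Sequential(*layers)
        # For 32x32 inputs the fused feature map is 512 x 1 x 1 at this point.
        self.classifier = nn.Sequential(
            nn.Flatten(),
            nn.Linear(512, 256),  # hidden size is an illustrative choice
            nn.ReLU(inplace=True),
            nn.Linear(256, num_classes),
        )

    def forward(self, x_a, x_b):
        # Fuse the two feature maps along the channel dimension.
        fused = torch.cat([self.branch_a(x_a), self.branch_b(x_b)], dim=1)
        return self.classifier(self.fused(fused))


model = TwoInputVGG()
x_a = torch.randn(2, 3, 32, 32)  # first image of each pair (CIFAR-10 sized)
x_b = torch.randn(2, 3, 32, 32)  # second, corresponding image
logits = model(x_a, x_b)
print(logits.shape)  # torch.Size([2, 10])
```

Concatenating along the channel dimension requires both branches to produce feature maps with the same spatial size, which holds here because the two inputs share a resolution and the branches are identical in structure.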
Alternatives and similar repositories for multiModalityFusionForClassification
Users that are interested in multiModalityFusionForClassification are comparing it to the libraries listed below.
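- A multimodal classification task implemented in PyTorch; BERT and ResNet extract the text and image features respectively (multiple models can be combined in the config), and it achieves good performance on my small dataset (96% validation accuracy)☆83Mar 25, 2023Updated 3 years ago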
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion"☆28Aug 19, 2021Updated 4 years ago
- A network that fuses different features of multimodal MRI images to segment cerebral infarction regions (a new network modified from U-Net)☆15Jun 20, 2019Updated 6 years ago
- Multimodal short video classification task, integrating video, image, audio and text modes for short video classification☆20Mar 12, 2020Updated 6 years ago
- ☆15Sep 22, 2021Updated 4 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆31Apr 13, 2020Updated 6 years ago
- Using CNN for classifying 101 different food categories - using VGG16, Alex Net and SVM☆10Jan 6, 2020Updated 6 years ago
- ☆14Aug 24, 2018Updated 7 years ago
- A demo for multi-modal emotion recognition.☆89Apr 2, 2024Updated 2 years ago
- This repository contains various models targeting multimodal representation learning, multimodal fusion for downstream tasks such as mul…☆917Mar 15, 2023Updated 3 years ago
- Multimodal fusion sentiment analysis☆140May 15, 2020Updated 5 years ago
- Multi-modal classifications of digits with image and audio modality. One shot learning with Siamese network is used to predict if the giv…☆15Mar 25, 2023Updated 3 years ago
- A fine multimodality fusion network :)☆10Aug 9, 2021Updated 4 years ago
- Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.☆69Nov 19, 2022Updated 3 years ago
- This is the code corresponding to the paper "Resolve Domain Conflicts for Generalizable Remote Physiological Measurement." accepted in AC…☆15Apr 15, 2024Updated 2 years ago
- Micro-expression recognition based on multi-feature fusion☆67Jul 15, 2020Updated 5 years ago
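- Solution for the change detection track of the ISPRS Technical Commission I contest on intelligent interpretation for multimodal remote sensing application algorithms☆20Jun 29, 2025Updated 10 months ago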
- Product image retrieval, multimodal, deep learning☆32Nov 25, 2021Updated 4 years ago
- Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion"☆121Jun 16, 2020Updated 5 years ago
- ☆11Mar 21, 2022Updated 4 years ago
- Emotion analysis on DREAMER dataset using various Deep Learning Techniques☆13Jan 1, 2021Updated 5 years ago
- Official implementation of paper "MSAF: Multimodal Split Attention Fusion"☆80Jun 16, 2021Updated 4 years ago
- repo for "Decision explanation and feature importance for invertible networks"☆14Nov 13, 2019Updated 6 years ago
- Dynamic Selective Network for RGB-D Salient Object Detection☆12Jan 22, 2025Updated last year
- Zero-shot fault diagnosis on the Tennessee–Eastman process by attribute fusion transfer. Paper: Attribute fusion transfer for zero-shot f…☆33Oct 12, 2023Updated 2 years ago
- Using GANs to augment medical imaging data to improve classification accuracy☆12Jan 21, 2018Updated 8 years ago
- A Pytorch implementation of emotion recognition from videos☆18Sep 15, 2020Updated 5 years ago
- ☆14Mar 17, 2019Updated 7 years ago
- Multimodal sentiment analysis: multiple fusion methods based on BERT+ResNet☆366Nov 20, 2022Updated 3 years ago
- Use CNNs to estimate a grasping point and angle of a given object, so that the robot arm can pick the object.☆30Sep 3, 2021Updated 4 years ago
- A pest and disease detection project from our lab, with a series of improvements built on top of SSD.☆35Jul 30, 2022Updated 3 years ago
- A ROS (Robot Operating System) package for simple object detection and planar pose estimation for objects that requires only an image o…☆28Jan 15, 2024Updated 2 years ago
- Reconstruction codes for 3D-EPTI method☆11Jun 15, 2021Updated 4 years ago
- SegNet, Unet, and DeepLabV3 for Semantic Segmentation using Keras.☆10Jan 19, 2023Updated 3 years ago
- This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et al. ACL 2018☆275May 31, 2020Updated 5 years ago
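- TianChi 2018 Guangdong Industrial Intelligent Manufacturing Big Data Innovation Competition, intelligent algorithm track (baseline code for the semi-final round)☆18Nov 6, 2018Updated 7 years ago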
- Second-place solution for the 2020 BAAI-JD Multimodal Dialogue Challenge☆35Dec 21, 2022Updated 3 years ago
- A deep learning-based medical image registration package based on PyTorch.☆22Apr 1, 2026Updated last month
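- A multimodal content understanding algorithm framework, including modules for data processing, pretrained models, common models, and model acceleration.☆324Oct 26, 2021Updated 4 years ago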