tianruochen / MultimodalVideoTag
多模态视频分类模型
☆17Updated 2 years ago
Alternatives and similar repositories for MultimodalVideoTag:
Users that are interested in MultimodalVideoTag are comparing it to the libraries listed below
- 使用pytorch完成的一个多模态分类任务,文本和图像部分分别使用了bert和resnet提取特征(在config里可以组合多种模型),在我的小规模数据集上取得了良好的性能(验证集acc96%)☆70Updated last year
- 计算机视觉课程设计-基于Chinese-CLIP的图文检索系统☆54Updated last year
- 中文CLIP:自定义数据集,可根据文图提取向量,实现文图匹配。☆22Updated 2 years ago
- 这是一 个基于Pytorch平台、Transformer框架实现的视频描述生成 (Video Captioning) 深度学习模型。 视频描述生成任务指的是:输入一个视频,输出一句描述整个视频内容的文字(前提是视频较短且可以用一句话来描述)。本repo主要目的是帮助视力障碍…☆83Updated 2 years ago
- (TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information☆26Updated last month
- Frames Extraction With OpenCV and Python☆15Updated 4 years ago
- 多模态融合情感分析☆120Updated 4 years ago
- Multimodal short video classification task, integrating video, image, audio and text modes for short video classification☆19Updated 4 years ago
- 500,000 multimodal short video data and baseline models. 50万条多模态短视频数据集和基线模型(TensorFlow2.0)。☆128Updated 5 years ago
- Papers, codes collection of video summarization / video highlight detection / video key frame selection☆35Updated 3 years ago
- 多模态情感分析——基于BERT+ResNet的多种融合方法☆260Updated 2 years ago
- This is the code for the Paper "Guilherme L. Toledo, Ricardo M. Marcacini: Transfer Learning with Joint Fine-Tuning for Multimodal Sentim…☆15Updated 2 years ago
- [AAAI 2024 Oral] M2CLIP: A Multimodal, Multi-Task Adapting Framework for Video Action Recognition☆46Updated last month
- Tiny Kinetics-400 for test☆87Updated 11 months ago
- 人工智能实验五:多模态情感分类☆14Updated 2 years ago
- 多模态融合情感分析☆33Updated 3 years ago
- 该仓库存放了多模态情感分析实验的配套代码。☆39Updated 2 years ago
- PyTorch implementation of HANet: Hierarchical Alignment Networks for Video-Text Retrieval (ACM MM 2021).☆47Updated 3 years ago
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆21Updated 4 years ago
- Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020☆109Updated 8 months ago
- [AAAI 2020] Official implementation of VAANet for Emotion Recognition☆77Updated last year
- 2019CCF爱奇艺视频拷贝(版权)检测算法☆15Updated 5 years ago
- ☆74Updated 2 years ago
- 使用百度Paddle框架进行视频分类算法NeXtVLAD视频分类模型。☆11Updated 5 years ago
- UniSA: Unified Generative Framework for Sentiment Analysis☆50Updated 9 months ago
- You Only Watch One Frame for Online Spatio-Temporal Action Detection☆32Updated last year
- Recognizing Micro-Expression in Video Clip with Adaptive Key-Frame Mining☆26Updated 3 years ago
- 商品图像检索、多模态、深度学习☆31Updated 3 years ago
- 媒体计算实践作业:图像——文本跨模态搜索☆38Updated 4 years ago
- 基于多模态检索的互联网图文匹配☆12Updated 10 months ago