tianruochen / MultimodalVideoTag
Multimodal video classification model
☆21 Updated 2 years ago
Alternatives and similar repositories for MultimodalVideoTag
Users that are interested in MultimodalVideoTag are comparing it to the libraries listed below
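- A multimodal classification task implemented in PyTorch. BERT and ResNet extract the text and image features respectively (multiple models can be combined in the config), and the model performs well on the author's small dataset (96% validation accuracy) ☆78 Updated 2 years ago
- A video captioning deep learning model implemented in PyTorch with the Transformer framework. The video captioning task takes a video as input and outputs one sentence describing the content of the whole video (provided the video is short and can be described in one sentence). The main purpose of this repo is to help visually impaired… ☆91 Updated 3 years ago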
- Efficient dual attention SlowFast networks for video action recognition ☆24 Updated 2 years ago
- Chinese CLIP: supports custom datasets and extracts vectors from text and images to perform text-image matching. ☆22 Updated 2 years ago
- Multimodal short video classification task, integrating video, image, audio and text modes for short video classification ☆19 Updated 5 years ago
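- Multimodal fusion sentiment analysis ☆131 Updated 5 years ago
- Face recognition, fine-grained facial expression recognition, and abnormal behavior detection and recognition ☆11 Updated 3 years ago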
- Papers, codes collection of video summarization / video highlight detection / video key frame selection ☆36 Updated 3 years ago
- (TIP'2023) Concept-Aware Video Captioning: Describing Videos with Effective Prior Information ☆29 Updated 6 months ago
- pytorch ☆62 Updated 4 years ago
- Product image retrieval, multimodal, deep learning ☆31 Updated 3 years ago
- Tiny Kinetics-400 for test ☆92 Updated last year
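- A complete PyTorch codebase for image classification: training, prediction, TTA, model ensembling, model deployment, CNN feature extraction, classification with SVM or random forest, and model distillation ☆30 Updated 4 years ago
- PyTorch implementations of common action recognition models ☆33 Updated 4 years ago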
- Frames Extraction With OpenCV and Python ☆15 Updated 4 years ago
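- Multimodal fusion sentiment analysis ☆35 Updated 4 years ago
- Internet image-text matching based on multimodal retrieval ☆14 Updated last year
- Multimodal sentiment analysis: multiple fusion methods based on BERT+ResNet ☆312 Updated 2 years ago
- Trains an elderly fall detection model on a manually annotated dataset ☆18 Updated 2 years ago
- 500,000 multimodal short video data and baseline models (TensorFlow2.0). ☆128 Updated 5 years ago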
- Contains code for C3D, LCN and TSM for action recognition models. ☆10 Updated 5 years ago
- Official implementation of "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision" ECCV2020 ☆116 Updated last year
- Multimodal sentiment analysis ☆17 Updated last year
- [AAAI 2020] Official implementation of VAANet for Emotion Recognition ☆78 Updated last year
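- This project detects rumors in short videos through multimodal feature fusion and external knowledge, and uses contrastive learning to distinguish rumors ☆22 Updated last year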
- Key frames extraction in traffic videos using K-Means ☆13 Updated 7 years ago
- A re-trainable version of i3d. It is a superset of kinetics_i3d_pytorch repo from hassony2. You can train on your own dataset, an… ☆9 Updated 5 years ago
- Action_Recognition_Surveillance, for human fall down action recognition, helmet, smoke, cell-phone detection. ☆18 Updated 2 years ago
- This is the code for the Paper "Guilherme L. Toledo, Ricardo M. Marcacini: Transfer Learning with Joint Fine-Tuning for Multimodal Sentim… ☆16 Updated 2 years ago
- EduWatching: a real-time smart classroom monitoring system based on PaddlePaddle ☆72 Updated 2 years ago