Multimodal short video classification task, integrating video, image, audio and text modes for short video classification
☆19Mar 12, 2020Updated 5 years ago
Alternatives and similar repositories for Multimodal-short-video-classification
Users that are interested in Multimodal-short-video-classification are comparing it to the libraries listed below
Sorting:
- Hand Gesture Controlled Tello Drone using Python and OpenCV 2021☆11Jun 6, 2022Updated 3 years ago
- ☆11May 18, 2022Updated 3 years ago
- 使用百度Paddle框架进行视频分类算法NeXtVLAD视频分类模型。☆11Oct 16, 2019Updated 6 years ago
- ☆14Aug 24, 2018Updated 7 years ago
- 500,000 multimodal short video data and baseline models. 50万条多模态短视频数据集和基线模型(TensorFlow2.0)。☆134Jul 23, 2019Updated 6 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Aug 17, 2020Updated 5 years ago
- This paper has been accepted in ACM ICMR 2021.☆20Nov 17, 2025Updated 3 months ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆25Sep 4, 2020Updated 5 years ago
- Multi-model analysis of sentiment and emotion in multi-speaker conversations.☆28Jul 6, 2023Updated 2 years ago
- Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition☆32Dec 4, 2020Updated 5 years ago
- Engaged in research to help improve to boost text sentiment analysis using facial features from video using machine learning.☆32Jan 12, 2018Updated 8 years ago
- Implementation of the paper "Real-Time Emotion Recognition via Attention Gated Hierarchical Memory Network" in AAAI-2020.☆31Sep 2, 2022Updated 3 years ago
- (Competition) 6th -- Scene-Text-Detection-and-Recognition.☆10Jun 14, 2022Updated 3 years ago
- Autonomous Exploration of mobile robots in unknown environments using Deep Reinforcement learning☆14Oct 28, 2023Updated 2 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 4 years ago
- Implement attention model to LSTM using TensorFlow☆10Jul 3, 2018Updated 7 years ago
- 多模态数据融合:为了完成多模态数据融合,首先利用VGG16网络和cifar10数据集完成多输入网络的分类,在VGG16的基础之上,将前三层特征提取网络作为不同输入的特征提取网络,在中间层进行特征拼接,后面的卷积层用于提取融合特征,最后加上全连接层。该网络稍作修改就能同时提取…☆101Sep 25, 2020Updated 5 years ago
- Detect objects from the image, integrated with FLASK for front-end.☆11Jan 30, 2021Updated 5 years ago
- Action-Net is a dataset containing images of 16 different human actions.☆12Sep 22, 2019Updated 6 years ago
- A compiled list of resources and materials for PPML☆11May 10, 2025Updated 9 months ago
- The official implementation of InterBERT☆11Oct 18, 2022Updated 3 years ago
- PyTorch volume toolkit. Efficient data loading, dataset conversions, visualization tools☆10Dec 7, 2022Updated 3 years ago
- Optical Music Reader (OMR) for Parsing horizontally aligned music score sheets. Translation by Generating ABC annotation with a Music pla…☆11Sep 5, 2022Updated 3 years ago
- ☆10Nov 10, 2021Updated 4 years ago
- My Java and Python solutions for LeetCode problems. ( ^ _ ^ ) V☆10Aug 14, 2020Updated 5 years ago
- Implementation of Differential Learning Rate in Keras☆11Jun 4, 2019Updated 6 years ago
- Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务,支持GPU多卡、多worker、多客户端调用,开箱即用。☆12May 24, 2022Updated 3 years ago
- ☆13Feb 8, 2017Updated 9 years ago
- Applying Deep Reinforcement Learning for dialogue generation. aka chatbot☆13Apr 30, 2017Updated 8 years ago
- This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms".☆12Jan 30, 2021Updated 5 years ago
- Paper reading group of the TANGENT Lab @ PKU☆11Oct 16, 2018Updated 7 years ago
- Dialogue Evaluation 2020: Taxonomy Enrichment for the Russian Language☆12Nov 7, 2020Updated 5 years ago
- Code for the AAAI 2021 paper "Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition".☆10Nov 21, 2022Updated 3 years ago
- A fine multimodality fusion network :)☆11Aug 9, 2021Updated 4 years ago
- Instance-Level Salient Object Detection, Computer Vision and Image Understanding (CVIU), 2021.☆12Apr 23, 2021Updated 4 years ago
- A search engine implementation using OpenAI's clip model☆10Jun 20, 2021Updated 4 years ago
- Build Your Own Delivery Robot - Teleoperated, autonomous, lightweight and weatherproof. Free for personal use.☆13Oct 18, 2022Updated 3 years ago
- Contrast between ShuffleNet V2 and MnasNet.(Non-official implement In PyTorch)☆12Oct 25, 2018Updated 7 years ago