PALMJJ / Multimodal-short-video-classificationView external linksLinks
Multimodal short video classification task, integrating video, image, audio and text modes for short video classification
☆19Mar 12, 2020Updated 5 years ago
Alternatives and similar repositories for Multimodal-short-video-classification
Users that are interested in Multimodal-short-video-classification are comparing it to the libraries listed below
Sorting:
- Hand Gesture Controlled Tello Drone using Python and OpenCV 2021☆11Jun 6, 2022Updated 3 years ago
- ☆11May 18, 2022Updated 3 years ago
- ☆14Aug 24, 2018Updated 7 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Aug 17, 2020Updated 5 years ago
- This paper has been accepted in ACM ICMR 2021.☆20Nov 17, 2025Updated 2 months ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆25Sep 4, 2020Updated 5 years ago
- Multimodal Fusion, Multimodal Sentiment Analysis☆26Jun 20, 2020Updated 5 years ago
- Multi-model analysis of sentiment and emotion in multi-speaker conversations.☆28Jul 6, 2023Updated 2 years ago
- Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition☆31Dec 4, 2020Updated 5 years ago
- Engaged in research to help improve to boost text sentiment analysis using facial features from video using machine learning.☆32Jan 12, 2018Updated 8 years ago
- Implementation of the paper "Real-Time Emotion Recognition via Attention Gated Hierarchical Memory Network" in AAAI-2020.☆31Sep 2, 2022Updated 3 years ago
- (Competition) 6th -- Scene-Text-Detection-and-Recognition.☆10Jun 14, 2022Updated 3 years ago
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- Autonomous Exploration of mobile robots in unknown environments using Deep Reinforcement learning☆13Oct 28, 2023Updated 2 years ago
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 4 years ago
- Annotated dataset of quadrotor Eagle for object detection of UAVs☆14Apr 4, 2022Updated 3 years ago
- 多模态数据融合:为了完成多模态数据融合,首先利用VGG16网络和cifar10数据集完成多输入网络的分类,在VGG16的基础之上,将前三层特征提取网络作为不同输入的特征提取网络,在中间层进行特征拼接,后面的卷积层用于提取融合特征,最后加上全连接层。该网络稍作修改就能同时提取…☆101Sep 25, 2020Updated 5 years ago
- ☆13Feb 8, 2017Updated 9 years ago
- A compiled list of resources and materials for PPML☆11May 10, 2025Updated 9 months ago
- PyTorch Lightning based framework to run experiments for self-supervised learning tasks.☆10Feb 14, 2020Updated 5 years ago
- Action-Net is a dataset containing images of 16 different human actions.☆12Sep 22, 2019Updated 6 years ago
- Dialogue Evaluation 2020: Taxonomy Enrichment for the Russian Language☆12Nov 7, 2020Updated 5 years ago
- Sliding Convolutional Attention Network for Scene Text Recognition☆11Aug 31, 2018Updated 7 years ago
- Service for Bert model to Vector. 高效的文本转向量(Text-To-Vector)服务,支持GPU多卡、多worker、多客户端调用,开箱即用。☆12May 24, 2022Updated 3 years ago
- 利用小程序本地存储封装的激励视频版积分系统☆11Jun 19, 2019Updated 6 years ago
- Implementation of Differential Learning Rate in Keras☆11Jun 4, 2019Updated 6 years ago
- The official implementation of InterBERT☆11Oct 18, 2022Updated 3 years ago
- A search engine implementation using OpenAI's clip model☆10Jun 20, 2021Updated 4 years ago
- ☆12Oct 31, 2016Updated 9 years ago
- Instance-Level Salient Object Detection, Computer Vision and Image Understanding (CVIU), 2021.☆12Apr 23, 2021Updated 4 years ago
- Code for the AAAI 2021 paper "Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition".☆10Nov 21, 2022Updated 3 years ago
- Materials for Graph Models and Graph Networks☆11Jul 6, 2018Updated 7 years ago
- My Java and Python solutions for LeetCode problems. ( ^ _ ^ ) V☆10Aug 14, 2020Updated 5 years ago
- PyTorch volume toolkit. Efficient data loading, dataset conversions, visualization tools☆10Dec 7, 2022Updated 3 years ago
- Reference implementation and test synthetic data for Sorted Center Time echo density measure for acoustic impulse responses☆15Mar 18, 2020Updated 5 years ago
- This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms".☆12Jan 30, 2021Updated 5 years ago
- A fine multimodality fusion network :)☆11Aug 9, 2021Updated 4 years ago
- Build Your Own Delivery Robot - Teleoperated, autonomous, lightweight and weatherproof. Free for personal use.☆13Oct 18, 2022Updated 3 years ago