Multimodal short video classification task, integrating video, image, audio and text modes for short video classification
☆20Mar 12, 2020Updated 6 years ago
Alternatives and similar repositories for Multimodal-short-video-classification
Users that are interested in Multimodal-short-video-classification are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 500,000 multimodal short video data and baseline models. 50万条多模态短视频数据集和基线模型(TensorFlow2.0)。☆136Jul 23, 2019Updated 6 years ago
- 使用百度Paddle框架进行视频分类算法NeXtVLAD视频分类模型。☆11Oct 16, 2019Updated 6 years ago
- ☆14Aug 24, 2018Updated 7 years ago
- Hand Gesture Controlled Tello Drone using Python and OpenCV 2021☆12Jun 6, 2022Updated 3 years ago
- ☆14Oct 14, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Modulated Fusion using Transformer for Linguistic-Acoustic Emotion Recognition☆32Dec 4, 2020Updated 5 years ago
- A search engine implementation using OpenAI's clip model☆10Jun 20, 2021Updated 4 years ago
- ☆11May 18, 2022Updated 3 years ago
- Implementation of the paper "Real-Time Emotion Recognition via Attention Gated Hierarchical Memory Network" in AAAI-2020.☆31Sep 2, 2022Updated 3 years ago
- Build Your Own Delivery Robot - Teleoperated, autonomous, lightweight and weatherproof. Free for personal use.☆13Oct 18, 2022Updated 3 years ago
- TianChi 2018广东工业智造大数据创新大赛——智能算法赛(复赛baseline代码)☆18Nov 6, 2018Updated 7 years ago
- A multimodal UAV assistant dataset.☆11Jun 14, 2021Updated 4 years ago
- 多模态数据融合:为了完成多模态数据融合,首先利用VGG16网络和cifar10数据集完成多输入网络的分类,在VGG16的基础之上,将前三层特征提取网络作为不同输入的特征提取网络,在中间层进行特征拼接,后面的卷积层用于提取融合特征,最后加上全连接层。该网络稍作修改就能同时提取…☆102Sep 25, 2020Updated 5 years ago
- ☆12Oct 13, 2017Updated 8 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 3D sMRI data classification using PyTorch.☆15Aug 26, 2019Updated 6 years ago
- ☆25Jun 3, 2020Updated 5 years ago
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion"☆28Aug 19, 2021Updated 4 years ago
- 在文件虚拟磁盘上实现 FAT 文件系统☆13May 2, 2016Updated 10 years ago
- Detect objects from the image, integrated with FLASK for front-end.☆11Jan 30, 2021Updated 5 years ago
- ☆16Apr 7, 2024Updated 2 years ago
- Fine-Grained Visual Classification on Stanford Cars Dataset☆12Jun 21, 2022Updated 3 years ago
- This repository contains code and data for "On the Multimodal Person Verification Using Audio-Visual-Thermal Data"☆12Apr 27, 2023Updated 3 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Aug 17, 2020Updated 5 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Engaged in research to help improve to boost text sentiment analysis using facial features from video using machine learning.