Multimodal short video classification task, integrating video, image, audio and text modes for short video classification
☆20Mar 12, 2020Updated 6 years ago
Alternatives and similar repositories for Multimodal-short-video-classification
Users that are interested in Multimodal-short-video-classification are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 500,000 multimodal short video data and baseline models. 50万条多模态短视频数据集和基线模型(TensorFlow2.0)。☆136Jul 23, 2019Updated 6 years ago
- 使用百度Paddle框架进行视频分类算法NeXtVLAD视频分类模型。☆11Oct 16, 2019Updated 6 years ago
- ☆14Aug 24, 2018Updated 7 years ago
- ☆14Oct 14, 2019Updated 6 years ago
- A search engine implementation using OpenAI's clip model☆10Jun 20, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11May 18, 2022Updated 4 years ago
- Annotated dataset of quadrotor Eagle for object detection of UAVs☆15Apr 4, 2022Updated 4 years ago
- This paper has been accepted in ACM ICMR 2021.☆20Nov 17, 2025Updated 6 months ago
- A multimodal UAV assistant dataset.☆11Jun 14, 2021Updated 4 years ago
- Autonomous Exploration of mobile robots in unknown environments using Deep Reinforcement learning☆14Oct 28, 2023Updated 2 years ago
- 多模态数据融合:为了完成多模态数据融合,首先利用VGG16网络和cifar10数据集完成多输入网络的分类,在VGG16的基础之上,将前三层特征提取网络作为不同输入的特征提取网络,在中间层进行特征拼接,后面的卷积层用于提取融合特征,最后加上全连接层。该网络稍作修改就能同时提取…☆101Sep 25, 2020Updated 5 years ago
- ☆12Oct 13, 2017Updated 8 years ago
- 3D sMRI data classification using PyTorch.☆15Aug 26, 2019Updated 6 years ago
- ☆25Jun 3, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion"☆28Aug 19, 2021Updated 4 years ago
- 在文件虚拟磁盘上实现 FAT 文件系统☆13May 2, 2016Updated 10 years ago
- Detect objects from the image, integrated with FLASK for front-end.☆11Jan 30, 2021Updated 5 years ago
- Fine-Grained Visual Classification on Stanford Cars Dataset☆12Jun 21, 2022Updated 3 years ago
- R package for computing Utterance Emotion Dynamics☆22Jun 13, 2021Updated 4 years ago
- [BMVC 2022 workshop] Greedy Grid Search: A 3D Registration Baseline☆17Jan 16, 2025Updated last year
- This repository contains code and data for "On the Multimodal Person Verification Using Audio-Visual-Thermal Data"☆12Apr 27, 2023Updated 3 years ago
- Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models☆19Aug 17, 2020Updated 5 years ago
- Engaged in research to help improve to boost text sentiment analysis using facial features from video using machine learning.☆32Jan 12, 2018Updated 8 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- autodock is a state machine based auto docking solution for differential-drive robot, allows accurate and reliable docking. Part of Secur…☆15Jun 28, 2025Updated 11 months ago
- Adding semantic segmentation into ORB-SLAM2 to build the point cloud for both background and objects.☆14Oct 27, 2023Updated 2 years ago
- Fatigue Assessment using ECG and Actigraphy Sensors (ISWC 2020)☆16Sep 8, 2020Updated 5 years ago
- Official PyTorch implementation of Multilogue-Net (Best paper runner-up at Challenge-HML @ ACL 2020)☆57Dec 8, 2022Updated 3 years ago
- [KDD'22] Partial Label Learning with Discrimination Augmentation☆10May 21, 2024Updated 2 years ago
- Applying Deep Reinforcement Learning for dialogue generation. aka chatbot☆13Apr 30, 2017Updated 9 years ago
- 多模态视频分类模型☆32Nov 23, 2022Updated 3 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- using nn.Transformer() module to accomplish a machine learning demo.☆13Mar 23, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A smart autonomous drone with Object Tracking and Object Detection capabilities☆17Jul 3, 2022Updated 3 years ago
- Multimodal late fusion for deepfake detection using video and audio data☆12May 7, 2019Updated 7 years ago
- Multi-model analysis of sentiment and emotion in multi-speaker conversations.☆28Jul 6, 2023Updated 2 years ago
- The inference of DINOv2 ONNX models using the ONNXRuntime library.☆21Apr 24, 2025Updated last year
- Real-time visual Simultaneous Localization and Mapping using ORB-SLAM2 for a DJI Tello Drone☆15Apr 9, 2020Updated 6 years ago
- ☆18Feb 25, 2023Updated 3 years ago
- ☆17May 8, 2023Updated 3 years ago