PALMJJ/Multimodal-short-video-classification

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/PALMJJ/Multimodal-short-video-classification)

PALMJJ / Multimodal-short-video-classification

Multimodal short video classification task, integrating video, image, audio and text modes for short video classification

☆20

Alternatives and similar repositories for Multimodal-short-video-classification

Users that are interested in Multimodal-short-video-classification are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yuanxiaosc / Multimodal-short-video-dataset-and-baseline-classification-model
View on GitHub
500,000 multimodal short video data and baseline models. 50万条多模态短视频数据集和基线模型（TensorFlow2.0）。
☆136Jul 23, 2019Updated 6 years ago
sportzhang / paddle_youtube
View on GitHub
使用百度Paddle框架进行视频分类算法NeXtVLAD视频分类模型。
☆11Oct 16, 2019Updated 6 years ago
rhoposit / MultimodalDNN
View on GitHub
☆14Aug 24, 2018Updated 7 years ago
Wangt-CN / Code_CASC
View on GitHub
☆14Oct 14, 2019Updated 6 years ago
adesgautam / clip-search
View on GitHub
A search engine implementation using OpenAI's clip model
☆10Jun 20, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
webYFDT / hateful
View on GitHub
☆11May 18, 2022Updated 4 years ago
larics / UAV-Eagle
View on GitHub
Annotated dataset of quadrotor Eagle for object detection of UAVs
☆15Apr 4, 2022Updated 4 years ago
PeiChunChang / MS-SincResNet
View on GitHub
This paper has been accepted in ACM ICMR 2021.
☆20Nov 17, 2025Updated 6 months ago
VCL3D / UAVA
View on GitHub
A multimodal UAV assistant dataset.
☆11Jun 14, 2021Updated 4 years ago
MRHan-426 / DeepRL-autonomous-exploration
View on GitHub
Autonomous Exploration of mobile robots in unknown environments using Deep Reinforcement learning
☆14Oct 28, 2023Updated 2 years ago
woosual / multiModalityFusionForClassification
View on GitHub
多模态数据融合：为了完成多模态数据融合，首先利用VGG16网络和cifar10数据集完成多输入网络的分类，在VGG16的基础之上，将前三层特征提取网络作为不同输入的特征提取网络，在中间层进行特征拼接，后面的卷积层用于提取融合特征，最后加上全连接层。该网络稍作修改就能同时提取…
☆101Sep 25, 2020Updated 5 years ago
bojone / baidu_dog_classifier
View on GitHub
☆12Oct 13, 2017Updated 8 years ago
FCYtheFreeman / MDD_sMRI_classification_PyTorch
View on GitHub
3D sMRI data classification using PyTorch.
☆15Aug 26, 2019Updated 6 years ago
zengxianyu / jsws-old
View on GitHub
☆25Jun 3, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
yikaiw / AsymFusion
View on GitHub
[ACMMM 2020] Code release for "Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion"
☆28Aug 19, 2021Updated 4 years ago
HelloYym / vdfat16
View on GitHub
在文件虚拟磁盘上实现 FAT 文件系统
☆13May 2, 2016Updated 10 years ago
ViAsmit / Object-Detection-YOLO
View on GitHub
Detect objects from the image, integrated with FLASK for front-end.
☆11Jan 30, 2021Updated 5 years ago
codope / aiforsea-cv-cars
View on GitHub
Fine-Grained Visual Classification on Stanford Cars Dataset
☆12Jun 21, 2022Updated 3 years ago
whipson / edyn
View on GitHub
R package for computing Utterance Emotion Dynamics
☆22Jun 13, 2021Updated 4 years ago
DavidBoja / greedy-grid-search
View on GitHub
[BMVC 2022 workshop] Greedy Grid Search: A 3D Registration Baseline
☆17Jan 16, 2025Updated last year
IS2AI / trimodal_person_verification
View on GitHub
This repository contains code and data for "On the Multimodal Person Verification Using Audio-Visual-Thermal Data"
☆12Apr 27, 2023Updated 3 years ago
VarnithChordia / Multimodal_Classification_Co_Attention
View on GitHub
Multimodal classification solution for the SIGIR eCOM using Co-attention and transformer language models
☆19Aug 17, 2020Updated 5 years ago
roshansridhar / Multimodal-Sentiment-Analysis
View on GitHub
Engaged in research to help improve to boost text sentiment analysis using facial features from video using machine learning.
☆32Jan 12, 2018Updated 8 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
ArghyaChatterjee / autonomous-docking-for-mobile-robots
View on GitHub
autodock is a state machine based auto docking solution for differential-drive robot, allows accurate and reliable docking. Part of Secur…
☆15Jun 28, 2025Updated 11 months ago
haliphinx / Object_ORB_SLAM
View on GitHub
Adding semantic segmentation into ORB-SLAM2 to build the point cloud for both background and objects.
☆14Oct 27, 2023Updated 2 years ago
baiyang4 / Sjogrens_questionnaire
View on GitHub
Fatigue Assessment using ECG and Actigraphy Sensors (ISWC 2020)
☆16Sep 8, 2020Updated 5 years ago
amanshenoy / multilogue-net
View on GitHub
Official PyTorch implementation of Multilogue-Net (Best paper runner-up at Challenge-HML @ ACL 2020)
☆57Dec 8, 2022Updated 3 years ago
wwangwitsel / PLDA
View on GitHub
[KDD'22] Partial Label Learning with Discrimination Augmentation
☆10May 21, 2024Updated 2 years ago
agsarthak / Goal-oriented-Dialogue-Systems
View on GitHub
Applying Deep Reinforcement Learning for dialogue generation. aka chatbot
☆13Apr 30, 2017Updated 9 years ago
tianruochen / MultimodalVideoTag
View on GitHub
多模态视频分类模型
☆32Nov 23, 2022Updated 3 years ago
sayakpaul / Multimodal-Entailment-Baseline
View on GitHub
This repository shows how to implement a basic model for multimodal entailment.
☆10Aug 17, 2021Updated 4 years ago
wulele2 / nn.Transformer
View on GitHub
using nn.Transformer() module to accomplish a machine learning demo.
☆13Mar 23, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Prabhdeep1999 / smart-AI-autonomous-drone
View on GitHub
A smart autonomous drone with Object Tracking and Object Detection capabilities
☆17Jul 3, 2022Updated 3 years ago
nviable / deepfake-blips
View on GitHub
Multimodal late fusion for deepfake detection using video and audio data
☆12May 7, 2019Updated 7 years ago
peymanbateni / multimodal-emotion-analysis-in-conversations
View on GitHub
Multi-model analysis of sentiment and emotion in multi-speaker conversations.
☆28Jul 6, 2023Updated 2 years ago
sefaburakokcu / dinov2_onnx
View on GitHub
The inference of DINOv2 ONNX models using the ONNXRuntime library.
☆21Apr 24, 2025Updated last year
yuxiangdai / TellORB
View on GitHub
Real-time visual Simultaneous Localization and Mapping using ORB-SLAM2 for a DJI Tello Drone
☆15Apr 9, 2020Updated 6 years ago
mrinal054 / teethSeg_sr2f2u-net
View on GitHub
☆18Feb 25, 2023Updated 3 years ago
mocherson / ImageGCN
View on GitHub
☆17May 8, 2023Updated 3 years ago