yuanxiaosc/Multimodal-short-video-dataset-and-baseline-classification-model

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yuanxiaosc/Multimodal-short-video-dataset-and-baseline-classification-model)

yuanxiaosc / Multimodal-short-video-dataset-and-baseline-classification-model

500,000 multimodal short video data and baseline models. 50万条多模态短视频数据集和基线模型（TensorFlow2.0）。

☆136

Alternatives and similar repositories for Multimodal-short-video-dataset-and-baseline-classification-model

Users that are interested in Multimodal-short-video-dataset-and-baseline-classification-model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sportzhang / paddle_youtube
View on GitHub
使用百度Paddle框架进行视频分类算法NeXtVLAD视频分类模型。
☆10Jun 1, 2026Updated last month
Luka0612 / ChineseVLBert
View on GitHub
中文领域的多模态Bert
☆47Mar 24, 2020Updated 6 years ago
zhangyaoyuan / NextVLAD-Attention-Model
View on GitHub
(2020) Video Classification Neural Network
☆30Feb 18, 2020Updated 6 years ago
shwetabhardwaj44 / EfficientVideoClassification_Youtube8M
View on GitHub
This repository contains code for CVPR 2019 paper "Efficient Video Classification Using Fewer Frames"
☆19Mar 10, 2021Updated 5 years ago
CodeREWorld / Multimodal-Sentiment-Analysis
View on GitHub
多模态融合情感分析
☆140May 15, 2020Updated 6 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
microsoft / UniVL
View on GitHub
An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
☆365Jul 25, 2024Updated last year
depshad / Deep-Learning-Framework-for-Multi-modal-Product-Classification
View on GitHub
Code repository for Rakuten Data Challenge: Multimodal Product Classification and Retrieval.
☆30May 10, 2021Updated 5 years ago
showlab / Region_Learner
View on GitHub
The Pytorch implementation for "Video-Text Pre-training with Learned Regions"
☆43Jul 15, 2022Updated 4 years ago
Justin1904 / Low-rank-Multimodal-Fusion
View on GitHub
This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et. al. ACL 2018
☆275May 31, 2020Updated 6 years ago
ennisnie / IMF-DNN
View on GitHub
A fine multimodality fusion network :)
☆10Aug 9, 2021Updated 4 years ago
tianruochen / MultimodalVideoTag
View on GitHub
多模态视频分类模型
☆32Nov 23, 2022Updated 3 years ago
declare-lab / contextual-utterance-level-multimodal-sentiment-analysis
View on GitHub
Context-Dependent Sentiment Analysis in User-Generated Videos
☆126Mar 14, 2023Updated 3 years ago
svdbase / SVD-transformer
View on GitHub
This repo is used for generating faking labeled positive videos for SVD dataset.
☆10Aug 16, 2020Updated 5 years ago
Twopothead / javhoo_actresses
View on GitHub
crawl profiles of Japanese PornStars from Javhoo.com
☆12Feb 8, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
chenjiashuo123 / TAAC-2021-Task2-Rank6
View on GitHub
2021 腾讯广告赛算法大赛赛道二决赛第六名
☆42Oct 7, 2022Updated 3 years ago
ZiyueWu59 / CCA
View on GitHub
☆15Jan 16, 2024Updated 2 years ago
ImperialNLP / vifidel
View on GitHub
Evaluating Visual Fidelity of Image Descriptions
☆11Aug 15, 2019Updated 6 years ago
catalina17 / XFlow
View on GitHub
Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)
☆31Apr 13, 2020Updated 6 years ago
yujiangpu20 / cma_xdVioDet
View on GitHub
Official code for "Audio-Guided Attention Network for Weakly Supervised Violence Detection" (ICCECE2022).
☆13Mar 25, 2022Updated 4 years ago
chuhaojin / BriVL-BUA-applications
View on GitHub
Bling's Object detection tool
☆55Jan 9, 2023Updated 3 years ago
linrongc / youtube-8m
View on GitHub
Code of PhoenixLin(3rd place) in the 2nd Youtube8M Video Understanding Challenge
☆208Aug 1, 2019Updated 6 years ago
gabeur / mmt
View on GitHub
Multi-Modal Transformer for Video Retrieval
☆265Oct 9, 2024Updated last year
cubenlp / CERRU
View on GitHub
CCL2024 Chinese Essay Rhetoric Recognition and Understanding
☆17Oct 1, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ceshine / yt8m-2019
View on GitHub
7th place solution to The 3rd YouTube-8M Video Understanding Challenge
☆37Oct 18, 2020Updated 5 years ago
soujanyaporia / contextual-multimodal-fusion
View on GitHub
Contextual inter modal attention for multimodal sentiment analysis
☆45Aug 28, 2021Updated 4 years ago
chenjiashuo123 / AIAC-2021-Task1-Rank17
View on GitHub
2021 QQ浏览器ai算法大赛赛道一决赛第17名
☆17Oct 25, 2022Updated 3 years ago
CbGeSky / Pub--1stECG
View on GitHub
首届中国心电智能大赛决赛阶段解决方案-公开版比赛网址 http://mdi.ids.tsinghua.edu.cn/
☆10Aug 21, 2019Updated 6 years ago
IBM / AdaMML
View on GitHub
Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.
☆52Apr 14, 2022Updated 4 years ago
Juhi-Purswani / Multimodal_Meta_Learning_with_Siamese_Network
View on GitHub
Multi-modal classifications of digits with image and audio modality. One shot learning with Siamese network is used to predict if the giv…
☆16Mar 25, 2023Updated 3 years ago
phueb / CHILDES-SRL
View on GitHub
Research code for generating semantic role labels for CHILDES
☆15Mar 24, 2023Updated 3 years ago
kumasento / tensorflow-quantization
View on GitHub
Experiments and tool-chains build upon TensorFlow and its quantization tools
☆14May 5, 2017Updated 9 years ago
willard-yuan / video-text-retrieval-papers
View on GitHub
☆15Sep 16, 2021Updated 4 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
declare-lab / hfusion
View on GitHub
Multimodal sentiment analysis using hierarchical fusion with context modeling
☆44Mar 14, 2023Updated 3 years ago
rayvzn / MGTV_PIR
View on GitHub
baseline for MGTV competition 2022 PIR
☆11Apr 11, 2022Updated 4 years ago
LogicJake / 2020-Xiamen-International-Bank-Financial-Cup
View on GitHub
2020厦门国际银行数创金融杯建模大赛-优胜奖方案
☆11Feb 2, 2021Updated 5 years ago
HouchangX-AI / Multi-label_text_classification
View on GitHub
Baidu 95categories of multi-label test question classification
☆26Apr 8, 2020Updated 6 years ago
X-PLUG / Youku-mPLUG
View on GitHub
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks
☆307Jan 8, 2024Updated 2 years ago
antoine77340 / video_feature_extractor
View on GitHub
Easy to use video deep features extractor
☆322Jul 5, 2020Updated 6 years ago
CryhanFang / CLIP2Video
View on GitHub
☆260Dec 10, 2022Updated 3 years ago