500,000 multimodal short video data and baseline models. 50万条多模态短视频数据集和基线模型(TensorFlow2.0)。
☆136Jul 23, 2019Updated 6 years ago
Alternatives and similar repositories for Multimodal-short-video-dataset-and-baseline-classification-model
Users that are interested in Multimodal-short-video-dataset-and-baseline-classification-model are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Multimodal short video classification task, integrating video, image, audio and text modes for short video classification☆20Mar 12, 2020Updated 6 years ago
- 使用百度Paddle框架进行视频分类算法NeXtVLAD视频分类模型。☆11Jun 1, 2026Updated last week
- 中文领域的多模态Bert☆47Mar 24, 2020Updated 6 years ago
- (2020) Video Classification Neural Network☆30Feb 18, 2020Updated 6 years ago
- 7th place solution to The 3rd YouTube-8M Video Understanding Challenge☆37Oct 18, 2020Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 基于多模态的属性抽取☆45Aug 6, 2020Updated 5 years ago
- 多模态融合情感分析☆140May 15, 2020Updated 6 years ago
- Code repository for Rakuten Data Challenge: Multimodal Product Classification and Retrieval.☆30May 10, 2021Updated 5 years ago
- The Pytorch implementation for "Video-Text Pre-training with Learned Regions"☆43Jul 15, 2022Updated 3 years ago
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et. al. ACL 2018☆275May 31, 2020Updated 6 years ago
- Video classification, youtube8m, Knowledge distillation, Tensorflow, NeXtVLAD☆27Sep 5, 2019Updated 6 years ago
- A fine multimodality fusion network :)☆10Aug 9, 2021Updated 4 years ago
- 多模态视频分类模型☆32Nov 23, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This repo is used for generating faking labeled positive videos for SVD dataset.☆10Aug 16, 2020Updated 5 years ago
- 只有30行的百度图片爬虫,只用最简单的语句☆12Oct 30, 2021Updated 4 years ago
- Awesome Video Coding Papers☆13Feb 19, 2025Updated last year
- ☆15Jan 16, 2024Updated 2 years ago
- Evaluating Visual Fidelity of Image Descriptions☆11Aug 15, 2019Updated 6 years ago
- 人工智能实验五:多模态情感分类☆16Jul 14, 2022Updated 3 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆31Apr 13, 2020Updated 6 years ago
- Code of PhoenixLin(3rd place) in the 2nd Youtube8M Video Understanding Challenge☆208Aug 1, 2019Updated 6 years ago
- Bling's Object detection tool☆55Jan 9, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Multi-Modal Transformer for Video Retrieval☆265Oct 9, 2024Updated last year
- 首届中国心电智能大赛决赛阶段解决方案-公开版 比赛网址 http://mdi.ids.tsinghua.edu.cn/☆10Aug 21, 2019Updated 6 years ago
- Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.☆52Apr 14, 2022Updated 4 years ago
- Multi-modal classifications of digits with image and audio modality. One shot learning with Siamese network is used to predict if the giv…☆16Mar 25, 2023Updated 3 years ago
- ☆15Sep 16, 2021Updated 4 years ago
- Experiments and tool-chains build upon TensorFlow and its quantization tools☆14May 5, 2017Updated 9 years ago
- baseline for MGTV competition 2022 PIR☆11Apr 11, 2022Updated 4 years ago
- 2020厦门国际银行数创金融杯建模大赛-优胜奖方案☆11Feb 2, 2021Updated 5 years ago
- Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks☆306Jan 8, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆260Dec 10, 2022Updated 3 years ago
- MMSA is a unified framework for Multimodal Sentiment Analysis.☆1,023Jan 15, 2025Updated last year
- Implementation of PVANet in Tensorflow☆17Oct 15, 2017Updated 8 years ago
- Dreambooth (LoRA) with well-organized code structure. Naive adaptation from 🤗Diffusers.☆17May 18, 2023Updated 3 years ago
- Implementing BERT + CRF with PyTorch for Chinese NER.☆10Mar 7, 2022Updated 4 years ago
- Word2VisualVec : Predicting Visual Features from Text for Image and Video Caption Retrieval☆70Jan 27, 2020Updated 6 years ago
- 2020 AI研习社 金融用户评论分类☆14May 17, 2020Updated 6 years ago