A dataset of 500,000 multimodal short videos and baseline models (TensorFlow 2.0).
☆136Jul 23, 2019Updated 6 years ago
Alternatives and similar repositories for Multimodal-short-video-dataset-and-baseline-classification-model
Users that are interested in Multimodal-short-video-dataset-and-baseline-classification-model are comparing it to the libraries listed below.
- Multimodal short video classification task, integrating video, image, audio and text modalities for short video classification☆20Mar 12, 2020Updated 6 years ago
- NeXtVLAD video classification model implemented with the Baidu Paddle framework.☆11Oct 16, 2019Updated 6 years ago
- Multimodal BERT for the Chinese domain☆47Mar 24, 2020Updated 6 years ago
- (2020) Video Classification Neural Network☆30Feb 18, 2020Updated 6 years ago
- 7th place solution to The 3rd YouTube-8M Video Understanding Challenge☆37Oct 18, 2020Updated 5 years ago
- Multimodal fusion sentiment analysis☆140May 15, 2020Updated 5 years ago
- An official implementation for "UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"☆365Jul 25, 2024Updated last year
- Code repository for Rakuten Data Challenge: Multimodal Product Classification and Retrieval.☆29May 10, 2021Updated 4 years ago
- Deep Learning for Video Retrieval by Natural Language☆11Oct 20, 2019Updated 6 years ago
- This is the repository for "Efficient Low-rank Multimodal Fusion with Modality-Specific Factors", Liu and Shen, et al., ACL 2018☆275May 31, 2020Updated 5 years ago
- Video classification, youtube8m, Knowledge distillation, Tensorflow, NeXtVLAD☆27Sep 5, 2019Updated 6 years ago
- A fine multimodality fusion network :)☆10Aug 9, 2021Updated 4 years ago
- Multimodal video classification model☆32Nov 23, 2022Updated 3 years ago
- Implementation of NAACL'19 Strong and Simple Baselines for Multimodal Utterance Embeddings☆10Jun 4, 2019Updated 6 years ago
- Awesome Video Coding Papers☆13Feb 19, 2025Updated last year
- 2021 Tencent Advertising Algorithm Competition, Track 2, 6th place in the finals☆42Oct 7, 2022Updated 3 years ago
- ☆15Jan 16, 2024Updated 2 years ago
- Artificial Intelligence Lab 5: multimodal sentiment classification☆16Jul 14, 2022Updated 3 years ago
- Generalized cross-modal NNs; new audiovisual benchmark (IEEE TNNLS 2019)☆31Apr 13, 2020Updated 6 years ago
- Official code for "Audio-Guided Attention Network for Weakly Supervised Violence Detection" (ICCECE2022).☆13Mar 25, 2022Updated 4 years ago
- Bling's Object detection tool☆56Jan 9, 2023Updated 3 years ago
- Code of PhoenixLin (3rd place) in the 2nd YouTube-8M Video Understanding Challenge☆208Aug 1, 2019Updated 6 years ago
- Contextual inter modal attention for multimodal sentiment analysis☆45Aug 28, 2021Updated 4 years ago
- Final-stage solution for the first China ECG Intelligence Competition (public version). Competition website: http://mdi.ids.tsinghua.edu.cn/☆10Aug 21, 2019Updated 6 years ago
- Official implementation of AdaMML. https://arxiv.org/abs/2105.05165.☆51Apr 14, 2022Updated 4 years ago
- Multi-modal classifications of digits with image and audio modality. One shot learning with Siamese network is used to predict if the giv…☆15Mar 25, 2023Updated 3 years ago
- Pytorch port of Google Research's VGGish model used for extracting audio features.☆410Nov 3, 2021Updated 4 years ago
- Multimodal sentiment analysis using hierarchical fusion with context modeling☆44Mar 14, 2023Updated 3 years ago
- baseline for MGTV competition 2022 PIR☆11Apr 11, 2022Updated 4 years ago
- Stanford CS230 Win2018 project☆29Jul 2, 2022Updated 3 years ago
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆22Nov 4, 2020Updated 5 years ago
- Baidu multi-label test question classification with 95 categories☆26Apr 8, 2020Updated 6 years ago
- Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Pre-training Dataset and Benchmarks☆306Jan 8, 2024Updated 2 years ago
- ☆259Dec 10, 2022Updated 3 years ago
- Easy to use video deep features extractor☆322Jul 5, 2020Updated 5 years ago
- MMSA is a unified framework for Multimodal Sentiment Analysis.☆1,014Jan 15, 2025Updated last year
- KDD 2024 AQA competition 2nd place solution☆12Jul 21, 2024Updated last year
- Implementation of PVANet in Tensorflow☆17Oct 15, 2017Updated 8 years ago
- Short video content understanding and recommendation competition☆84Apr 5, 2020Updated 6 years ago