M-VAD Names Dataset. Multimedia Tools and Applications (2019)
☆24 · Jul 9, 2019 · Updated 6 years ago
Alternatives and similar repositories for mvad-names-dataset
Users that are interested in mvad-names-dataset are comparing it to the libraries listed below.
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions ☆10 · Jan 17, 2021 · Updated 5 years ago
- C/C++ startup template for developing fast immediate GUI using Dear Imgui with GLFW+GLAD ☆11 · Nov 16, 2020 · Updated 5 years ago
- ☆14 · Aug 9, 2018 · Updated 7 years ago
- Poet: Product-oriented Video Captioner for E-commerce ☆12 · Sep 21, 2020 · Updated 5 years ago
- Identity-Aware Multi-Sentence Video Description ☆15 · Jun 12, 2023 · Updated 2 years ago
- implement video caption based on openNMT ☆36 · Apr 19, 2018 · Updated 8 years ago
- Source code for Delving Deeper into the Decoder for Video Captioning ☆39 · Jun 1, 2021 · Updated 4 years ago
- ☆87 · Mar 4, 2024 · Updated 2 years ago
- Code for the ICCV 2011 paper "Semantic contours from inverse detectors" ☆12 · May 15, 2012 · Updated 14 years ago
- Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019 ☆52 · Aug 9, 2020 · Updated 5 years ago
- implementation of TDConvED for video captioning ☆13 · Mar 18, 2020 · Updated 6 years ago
- Siamese networks ☆14 · Apr 25, 2017 · Updated 9 years ago
- Source code of the paper titled *Attentive Visual Semantic Specialized Network for Video Captioning* ☆15 · Apr 6, 2021 · Updated 5 years ago
- Interactive multimedia captioning with Keras ☆16 · Aug 2, 2019 · Updated 6 years ago
- Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy ☆55 · Jul 31, 2021 · Updated 4 years ago
- Python implementation of extraction of several visual features representations from videos ☆23 · Jul 19, 2021 · Updated 4 years ago
- CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021 ☆63 · Feb 7, 2022 · Updated 4 years ago
- Code and database for Jacquot et al. CVPR 2020. Can we decode subtle human activities? ☆12 · Dec 22, 2020 · Updated 5 years ago
- ☆23 · Jan 10, 2019 · Updated 7 years ago
- ☆13 · Aug 23, 2017 · Updated 8 years ago
- ☆14 · Sep 19, 2016 · Updated 9 years ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning ☆25 · Sep 4, 2020 · Updated 5 years ago
- ☆11 · Dec 11, 2023 · Updated 2 years ago
- TensorFlow model for training AdapNet for semantic segmentation ☆14 · Jun 30, 2019 · Updated 6 years ago
- Permutation invariant training in PyTorch ☆13 · Oct 2, 2020 · Updated 5 years ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection ☆33 · Feb 10, 2026 · Updated 3 months ago
- Source code for the CVPR 2017 paper ☆64 · Apr 23, 2018 · Updated 8 years ago
- Code for Characterizing and Forecasting User Engagement with In-App Action Graphs: A Case Study of Snapchat ☆81 · Aug 19, 2021 · Updated 4 years ago
- Code for Oops! Predicting Unintentional Action in Video ☆80 · Apr 13, 2020 · Updated 6 years ago
- Adaptive Reconstruction Network for Weakly Supervised Referring Expression Grounding ☆33 · Aug 29, 2019 · Updated 6 years ago
- ☆35 · Mar 22, 2019 · Updated 7 years ago
- Code for the paper Joint Discovery of Object States and Manipulation Actions, ICCV 2017 ☆14 · Aug 7, 2018 · Updated 7 years ago
- Source code of the paper titled *Improving Video Captioning with Temporal Composition of a Visual-Syntactic Embedding* ☆30 · Apr 16, 2021 · Updated 5 years ago
- Behavioral probing of language acquisition models at the lexical and syntactic level ☆20 · Jul 17, 2023 · Updated 2 years ago
- Condensed Movies Challenge 2021 ☆20 · Sep 21, 2022 · Updated 3 years ago
- Finalist entry for the M2CAI Workflow Challenge 2016 ☆10 · Nov 25, 2016 · Updated 9 years ago
- Flexible, extensible and scalable web-based speech annotation tool ☆14 · Apr 4, 2025 · Updated last year
- Feature Extractor module for videos using the PySlowFast framework ☆80 · Apr 22, 2021 · Updated 5 years ago
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation) ☆73 · Mar 11, 2021 · Updated 5 years ago