M-VAD Names Dataset. Multimedia Tools and Applications (2019)
☆24Jul 9, 2019Updated 6 years ago
Alternatives and similar repositories for mvad-names-dataset
Users that are interested in mvad-names-dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Character Grounding and Re-Identification in Story of Videos and Text Descriptions☆10Jan 17, 2021Updated 5 years ago
- C/C++ startup template for developing fast immediate GUI using Dear Imgui with GLFW+GLAD☆11Nov 16, 2020Updated 5 years ago
- implement video caption based on openNMT☆36Apr 19, 2018Updated 8 years ago
- Uncertainty on Asynchronous Time Event Prediction (Spotlight, Neurips 2019)☆20Oct 8, 2020Updated 5 years ago
- ☆87Mar 4, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for the ICCV 2011 paper"Semantic contours from inverse detectors"☆12May 15, 2012Updated 14 years ago
- Code for "A Graph-Based Framework to Bridge Movies and Synopses", ICCV2019☆52Aug 9, 2020Updated 5 years ago
- Transform strings to vectors for neural networks.☆15May 22, 2015Updated 11 years ago
- siamise networks☆14Apr 25, 2017Updated 9 years ago
- ☆12Jan 12, 2016Updated 10 years ago
- Video content description model for generating descriptions for unconstrained videos☆15Jul 5, 2019Updated 6 years ago
- Interactive multimedia captioning with Keras☆16Aug 2, 2019Updated 6 years ago
- Expanded Cross Neighborhood distance based Re-ranking (ECN)☆48May 14, 2020Updated 6 years ago
- Source code for Semantics-Assisted Video Captioning Model Trained with Scheduled Sampling Strategy☆55Jul 31, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021☆62Feb 7, 2022Updated 4 years ago
- Self-supervised Siamese network (SSiam), FG 2019☆27Apr 21, 2023Updated 3 years ago
- ☆20Sep 19, 2019Updated 6 years ago
- Code and database for Jacquot et al. CVPR 2020. Can we decode subtle human activities?☆12Dec 22, 2020Updated 5 years ago
- ☆23Jan 10, 2019Updated 7 years ago
- ☆13Aug 23, 2017Updated 8 years ago
- ☆14Sep 19, 2016Updated 9 years ago
- A PyTorch implementation of the paper Multimodal Transformer with Multiview Visual Representation for Image Captioning☆25Sep 4, 2020Updated 5 years ago
- ☆11Dec 11, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A pytorch implementation of "Robust Facial Landmark Detection by Multi-order Multi-constrained Network"☆13Dec 9, 2020Updated 5 years ago
- ☆33Apr 20, 2018Updated 8 years ago
- [CVPR 2026] FocusUI: Efficient UI Grounding via Position-Preserving Visual Token Selection☆35Jun 7, 2026Updated last week
- 帮助你更快找到B站精彩片段~☆14May 24, 2020Updated 6 years ago
- Source code for the CVPR 2017 paper☆64Apr 23, 2018Updated 8 years ago
- F-16 is a powerful video large language model (LLM) that perceives high-frame-rate videos, which is developed by the Department of Electr…☆37Jul 3, 2025Updated 11 months ago
- DiffUNet☆17Dec 6, 2024Updated last year
- ☆35Mar 22, 2019Updated 7 years ago
- This is a Javascript toolbox to perform online rating studies with auditory material.☆18Nov 18, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for the paper Joint Discovery of Object States and Manipulation Actions, ICCV 2017☆14Aug 7, 2018Updated 7 years ago
- Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks☆33Mar 12, 2020Updated 6 years ago
- A Few-Shot Learning based Approach to Multimodal Social Relation Extraction☆14Jan 17, 2023Updated 3 years ago
- Implementing LRP (Layer-wise Relevance Propagation) for a sequence-to-sequence model with GRU layers.☆12Sep 8, 2023Updated 2 years ago
- Source code for paper "Towards Automatic Learning of Procedures from Web Instructional Videos"☆34Jan 6, 2019Updated 7 years ago
- Finalist entry for the M2CAI Workflow Challenge 2016☆10Nov 25, 2016Updated 9 years ago
- Flexible, extensible and scalable web-based speech annotation tool☆14Apr 4, 2025Updated last year